Jakub Macina @dmacjam X Profile

Jakub Macina

@dmacjam

Followers

217

Following

190

Media

15

Statuses

104

AI/ML Scientist, mountain biker

Zurich

Joined June 2013

Don't wanna be here? Send us removal request.

Jakub Macina

@dmacjam

28 days

Paper:

0

Jakub Macina

@dmacjam

28 days

TutorRL-7B-think: TutorRL-7B: Github:

1

0

2

Jakub Macina

@dmacjam

28 days

AI alignment for tutoring🎓 We use full online RL with conversation-level rewards—not just single-turn signals like DPO. Did the student actually learn by the end?.Using GRPO, the model learns real teaching strategies like when to hint or when to correct. Explore models below⤵️.

Rohan Paul

@rohanpaul_ai

1 month

This paper introduces an online reinforcement learning framework using simulated student-tutor interactions. It trains LLMs to prioritize guiding students pedagogically instead of simply revealing solutions, aligning models with better teaching methods. This helps students

1

3

12

Jakub Macina

@dmacjam

4 months

🔥 Try it now!.Run MathTutorBench locally with your own models or submit them to our leaderboard. Open-source! 👉@ndaheim_ @idohakimi @ Manu Kapur @IGurevych @mrinmayasachan @ETH_AI_Center.

0

2

Jakub Macina

@dmacjam

4 months

🤔 𝐌𝐨𝐫𝐞 𝐤𝐧𝐨𝐰𝐥𝐞𝐝𝐠𝐞 ≠ 𝐛𝐞𝐭𝐭𝐞𝐫 𝐭𝐞𝐚𝐜𝐡𝐢𝐧𝐠?.Subject expertise does not always correlate with effective teaching; instead, pedagogy and subject knowledge may present a trade-off.

0

3

Jakub Macina

@dmacjam

4 months

🎯 How do we measure teaching quality?.We train a reward model that scores open-ended teacher responses and accurately distinguishes expert-level from novice teaching.

0

1

Jakub Macina

@dmacjam

4 months

📚 Teaching is more than just knowing the answer. Our benchmark goes beyond testing solving ability, evaluating three essential teaching skills (expertise, student understanding, pedagogical ability) across seven diverse tasks using a curated collection of datasets and metrics.

0

1

Jakub Macina

@dmacjam

4 months

🚀 𝐇𝐨𝐰 𝐰𝐞𝐥𝐥 𝐜𝐚𝐧 𝐋𝐋𝐌𝐬 𝐭𝐞𝐚𝐜𝐡?.Evaluating LLMs for education is key to making real progress, yet we lack a reliable and simple benchmark. Introducing 𝐌𝐚𝐭𝐡𝐓𝐮𝐭𝐨𝐫𝐁𝐞𝐧𝐜𝐡—an open-source benchmark designed to assess holistic tutoring capabilities in AI.

4

3

8

Jakub Macina

@dmacjam

8 months

I've been recognized as an Outstanding Reviewer for #EMNLP2024 ! 🚀🚀Contributing to the community is always rewarding and every review is an opportunity to learn and grow.

EMNLP 2025

@emnlpmeeting

8 months

We're kicking off the awards session at #EMNLP2024 by announcing our (many) **Outstanding Reviewers**!

0

13

Jakub Macina

@dmacjam

8 months

Mistakes are key learning opportunities!🧑‍🎓 Can LLMs help students learn from them through dialog? 💬 While they often struggle to diagnose student errors when generating responses directly, adding a verification step ✅ could make a difference. #EMNLP2024.

UKP Lab

@UKPLab

8 months

𝗖𝗮𝗻 𝗟𝗟𝗠𝘀 𝗵𝗲𝗹𝗽 𝘀𝘁𝘂𝗱𝗲𝗻𝘁𝘀 𝗹𝗲𝗮𝗿𝗻 𝗳𝗿𝗼𝗺 𝗺𝗶𝘀𝘁𝗮𝗸𝗲𝘀?.Models struggle to spot student errors, but a verification step could help. More below!. 🧵(1/9) #EMNLP2024. 📰

0

3

18

Jakub Macina

@dmacjam

11 months

Chat with Junling about our work of generating and evaluating the quality of multi-turn teacher-student conversations grounded in textbooks. #ACL2024 #ACL2024NLP

Junling Wang

@JunlingWang1999

11 months

Excited to share that our paper, "Book2Dial: Generating Teacher-Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots," has been accepted at ACL 2024 Findings! . We will go to Bangkok to attend ACL 2024!.

1

0

7

Jakub Macina

@dmacjam

2 years

Excited to be at #EMNLP2023 in Singapore to present our paper.

UKP Lab

@UKPLab

2 years

We are happy to announce the release of 🧮 MathDial, a dataset of one-to-one teacher-student tutoring dialogues grounded in multi-step math reasoning problems. Discover more about our latest #EMNLP2023 Findings paper in this 🧵 (1/8). #dialogue #NLProcessing #MathWordProblems

0

1

20

Jakub Macina

@dmacjam

2 years

Consider joining fellowship programs at ETH AI Center.

ETH AI Center

@ETH_AI_Center

2 years

⏰ Deadline Approaching! . 🚀Apply by Nov 22 to become an ETH AI Center #PhD or #Postdoc! . We're hosting an EXTRA Online Q&A Session.📅Monday, Nov 20, 16:15 - 17:00 CET. Zoom link on our website 🔗 Don't miss this chance to get the info you need!

0

2

Jakub Macina

@dmacjam

2 years

So many interesting ideas and discussions at #LaunchAIXSummit2023 @ETH_AI_Center.

Julia Chatain

@JuliaChatain

2 years

Had a great time at the X+AI summit last week!.Many exciting and inspiring questions, looking forward to the next steps. Thanks @dmacjam for the invitation and organization!

0

9

Jakub Macina

@dmacjam

2 years

RT @arkrause: Doctoral and Postdoc Fellowships at the @ETH_AI_Center! Applications accepted until November 22 2023. .

0

26

0

Jakub Macina

@dmacjam

2 years

Interested in AI for Education and how LLMs to improve education? Come and join us at the #LaunchAIXSummit2023 workshop tomorrow (10:30)! .@ETH_AI_Center.

2

3

16

Jakub Macina

@dmacjam

2 years

LLMs get better at reasoning, but can they act as a tutors? Turns out, they're quick to spill out the answers.🤖 Check out the new 🧮MathDial dataset, built with input from teachers and roleplaying students.

0

5

18

Jakub Macina

@dmacjam

2 years

Link to the preprint:

0

Jakub Macina

@dmacjam

2 years

RT @wangchunshu: Tired of the output length limit of ChatGPT? Try RecurrentGPT, a language simulacra of the recurrence mechanism in RNNs. Y….

0

41

0

Jakub Macina

@dmacjam

2 years

RT @ETH_AI_Center: Celebrating Jakub Macina's remarkable achievement as he secures a coveted spot on Forbes' 30 Under 30 list in Science an….

0

3

0