Matt Groh

@mattgroh

Followers
2K
Following
10K
Media
278
Statuses
2K

Assistant professor @NorthwesternU @KelloggSchool | PhD @MIT @medialab | human AI collaboration | computational social science | cognitive science

Cambridge, MA
Joined May 2012
@mattgroh
Matt Groh
10 months
🚨 New paper in @NatureComms 🚨 We created deepfakes of the current & former @POTUS giving speeches (w/ voices from voice actors & @elevenlabsio) to study what drives how well people can tell fake speeches from real ones. Time to update the "Seeing is Believing" narrative 👇
6
81
314
@mattgroh
Matt Groh
4 days
How should we measure the value of an explanation? First, we need a goal for what the explanation should do. Then, we need to evaluate how the explanation moves a decision maker towards that goal. Must read for thinking about human-AI collaboration.
@JessicaHullman
Jessica Hullman
4 days
Explainable AI has long frustrated me by lacking a clear theory of what explanations should do. Improve use of a model for what? How? Given a task what's max effect explanation can have? It's complicated bc most methods are functions of features & prediction but not true state 1/
0
0
2
@mattgroh
Matt Groh
4 days
The human touch matters a great deal for empathic support! Awesome new research to check out.
@MatanRubin1
Matan Rubin
5 days
🚨 New paper alert! So many people ask AI for emotional support, but is it like support from a human? Our new paper published in @NatureHumBehav explores whether people value #AI-generated #empathy as much as human empathy, in 9 preregistered studies with 6,282 participants. 🧵
1
0
5
@mattgroh
Matt Groh
5 days
RLCF: Reinforcement Learning with Community Feedback. This is an awesome Human-AI Collaboration research agenda.
@bakkermichiel
Michiel Bakker
5 days
🚨🚨 Excited to share a new paper led by @Li_Haiwen_ with the @CommunityNotes team! LLMs will reshape the information ecosystem. Community Notes offers a promising model for keeping human judgment central but it's an open question how to best integrate LLMs. Thread👇
1
2
6
@mattgroh
Matt Groh
6 days
RT @Diyi_Yang: Our study led by @ChengleiSi reveals an “ideation–execution gap” 😲. Ideas from LLMs may sound novel, but when experts spend….
0
25
0
@mattgroh
Matt Groh
10 days
@aakriti1kumar
Aakriti Kumar
19 days
How do we reliably judge if AI companions are performing well on subjective, context-dependent, and deeply human tasks? 🤖 Excited to share the first paper from my postdoc (!!) investigating when LLMs are reliable judges, with empathic communication as a case study 🧐 🧵👇
0
0
1
@mattgroh
Matt Groh
10 days
What is Claude's EQ?
Awesome look into affective convos w/ LLMs. Highlights:
- 3% of all convos are affective
- Claude pushes back 10% of time
- People on avg express more positivity over course of convo
Related q: How well do LLMs recognize effective empathic support? See 👇
@AnthropicAI
Anthropic
10 days
New Anthropic Research: How people use Claude for emotional support. From millions of anonymized conversations, we studied how adults use AI for emotional and personal needs—from navigating loneliness and relationships to asking existential questions.
2
2
9
@mattgroh
Matt Groh
12 days
"My survival as an artist will depend on whether I’ll be able to offer something that A.I. can’t: drawings that are as powerful as a birthday doodle from a child.". Epic visual story by @abstractsundayby in NYT Magazine
0
0
3
@mattgroh
Matt Groh
14 days
Awesome work showing the metaphors the general public uses to describe AI. And interesting to juxtapose this with the popular metaphors described by Melanie Mitchell.
@chengmyra1
Myra Cheng
2 months
How does the public conceptualize AI? Rather than self-reported measures, we use metaphors to understand the nuance and complexity of people’s mental models. In our #FAccT2025 paper, we analyzed 12,000 metaphors collected over 12 months to track shifts in public perceptions.
0
0
9
@mattgroh
Matt Groh
19 days
When are LLMs-as-judge reliable? That's a big question for frontier labs and it's a big question for computational social science. Excited to share our findings (led by @aakriti1kumar!) on how to address this question for any subjective task & specifically for empathic comms.
@aakriti1kumar
Aakriti Kumar
19 days
How do we reliably judge if AI companions are performing well on subjective, context-dependent, and deeply human tasks? 🤖 Excited to share the first paper from my postdoc (!!) investigating when LLMs are reliable judges, with empathic communication as a case study 🧐 🧵👇
0
6
31
@mattgroh
Matt Groh
20 days
Awesome tutorial!
@HsseinMzannar
Hussein Mozannar
21 days
Curious how to build agents that can control a browser? I just wrote up a full tutorial on how to do it completely from scratch and with Magentic-UI. My goal is to demystify browser-use and CUA agents; it's fun to follow along! Link: Jupyter notebook:
0
0
1
@mattgroh
Matt Groh
1 month
Veo3 represents a paradigm shift in AI capabilities for realistic media that tells provocative, fabricated stories. This video + thread offer a quick tutorial of the capabilities and limitations (like malformed text & character consistency) that are easy to creatively bypass.
@henkvaness
𝚑𝚎𝚗𝚔 𝚟𝚊𝚗 𝚎𝚜𝚜
1 month
I decided to use my lunch time to show you how easy it is to make a fake news story in 30 minutes with Veo3 (I didn't try to perfect it). First: the footage. A mayor comes with a crazy idea and people hate it: (1/10) #verification #ai
0
1
6
@mattgroh
Matt Groh
2 months
RT @talboger: Looking at Van Gogh’s Starry Night, we see not only its content (a French village beneath a night sky) but also its *style*.….
0
12
0
@mattgroh
Matt Groh
2 months
RT @NICOatNU: This Wednesday, NICO is thrilled to once again host Lightning Talks! This term we are lucky to have three amazing researchers….
0
1
0
@mattgroh
Matt Groh
2 months
Awesome write-up in Kellogg Insight on our paper published at #CHI2025 this week!
0
2
10
@mattgroh
Matt Groh
2 months
RT @n3gRain: I’m at #chi2025 and today I’m presenting our paper on characterizing photorealism and artifacts in diffusion model-generated i….
0
5
0
@mattgroh
Matt Groh
2 months
RT @JessicaHullman: Decision studies appear in HCI, vis, & AI/ML, but how “good decision” is defined is often ad-hoc. My #CHI2025 talk to….
0
16
0
@mattgroh
Matt Groh
2 months
If you're curious about learning more, say hi to @n3gRain at #CHI2025 and see links below. Awesome collaboration with @frogspitsimulat, @aakriti1kumar, @chatzimparmpas, @JessicaHullman. Video: Preprint: CHI:
0
2
5
@mattgroh
Matt Groh
2 months
This taxonomy offers a shared language (see our how-to guide on arXiv for many examples) to help people better communicate what looks or feels off. It's also a framework that can generalize to multimedia. Consider this: what do you notice at the 16s mark about her legs?
1
0
1
@mattgroh
Matt Groh
2 months
Based on generating thousands of images, reading the AI-generated images and digital forensics literatures (and social media and journalistic commentary), and analyzing 30k+ participant comments, we propose a taxonomy for characterizing diffusion model artifacts in images.
1
0
2
@mattgroh
Matt Groh
2 months
Scene complexity, artifact types, display time, and human curation of AI-generated images play significant roles in how accurately people distinguish real and AI-generated images.
1
0
2