Sayash Kapoor @sayashk X Profile

Sayash Kapoor

@sayashk

Followers

10K

Following

4K

Media

159

Statuses

1K

CS PhD candidate @PrincetonCITP and senior fellow at @Mozilla. I tweet about agents, evaluation, reproducibility, AI for science. Book: https://t.co/tb2lXSP2gB

Princeton

Joined March 2015

Don't wanna be here? Send us removal request.

Sayash Kapoor

@sayashk

4 days

The mainstream view of AI for science says AI will rapidly accelerate science, and that we're on track to cure cancer, double the human lifespan, colonize space, and achieve a century of progress in the next decade. In a new AI Snake Oil essay, @random_walker and I argue that

13

61

228

Sayash Kapoor

@sayashk

10 hours

RT @evijitghosh: New blog post alert! 🚨"What is the Hugging Face Community Building?", with @YJernite and @IreneSolaiman . The AI narrative….

0

4

0

Sayash Kapoor

@sayashk

3 days

RT @random_walker: If we compared AI capabilities against humans with no access to tools, such as the internet, we would probably find that….

0

27

0

Sayash Kapoor

@sayashk

4 days

RT @RishiBommasani: In the running for my favorite blog post from Sayash and Arvind!. When people ask me for areas I am most excited about….

0

4

0

Sayash Kapoor

@sayashk

4 days

@random_walker Per-capita research slowing down is one thing, but by many metrics even aggregate research is slowing down or constant. (We summarize these in this table.) . We don’t think this is inevitable, and there are many interventions worth considering.

Alon Albalak

@AlbalakAlon

4 days

@sayashk @random_walker I wholeheartedly agree with the sentiment of this post! However, if the number of papers has increased 500 fold, and the average disruption of a paper has decreased 10 fold, doesn't that still suggest that 50 times more disruptive discoveries have been made?.

0

3

12

Sayash Kapoor

@sayashk

4 days

@random_walker Link to essay:

aisnakeoil.com

Confronting the production-progress paradox

1

4

17

Sayash Kapoor

@sayashk

4 days

RT @random_walker: We ourselves are enthusiastic users of AI in our scientific workflows. On a day-to-day basis, it all feels very exciting….

0

12

0

Sayash Kapoor

@sayashk

4 days

RT @random_walker: Some aspects of AI discourse seem to come from a different planet, oblivious to basic realities on Earth. AI for science….

0

74

0

Sayash Kapoor

@sayashk

6 days

RT @kennylpeng: Are LLMs correlated when they make mistakes? In our new ICML paper, we answer this question using responses of >350 LLMs. W….

0

46

0

Sayash Kapoor

@sayashk

11 days

After we invented the dynamo, it took us 40 years to electrify factories. In the process, we had to redesign the entire factory layout — electrifying existing factories didn't cut it. Software engineering will likewise need to undergo drastic changes to truly benefit from AI.

METR

@METR_Evals

11 days

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.

14

33

194

Sayash Kapoor

@sayashk

11 days

RT @METR_Evals: We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The resu….

0

1K

0

Sayash Kapoor

@sayashk

11 days

RT @snewmanpv: How much time do AI coding tools save? @METR_Evals just released a rigorous study with a startling result: developers take 1….

0

11

0

Sayash Kapoor

@sayashk

12 days

RT @CarnegieIndia: 🎙️ New #InterpretingIndia episode!. @NidhiSinghLive joins @sayashk to explore the hype, hope, and hazards of artificial….

0

2

0

Sayash Kapoor

@sayashk

13 days

RT @daniel_d_kang: As AI agents near real-world use, how do we know what they can actually do? Reliable benchmarks are critical but agentic….

0

29

0

Sayash Kapoor

@sayashk

18 days

RT @jordanmcgillis: AI tools can detect truck driver fatigue and prevent deadly crashes. But the Teamsters are blocking their rollout. M….

0

79

0

Sayash Kapoor

@sayashk

21 days

RT @random_walker: When coding with agents, my ideal GUI for context engineering would look like this. Key features:.* Visually pick, resiz….

0

9

0

Sayash Kapoor

@sayashk

1 month

RT @random_walker: The origin story of “AI as Normal Technology”, and lessons learned. Many people have asked how the “AI as Normal Technol….

0

21

0

Sayash Kapoor

@sayashk

1 month

RT @random_walker: A post by Stripe engineer @thegautam on building a successful payments foundation model for fraud detection recently wen….

0

18

0