Aarush Sah @aarush X Profile

Aarush Sah

@aarush

Followers

8K

Following

12K

Media

367

Statuses

3K

@NVIDIA | prev. Head of Evals @GroqInc | Creator of OpenBench

https://t.co/vQSrADuzZm

South Bay

Joined September 2022

Aarush Sah

@aarush

4 days

Joined @NVIDIA today! Time to cook

788

285

12K

Sam Altman

@sama

1 year

you can just do things

2K

3K

25K

Aarush Sah

@aarush

16 days

Christmas came early :)

14

1

224

Aarush Sah

@aarush

19 days

I haven’t even been out and about and I’m feeling sick - what is going around SF right now

1

19

Aarush Sah

@aarush

24 days

Powered by @GroqInc 🫡

Theo - t3.gg

@theo

24 days

And finally, my favorite change. The default model is now Kimi K2. I love how it writes. I love talking to it. I've found it to be a significantly better chat model than anything else I've tried. It's so good that I'm scared it will hurt conversion to paid tiers.

0

36

Aarush Sah

@aarush

27 days

groq wrapped

6

0

21

Aarush Sah

@aarush

28 days

👀

Hatice Ozen

@ozenhati

28 days

we've been heads down @groqinc and you'll see why soon

1

0

26

Aarush Sah

@aarush

29 days

Anecdotally, GPT-5.2 seems to hallucinate a lot more than GPT-5.1 and GPT-5 when used as a chat model. Has anyone else noticed this?

1

0

9

Jason Zhang

@minisounds

29 days

(1/3) developing good intuition and "feel" for concepts in ai (architectures, theory, etc) is crucial in order to be productive, but not many talk about how to build it. wrote a quick read on my <30 min process for building robust intuition, quickly:

1

5

21

Aarush Sah

@aarush

29 days

Great blog post from my good friend @minisounds on learning intuition with AI. Would strongly recommend reading!

1

0

6

Aarush Sah

@aarush

1 month

Tomorrow morning - openbench 0.5.3 :)

Toven

@pingToven

1 month

chat, help me convince @aarush to cut an openbench release after he's back from neurips pls

1

0

15

Aarush Sah

@aarush

1 month

Congrats @LandoNorris on the WDC 🎉 🎉🎉

Groq Inc

@GroqInc

1 month

Mega congrats to @LandoNorris, 2025 Drivers' World Champion! 🧡🏆 @McLarenF1

1

0

24

Aarush Sah

@aarush

1 month

I’m at the Groq booth at NeurIPS! Swing by and say hi to the team - we’re right by Google

6

3

87

Aarush Sah

@aarush

1 month

Another day of @GuillaumeLample being at NeurIPS 2024

0

11

Aarush Sah

@aarush

1 month

Three years ago today, our lives changed more than we could possibly imagine!

Sam Altman

@sama

3 years

today we launched ChatGPT. try talking with it here: https://t.co/uWra8LKFMN

1

0

11

Aarush Sah

@aarush

2 months

Someone's benchmarking GLM-4.6 through @OpenRouterAI with openbench 👀

0

2

24

Aarush Sah

@aarush

2 months

Orange is the new black. @GroqInc 🤝 @McLarenF1

5

1

78

Shaunak Joshi

@shaunakjoshi

2 months

Want to influence AI development? Build evals, not models. How:
• Find questions frontier models struggle with (<70% accuracy)
• Test GPT-5, Claude, Qwen, DeepSeek, etc.
• Open source the dataset
• Write up your findings
Labs actively track and optimize for public