aarush Profile Banner
Aarush Sah Profile
Aarush Sah

@aarush

Followers
8K
Following
12K
Media
367
Statuses
3K

@NVIDIA | prev. Head of Evals @GroqInc | Creator of OpenBench

South Bay
Joined September 2022
Don't wanna be here? Send us removal request.
@aarush
Aarush Sah
4 days
Joined @NVIDIA today! Time to cook
788
285
12K
@sama
Sam Altman
1 year
you can just do things
2K
3K
25K
@aarush
Aarush Sah
16 days
Christmas came early :)
14
1
224
@aarush
Aarush Sah
19 days
I haven’t even been out and about and I’m feeling sick - what is going around SF right now
1
1
19
@aarush
Aarush Sah
24 days
Powered by @GroqInc 🫑
@theo
Theo - t3.gg
24 days
And finally, my favorite change. The default model is now Kimi K2. I love how it writes. I love talking to it. I've found it to be a significantly better chat model than anything else I've tried. It's so good that I'm scared it will hurt conversion to paid tiers.
0
0
36
@aarush
Aarush Sah
27 days
groq wrapped
6
0
21
@aarush
Aarush Sah
28 days
πŸ‘€
@ozenhati
Hatice Ozen
28 days
we've been heads down @groqinc and you'll see why soon
1
0
26
@aarush
Aarush Sah
29 days
Anecdotally, GPT-5.2 seems to hallucinate a lot more than GPT-5.1 and GPT-5 when used as a chat model. Has anyone else noticed this?
1
0
9
@minisounds
Jason Zhang
29 days
(1/3) developing good intuition and "feel" for concepts in ai (architectures, theory, etc) is crucial in order to be productive, but not many talk about how to build it. wrote a quick read on my <30 min process for building robust intuition, quickly:
1
5
21
@aarush
Aarush Sah
29 days
Great blog post from my good friend @minisounds on learning intuition with AI. Would strongly recommend reading!
@minisounds
Jason Zhang
29 days
(1/3) developing good intuition and "feel" for concepts in ai (architectures, theory, etc) is crucial in order to be productive, but not many talk about how to build it. wrote a quick read on my <30 min process for building robust intuition, quickly:
1
0
6
@aarush
Aarush Sah
1 month
Tomorrow morning - openbench 0.5.3 :)
@pingToven
Toven
1 month
chat, help me convince @aarush to cut an openbench release after he's back from neurips pls
1
0
15
@aarush
Aarush Sah
1 month
Congrats @LandoNorris on the WDC πŸŽ‰ πŸŽ‰πŸŽ‰
@GroqInc
Groq Inc
1 month
Mega congrats to @LandoNorris, 2025 Drivers' World Champion! πŸ§‘πŸ† @McLarenF1
1
0
24
@aarush
Aarush Sah
1 month
I’m at the Groq booth at NeurIPS! Swing by and say hi to the team - we’re right by Google
6
3
87
@aarush
Aarush Sah
1 month
Another day of @GuillaumeLample being at NeurIPS 2024
0
0
11
@aarush
Aarush Sah
1 month
Three years ago today, our lives changed more than we could possibly imagine!
@sama
Sam Altman
3 years
today we launched ChatGPT. try talking with it here: https://t.co/uWra8LKFMN
1
0
11
@aarush
Aarush Sah
2 months
Someone's benchmarking GLM-4.6 through @OpenRouterAI with openbench πŸ‘€
0
2
24
@aarush
Aarush Sah
2 months
Orange is the new black. @GroqInc 🀝 @McLarenF1
5
1
78
@shaunakjoshi
Shaunak Joshi
2 months
Want to influence AI development? Build evals, not models. How: β€’ Find questions frontier models struggle with (<70% accuracy) β€’ Test GPT-5, Claude, Qwen, DeepSeek, etc. β€’ Open source the dataset β€’ Write up your findings Labs actively track and optimize for public
Tweet card summary image
openbench.dev
Provider-agnostic, open-source evaluation infrastructure for language models
1
1
7
@aarush
Aarush Sah
2 months
GPT-5.1 is in ChatGPT πŸ‘€
1
0
7
@aarush
Aarush Sah
2 months
I wonder how much economic value is lost due to the Caltrain having spotty WiFi
3
0
31