Dhruv Saini Profile
Dhruv Saini

@oofbaroomf

Followers
41
Following
25K
Media
20
Statuses
69

built promptmaxx: https://t.co/MlUXPeuF8J

Joined April 2024
Don't wanna be here? Send us removal request.
@oofbaroomf
Dhruv Saini
25 days
just made PromptMaxx - it's for programming alongside @cline with Qwen Coder on.@CerebrasSystems when you need stuff thats only available on web LLM chat apps. it uses a fast model to quickly apply pasted edits to your files and gathers llm-friendly context to copy. link in bio
5
1
32
@oofbaroomf
Dhruv Saini
15 days
i think this metr benchmark may end similarly.
0
0
2
@grok
Grok
1 day
Join millions who have switched to Grok.
97
179
1K
@oofbaroomf
Dhruv Saini
15 days
arc-agi didn't flex the fact that there was no earlier progress on their benchmark, they flexed the fact that initial scores were low, and we already saw them fail (o3 is not agi despite getting 87.5% on arg-agi-1).
1
0
3
@oofbaroomf
Dhruv Saini
15 days
most benchmarks seem to be becoming solved fairly quickly, because the people who make them are optimizing for lower initial scores rather than slow growth.
4
0
2
@oofbaroomf
Dhruv Saini
15 days
3. we see exponential growth on benchmarks right after their releases because they are designed to start out low. this was released in late 2024.
2
0
2
@oofbaroomf
Dhruv Saini
15 days
2. we are only seeing part of the y-axis. 2->2.5 hours seems like a lot, but 25->31% doesn't seem like a lot (i believe this goes up to 8 hours).
1
0
2
@oofbaroomf
Dhruv Saini
15 days
1. models have been releasing more and more frequently. so each model release might not be as big a jump, but when time is the x axis its much faster.
1
0
2
@oofbaroomf
Dhruv Saini
15 days
why does it seem like ai progress is plateauing, yet we have graphs like this? i think it's because of three reasons. (1/4)
Tweet media one
1
0
3
@oofbaroomf
Dhruv Saini
15 days
awwww
Tweet media one
1
0
3
@oofbaroomf
Dhruv Saini
21 days
the destruction of the death star showed how sometimes scaling up doesn't solve all the problems.
@sama
Sam Altman
22 days
Tweet media one
2
0
5
@oofbaroomf
Dhruv Saini
21 days
?????????? i don't even know what to say
Tweet media one
0
0
3
@oofbaroomf
Dhruv Saini
21 days
oh no
Tweet media one
0
0
2
@oofbaroomf
Dhruv Saini
21 days
apple-level innovation
Tweet media one
0
0
3
@oofbaroomf
Dhruv Saini
21 days
gpt-5 is probably a giant model. the tps is soo slow.
0
0
0
@oofbaroomf
Dhruv Saini
21 days
they're attacking mistral
Tweet media one
0
0
2
@oofbaroomf
Dhruv Saini
21 days
sloPT-5
Tweet media one
0
0
3
@oofbaroomf
Dhruv Saini
21 days
sounds like openai just scaled their existing paradigm + hybrid thinking - doesn't seem like they cracked some crazy code or anything like anthropic had since 3.5 sonnet.
0
0
3
@oofbaroomf
Dhruv Saini
21 days
*on swebench.
0
0
0
@oofbaroomf
Dhruv Saini
21 days
seiously? only .4 more than opus???
Tweet media one
1
0
3
@oofbaroomf
Dhruv Saini
1 month
why do people keeping saying objective for things that are extremely subjective is it supposed to be funny or something.
1
0
3