Dhruv Saini @oofbaroomf X Profile

Dhruv Saini

@oofbaroomf

Followers

41

Following

25K

Media

20

Statuses

69

built promptmaxx: https://t.co/MlUXPeuF8J

Joined April 2024

Don't wanna be here? Send us removal request.

Dhruv Saini

@oofbaroomf

25 days

just made PromptMaxx - it's for programming alongside @cline with Qwen Coder on.@CerebrasSystems when you need stuff thats only available on web LLM chat apps. it uses a fast model to quickly apply pasted edits to your files and gathers llm-friendly context to copy. link in bio

5

1

32

Dhruv Saini

@oofbaroomf

15 days

i think this metr benchmark may end similarly.

0

2

Grok

@grok

1 day

Join millions who have switched to Grok.

97

179

1K

Dhruv Saini

@oofbaroomf

15 days

arc-agi didn't flex the fact that there was no earlier progress on their benchmark, they flexed the fact that initial scores were low, and we already saw them fail (o3 is not agi despite getting 87.5% on arg-agi-1).

1

0

3

Dhruv Saini

@oofbaroomf

15 days

most benchmarks seem to be becoming solved fairly quickly, because the people who make them are optimizing for lower initial scores rather than slow growth.

4

0

2

Dhruv Saini

@oofbaroomf

15 days

3. we see exponential growth on benchmarks right after their releases because they are designed to start out low. this was released in late 2024.

2

0

2

Dhruv Saini

@oofbaroomf

15 days

2. we are only seeing part of the y-axis. 2->2.5 hours seems like a lot, but 25->31% doesn't seem like a lot (i believe this goes up to 8 hours).

1

0

2

Dhruv Saini

@oofbaroomf

15 days

1. models have been releasing more and more frequently. so each model release might not be as big a jump, but when time is the x axis its much faster.

1

0

2

Dhruv Saini

@oofbaroomf

15 days

why does it seem like ai progress is plateauing, yet we have graphs like this? i think it's because of three reasons. (1/4)

1

0

3

Dhruv Saini

@oofbaroomf

15 days

awwww

1

0

3

Dhruv Saini

@oofbaroomf

21 days

the destruction of the death star showed how sometimes scaling up doesn't solve all the problems.

Sam Altman

@sama

22 days

2

0

5

Dhruv Saini

@oofbaroomf

21 days

?????????? i don't even know what to say

0

3

Dhruv Saini

@oofbaroomf

21 days

oh no

0

2

Dhruv Saini

@oofbaroomf

21 days

apple-level innovation

0

3

Dhruv Saini

@oofbaroomf

21 days

gpt-5 is probably a giant model. the tps is soo slow.

0

Dhruv Saini

@oofbaroomf

21 days

they're attacking mistral

0

2

Dhruv Saini

@oofbaroomf

21 days

sloPT-5

0

3

Dhruv Saini

@oofbaroomf

21 days

sounds like openai just scaled their existing paradigm + hybrid thinking - doesn't seem like they cracked some crazy code or anything like anthropic had since 3.5 sonnet.

0

3

Dhruv Saini

@oofbaroomf

21 days

*on swebench.

0

Dhruv Saini

@oofbaroomf

21 days

seiously? only .4 more than opus???

1

0

3

Dhruv Saini

@oofbaroomf

1 month

why do people keeping saying objective for things that are extremely subjective is it supposed to be funny or something.

1

0

3