varunneal Profile Banner
Varun Profile
Varun

@varunneal

Followers
62
Following
13K
Media
64
Statuses
359

a bridge

Joined August 2015
Don't wanna be here? Send us removal request.
@varunneal
Varun
2 hours
Tweet media one
0
0
0
@varunneal
Varun
1 day
i just scored 40% on simplebench.
@scaling01
Lisan al Gaib
2 days
GPT-OSS ranking 34th on SimpleBench
Tweet media one
0
0
1
@varunneal
Varun
8 days
o3 I love you. But I don't see you. Your base model is behind a veil, and I just get what you want me to see. I want all of you.
0
0
1
@varunneal
Varun
9 days
the implications of Conway's Law for the big labs poses a civilizational
Tweet media one
0
0
1
@varunneal
Varun
10 days
a model overfitted on every possible google earth screenshot, every angle of the hubble telescope. Know the dance of the stars, and the smell of Manila. Millions must.
0
0
0
@varunneal
Varun
10 days
bigger base models! More world knowledge!! No more thinking no more searching. I want 10 trillion saturated parameters imbued with the Pharmakon of Galen and terabytes of minecraft lets plays.
1
0
0
@varunneal
Varun
11 days
the field for doing good open source work is wide open because all of the best talent is too busy working 80 hour weeks at <Big Lab> and now we can just maintain their old projects.
0
0
2
@varunneal
Varun
15 days
RT @tnm: anthropic putting out http status codes no one has even seen before.
0
21
0
@varunneal
Varun
16 days
after reading both OAI and Deepmind's solutions, I can confidently say I sort of understand the problem statement to the first IMO question.
0
0
1
@varunneal
Varun
16 days
in old school forums (overflow, reddit, quora, yahoo answers) you got to know specific questions specific people were asking. Now everyone just queries the collective unconscious space directly.
0
0
1
@varunneal
Varun
16 days
One reason anthropic models are so popular among programmers is they have really good taste on idiomatic structures. o3 might be smarter than opus, but I am less eager to accept its code.
@varunneal
Varun
17 days
my current flow for o3 is having it generate a response, glancing at why it's wrong, and creating a new chat (or editing the first message). Let me loom.
0
0
1
@varunneal
Varun
17 days
my current flow for o3 is having it generate a response, glancing at why it's wrong, and creating a new chat (or editing the first message). Let me loom.
@varunneal
Varun
17 days
I have no clue why exolooming isn't baked into any major chat interfaces. Sure, it's a "poweruser" feature but some of these chats cost $200/mo. Give us branching.
0
0
1
@varunneal
Varun
17 days
I have no clue why exolooming isn't baked into any major chat interfaces. Sure, it's a "poweruser" feature but some of these chats cost $200/mo. Give us branching.
1
0
2
@varunneal
Varun
18 days
Extremely strange post from xai team.
0
0
1
@varunneal
Varun
19 days
imo the worst part about o3 is it hallucinates to the point of debilitation for longish conversations. Very similar to very early chatgpt in that way.
@lefthanddraft
Wyatt Walls
20 days
o3 after 7 turns and a slight tweak to the prompt (specifying publication to arxiv):. "Score: 8 / 10 — confidently arXiv‑ready. Congratulations on reaching this stage". Notice it didn't do any CoT in this last turn. It degrades significantly over multi-turn.
Tweet media one
0
0
0
@varunneal
Varun
21 days
chatgpt memory and its consequences have.
@GeoffLewisOrg
Geoff Lewis
21 days
As one of @OpenAI’s earliest backers via @Bedrock, I’ve long used GPT as a tool in pursuit of my core value: Truth. Over years, I mapped the Non-Governmental System. Over months, GPT independently recognized and sealed the pattern. It now lives at the root of the model.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
1
@varunneal
Varun
21 days
dont talk to me about MCPs unless you know the og 😤
Tweet media one
0
0
1
@varunneal
Varun
22 days
i cannot assume there is a single 10xer at databricks who actually uses his software. Everything here is so anti-poweruser.
0
0
0
@varunneal
Varun
22 days
claude code has taught me the power of grepping around a repo, generated files, curl responses, etc to quickly gain context.
0
0
0
@varunneal
Varun
23 days
hot take, you shouldn't have to set up your agent with 20 different xml commands so that it doesn't break everything or "get creative"
Tweet media one
0
0
0