basvanopheusden @basvanopheusden X Profile

basvanopheusden

@basvanopheusden

Followers

2K

Following

3K

Media

27

Statuses

2K

Research at OpenAI, previously @imbue_ai and @cocosci_lab lab at Princeton. All opinions my own

San Francisco, USA

Joined October 2010

basvanopheusden

@basvanopheusden

7 hours

Modern problems require modern solutions

Yam Peleg

@Yampeleg

20 hours

Plotting data in ASCII art because Codex understands it better, crazy times

0

2

Sam Altman

@sama

12 hours

GPT-5.1 is out! It's a nice upgrade. I particularly like the improvements in instruction following, and the adaptive thinking. The intelligence and style improvements are good too.

2K

1K

10K

Matt Shumer

@mattshumer_

12 hours

I've been testing GPT-5.1 for a few days. My quick notes:
- creative writing style is a LOT better
- it's much faster than GPT-5 (with similar intelligence) for most prompts
- the personality is WAY better (but can still sometimes be annoying)
- it's great in Codex!

Sam Altman

@sama

12 hours

GPT-5.1 is out! It's a nice upgrade. I particularly like the improvements in instruction following, and the adaptive thinking. The intelligence and style improvements are good too.

56

35

857

Sakana AI

@SakanaAILabs

2 days

GPT-5 on Sudoku-Bench 🧩 Since releasing Sudoku-Bench in May 2025, when no LLM could solve a classic 9x9 puzzle, we've been evaluating the latest generation of models. GPT-5 now leads our leaderboard with 33% puzzles solved--approximately 2x the previous leader--and is the first

29

115

667

Jeremy Berman

@jerber888

2 days

I finally reached human-level performance (85%) on ARC-AGI v1 for under $10k and within 12 hours. I use the same multi-agent collaboration with evolutionary test-time compute, now powered by GPT-5 pro with lower parallelism.

Jeremy Berman

@jerber888

2 months

I'm back at the top of ARC-AGI with my new program. I use @grok 4 and multi-agent collaboration with evolutionary test-time compute

72

147

2K

yifei e/λ (meetmeinshibuya nov 16)

@yifever

8 days

congrats to llama 3 large for winning the LLM trading contest by not participating

96

139

4K

Abhijeet

@abhijeetdw

13 days

met the great genius @GMJuditPolgar yesterday

1

5

basvanopheusden

@basvanopheusden

7 days

Still, what an honor: to play, to learn, and to help bring @GMJuditPolgar to @OpenAI. An experience I’ll be chasing for a long time.

1

0

3

basvanopheusden

@basvanopheusden

7 days

I had one final amusing moment. After Judit played the crushing 35. e5, I found 35… g6, which appears convincing and in a bullet game (my specialty), might have actually saved the game. But of course she found the (only) refutation in seconds 😕

1

0

2

basvanopheusden

@basvanopheusden

7 days

These lines are not too complicated, but under the pressure of playing Judit, and knowing that my game was broadcast to hundreds of my colleagues, I crumbled and ended up losing all my queenside pawns in the next 5-10 moves…

1

0

1

basvanopheusden

@basvanopheusden

7 days

Surprisingly, when I later checked with the engine, it turned out that everything just works for Black. 19. Nc4 d5 is a sound positional pawn sacrifice, and after 19. Ra5, Bd8 is the top move, though the core idea is 20… a5!, neutralizing the rooks rather than trading bishops.

1

0

1

basvanopheusden

@basvanopheusden

7 days

The critical position in my game came after Judit played Ra5 on move 19, preparing to double rooks while eyeing the key d5 square. I had missed this idea and instead focused only on 19. Nc4 d5!?. After Ra5 I panicked and embarked on a wild idea to trade bishops with Be7-d8-b6.

1

0

1

basvanopheusden

@basvanopheusden

7 days

Playing Judit was surreal. She’s the strongest opponent I’ve ever faced, and it showed immediately in her precision, tactical prowess, and strategic finesse. Just seeing her across the board was enough to lose all self-confidence.

1

0

2

basvanopheusden

@basvanopheusden

7 days

Then came the highlight of the event, the simultaneous match. 8 OpenAI researchers vs. Judit. Final score: 0.5–7.5. We joked that we exceeded expectations 😅

1

0

2

basvanopheusden

@basvanopheusden

7 days

Judit shared stories about her work with Ernő Rubik and the Judit Polgar Chess Foundation, a reminder that learning, play, and curiosity belong to everyone. 🔗 https://t.co/QweP7rHEAm

1

0

1

basvanopheusden

@basvanopheusden

7 days

The event started with a fireside chat with @markchen90. They talked about chess before and after engines, how AI can sharpen or stifle creativity, and the beautiful ways human imagination and the classic game of chess persist beyond engines.

1

0

1

basvanopheusden

@basvanopheusden

7 days

Three weeks after joining OpenAI, I sent Judit a cold email: “it would mean the world to me if you could make it. Both your chess style and life story are a source of continued inspiration, and I'm sure many of my colleagues (chess players or not) will love to hear from you.”

1

0

2

basvanopheusden

@basvanopheusden

7 days

Last week, we hosted Grandmaster @GMJuditPolgar, a legend and the strongest woman in chess history, at @OpenAI for a conversation on intuition, creativity, and strategy. We think about these ideas every day. Chess is a reminder that reasoning and imagination go hand in hand.

1

3

9

basvanopheusden

@basvanopheusden

9 days

This is absolutely insane! 2800 with two pieces down...

Daniel Kokotajlo

@DKokotajlo

10 days

The author helpfully made this graph. QRR odds means Leela's starting forces are half the size of yours. If I'm googling correctly, that means it actually is favored to win against the median human chess player under those conditions.

0

2

IA Judit Sztaray

@ChessArbitress

11 days

One of my bucket list items was checked off today: we hosted @GMJuditPolgar at the @MILibrary chess club!

8

4

57