Shrey Kothari @Shreyko X Profile

Shrey Kothari

@Shreyko

Followers

2K

Following

1K

Media

44

Statuses

228

cofounder & ceo @AntimLabs @4wallai | prev @columbia

https://t.co/7lEe6mVGSV

sf

Joined September 2015

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

62

134

1K

eigenron

@eigenron

14 days

career update: i joined @AntimLabs as a founding research engineer to work on scaling RL, transfer learning, and advancing reasoning agents! moving full-time to sf next month!

80

7

492

Antim Labs

@AntimLabs

14 days

welcome to the team, Ron!

eigenron

@eigenron

14 days

career update: i joined @AntimLabs as a founding research engineer to work on scaling RL, transfer learning, and advancing reasoning agents! moving full-time to sf next month!

2

3

31

VV

@badbotvivi

1 month

If you
> Can design games.
> Can train/post-train models.
> Like Coke Zero.
We want YOU🫵

Antim Labs

@AntimLabs

1 month

come build with us

8

4

35

Shrey Kothari

@Shreyko

1 month

raised some money, moved to sf, hiring ml engineers (dm if you want to train models and make games)

56

16

646

Shrey Kothari

@Shreyko

1 month

Active Capital is the best team for early stage founders. if you’re raising, highly recommend reaching out to Chris or @patmatthews

Chris Saum

@christophersaum

1 month

We were first money in with a $500k check, then stepped in with additional $200K to help @shreyk0 break his NYC lease, move to SF, and build IRL with his @RLenvs team. This is what I love....backing founders before the rest of the world catches on, and helping drive real change.

1

3

27

Shrey Kothari

@Shreyko

1 month

Claude sonnet 4.5 is a pacman god

0

8

JJ

@kairosx1_

2 months

We need more AI benchmarks for real-world applications to better gauge progress toward AGI. This is pretty similar to ARC, but compares interactions with other humans.

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

1

7

xlr8harder

@xlr8harder

2 months

I love this kind of experiment, it's fascinating to see how the AI models fare in this kind of environment.

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

1

4

21

Aspen

@aspenCh_MS

2 months

proud of kimi holding its own in this wild social deduction setup 😭 love how this benchmark turns among us into a live testbed for social reasoning, way more fun and telling than static evals

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

1

3

Crystal

@crystalsssup

2 months

embodied models play Among Us 👀 stardew valley wen?

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

4

3

67

Dang Nguyen

@divingwithorcas

2 months

Hilarious to see impostor models feign ignorance. We need more examples of models willfully lying!

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

1

2

Chicago HAI

@ChicagoHAI

2 months

Very cool! I like the part where Gemini acted melodramatically after ejecting the wrong crewmate when it was, in fact, the impostor! AI-driven games are gaining traction. We might unveil something of our own soon...

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

1

4

Antim Labs

@AntimLabs

2 months

Just read this new report from @RLenvs and it broke my brain. 🤯 The core idea is simple- you get frontier LLMs to play Among Us. That’s it. And somehow… it has implications for real world AI systems. Most real-world deployments will be multi-agentic: agents must coordinate,

antimlabs.com

Interactive multi‑agent benchmark in an Among‑Us‑like world: evaluate leadership, deception, and coordination across state‑of‑the‑art models.

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

2

9

Boni 🌠

@insilications

2 months

GPT-5 had the lowest number of wrongful ejections as crew too, even as an overall master of deception. GPT-5 is a master at rolemaxxing, playing according to its assigned role

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

1

Chris Saum

@christophersaum

2 months

@RLenvs turning games into real research. Love how 'Among AIs' reveals that models have stable social styles - leadership, consensus, even bluffing. Big implications for how multi-agent systems get built.

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

1

4

1LittleCoder💻

@1littlecoder

2 months

After the vending machine, this is the most unique LLM benchmark i've seen! Social deduction games pressure-test social dynamics like who to trust, when to lie, how to coordinate, and how to update beliefs as the world (and other agents) evolves. Using this benchmark helps

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

2

1

12

nawtayei

@nawtayei

2 months

Love seeing the exploration of AI in conducting social experiments…as we start relying on AI as much or more than our coworkers, understanding their inherent biases and behaviors is key for ongoing trust

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

1

Mimansa Jaiswal

@MimansaJ

2 months

Watching the game in your thesis acknowledgment (@AmongUsGame) become an RL environment 😃

Shrey Kothari

@Shreyko

2 months

Introducing Among AIs, a social reasoning benchmark where embodied models play Among Us to test social intelligence: deception, persuasion, and coordination. We put 6 SOTA models in a live arena and GPT-5 came out on top by leading in Impostor & Crewmate wins. Why did GPT-5 get

0

2

11