ʟᴇɢɪᴛ @legit_api X Profile

ʟᴇɢɪᴛ

@legit_api

Followers

10K

Following

33K

Media

944

Statuses

3K

api guy with little-to-no wisdom, 0.5x engineer

⍨⧍⍨

Joined October 2018

Don't wanna be here? Send us removal request.

ʟᴇɢɪᴛ

@legit_api

2 months

Made by Veo 3. "A squirrel attempts to launch a homemade rocket into space"

16

34

431

ʟᴇɢɪᴛ

@legit_api

4 hours

summit - invent your own UFO game. it made a game about abducting cows

4

7

130

ʟᴇɢɪᴛ

@legit_api

16 hours

cuttlefish - new model on LM Arena

7

6

161

ʟᴇɢɪᴛ

@legit_api

17 hours

zenith - most powerful or successful

7

12

216

ʟᴇɢɪᴛ

@legit_api

1 day

lobster - fully working Fruit Ninja game. sound included 🔊

10

15

328

ʟᴇɢɪᴛ

@legit_api

2 days

zenith model SVG test on LM Arena. whichever lab made this cooked so hard

18

17

328

ʟᴇɢɪᴛ

@legit_api

2 days

Lobster is the closest to o3-alpha. better than starfish and nectarine. o3-alpha > lobster > nectarine > starfish

9

17

196

ʟᴇɢɪᴛ

@legit_api

3 days

starfish model - a flappy birds clone. successfully induces rage bait 😅

3

2

75

ʟᴇɢɪᴛ

@legit_api

3 days

starfish - new model in LM Arena

22

10

204

ʟᴇɢɪᴛ

@legit_api

5 days

new models landed in LM Arena

9

8

153

ʟᴇɢɪᴛ

@legit_api

8 days

RT @btibor91: gpt-5-reasoning-alpha-2025-07-13. h/t @swishfever

0

164

0

ʟᴇɢɪᴛ

@legit_api

11 days

Gemini Tutor coming soon…

3

10

154

ʟᴇɢɪᴛ

@legit_api

11 days

new stealth models on LM Arena

6

7

134

ʟᴇɢɪᴛ

@legit_api

11 days

like, just use this blue gradient 🥲

1

0

28

ʟᴇɢɪᴛ

@legit_api

11 days

alright, who was it?. which one of you made this abomination

14

1

97

ʟᴇɢɪᴛ

@legit_api

12 days

new stealth models on LM Arena. - cresylux.- nettle.- clownfish.- octopus

14

13

206

ʟᴇɢɪᴛ

@legit_api

17 days

RT @apples_jimmy: Grok 4:. Still no wall. 50.7% with Grok 4 heavy on humanity’s last exam. 41% with tools . 26.9% without tools. " Grok 4….

0

212

0

ʟᴇɢɪᴛ

@legit_api

17 days

RT @jie_bing: Grok 4 live stream.

0

51

0

ʟᴇɢɪᴛ

@legit_api

23 days

If they use “Test Time Compute” as a reference to cons@n metric. then Standard is likely the public Grok 4 reasoning model for us. the other one might measure e.g. consensus from n attempts which checks for most frequent answer and that usually improves score. focus on Standard.

Chase Brower

@ChaseBrowe32432

23 days

@HCSolakoglu @legit_api They previously used the same term to refer to cons@n. The standard is most likely what will be the publicly available reasoning model, and TTC is cons@32 or cons@64. As long as they also report standard scores (looks like they are here) I think it’s fine tbh.

2

0

87

ʟᴇɢɪᴛ

@legit_api

23 days

Grok-4 and Grok-4 Code on benchmarks. - 35% on HLE, 45% with reasoning!!.- 87-88% on GPQA.- 72-75% on SWE Bench (Grok 4 Code)

126

134

1K

ʟᴇɢɪᴛ

@legit_api

25 days

RT @legit_api: Steve does not perform too great. general consensus in servers is that it’s a small DS model or a 3rd party distilled model….

0

2

0