legit_api Profile Banner
ʟᴇɢɪᴛ Profile
ʟᴇɢɪᴛ

@legit_api

Followers
10K
Following
33K
Media
938
Statuses
3K

api guy with little-to-no wisdom, 0.5x engineer

⍨⧍⍨
Joined October 2018
Don't wanna be here? Send us removal request.
@legit_api
ʟᴇɢɪᴛ
2 months
Made by Veo 3. "A squirrel attempts to launch a homemade rocket into space"
16
34
429
@legit_api
ʟᴇɢɪᴛ
16 hours
starfish model - a flappy birds clone. successfully induces rage bait 😅
3
2
64
@legit_api
ʟᴇɢɪᴛ
18 hours
starfish - new model in LM Arena
Tweet media one
21
9
194
@legit_api
ʟᴇɢɪᴛ
3 days
new models landed in LM Arena
Tweet media one
8
8
151
@legit_api
ʟᴇɢɪᴛ
6 days
RT @btibor91: gpt-5-reasoning-alpha-2025-07-13. h/t @swishfever
Tweet media one
0
163
0
@legit_api
ʟᴇɢɪᴛ
9 days
Gemini Tutor coming soon…
Tweet media one
3
10
150
@legit_api
ʟᴇɢɪᴛ
9 days
new stealth models on LM Arena
Tweet media one
6
7
133
@legit_api
ʟᴇɢɪᴛ
9 days
like, just use this blue gradient 🥲
Tweet media one
1
0
27
@legit_api
ʟᴇɢɪᴛ
9 days
alright, who was it?. which one of you made this abomination
Tweet media one
14
1
95
@legit_api
ʟᴇɢɪᴛ
10 days
new stealth models on LM Arena. - cresylux.- nettle.- clownfish.- octopus
Tweet media one
14
13
205
@legit_api
ʟᴇɢɪᴛ
15 days
RT @apples_jimmy: Grok 4:. Still no wall. 50.7% with Grok 4 heavy on humanity’s last exam. 41% with tools . 26.9% without tools. " Grok 4….
0
211
0
@legit_api
ʟᴇɢɪᴛ
15 days
RT @jie_bing: Grok 4 live stream.
0
51
0
@legit_api
ʟᴇɢɪᴛ
21 days
If they use “Test Time Compute” as a reference to cons@n metric. then Standard is likely the public Grok 4 reasoning model for us. the other one might measure e.g. consensus from n attempts which checks for most frequent answer and that usually improves score. focus on Standard.
@ChaseBrowe32432
Chase Brower
21 days
@HCSolakoglu @legit_api They previously used the same term to refer to cons@n. The standard is most likely what will be the publicly available reasoning model, and TTC is cons@32 or cons@64. As long as they also report standard scores (looks like they are here) I think it’s fine tbh.
2
0
89
@legit_api
ʟᴇɢɪᴛ
21 days
Grok-4 and Grok-4 Code on benchmarks. - 35% on HLE, 45% with reasoning!!.- 87-88% on GPQA.- 72-75% on SWE Bench (Grok 4 Code)
Tweet media one
126
134
1K
@legit_api
ʟᴇɢɪᴛ
22 days
RT @legit_api: Steve does not perform too great. general consensus in servers is that it’s a small DS model or a 3rd party distilled model….
0
2
0
@legit_api
ʟᴇɢɪᴛ
22 days
Steve does not perform too great. general consensus in servers is that it’s a small DS model or a 3rd party distilled model based on DS. the latter would explain why it might claim to be DeepSeek even if it might not be. lmarena is hiding its origin:
Tweet media one
@AiBattle_
AiBattle
23 days
Steve 🆚 Deepseek V3 (0324) - Space Invaders . For this prompt Deepseek V3 generated ≈800 lines of code, while Steve produced ≈300 lines. If Steve is a Deepseek model, it might be a smaller model rather than V4. The naming scheme also suggests that the model is Chinese, as we
Tweet media one
Tweet media two
1
2
41
@legit_api
ʟᴇɢɪᴛ
23 days
new stealth model on LM Arena. steve - claims to be from DeepSeek
Tweet media one
12
10
220
@legit_api
ʟᴇɢɪᴛ
1 month
0
1
9
@legit_api
ʟᴇɢɪᴛ
1 month
Gemini 2.5 model family tech report
Tweet media one
4
13
144
@legit_api
ʟᴇɢɪᴛ
1 month
now also available on AI Studio and Gemini API
Tweet media one
1
3
47
@legit_api
ʟᴇɢɪᴛ
1 month
- Gemini 2.5 Pro (stable).- Gemini 2.5 Flash (stable).- Gemini 2.5 Flash Lite (preview). these models are now available on Vertex AI
Tweet media one
@legit_api
ʟᴇɢɪᴛ
1 month
As seen in our server last week. we *should* be getting stable (GA) releases for Gemini 2.5 Pro + 2.5 Flash. and a preview for 2.5 Flash Lite
Tweet media one
15
18
286