Clayton Thorrez @cthorrez X Profile

Clayton Thorrez

@cthorrez

Followers

1K

Following

10K

Media

438

Statuses

3K

Rating systems and paired comparison experimentation enjoyer @arena Previous: ML @umich @umass @microsoft @apple

https://t.co/JkB2g8fzwq

Joined March 2016

Don't wanna be here? Send us removal request.

Clayton Thorrez

@cthorrez

30 days

If you're good at data science/machine learning engineering/data engineering and want to exercise those skills on the most interesting and dynamic human preference dataset collected by mankind @lmarena_ai DM me

2

10

Clayton Thorrez

@cthorrez

8 hours

this seems like suboptimal memory management

0

3

Clayton Thorrez

@cthorrez

15 hours

Next day delivery 💪

lmarena.ai

@arena

15 hours

🚨 Text Leaderboard Update Community votes are in, and @anthropicAI's Claude Haiku 4.5 ranks #22! It has quickly become one of the best value models on the most competitive leaderboard. It delivers a solid punch at a fraction of the cost of its bigger siblings. ⚡️ A few

0

1

6

Clayton Thorrez

@cthorrez

1 day

glicko2 still undefeated in accuracy and log loss I'm actually thinking of offering a bounty on this haha, can anyone come up with a better general purpose dynamic skill rating system than one from 2001?

0

1

Clayton Thorrez

@cthorrez

1 day

EsportsBench v7 49k new matches from 6/30/2025 to 9/30/2025 https://t.co/j07qXG8ZXN

huggingface.co

1

0

3

Clayton Thorrez

@cthorrez

2 days

Come check these cool new models out in our discord! (And then stay and have cool discussions with me about ranking and rating systems in the #leaderboards channel 😃)

lmarena.ai

@arena

2 days

🚨🎬 Veo 3.1 and Veo 3.1 Fast are in the Video Arena! Come see what all the chatter is about by trying it yourself. 🌎 Your real-world prompts will push the @googledeepmind video models to its true limits

0

1

6

Clayton Thorrez

@cthorrez

2 days

the state of type checking in python + jax + jaxtyping 🤣

0

2

LeBron James

@KingJames

4 years

🤔Something is REAL 🐠 🐟 🎣 🐟🐠 going on

5K

6K

71K

Clayton Thorrez

@cthorrez

6 days

lowkey miss standards of business conduct

Gergely Orosz

@GergelyOrosz

6 days

When I worked at Microsoft, it was mandatory for all employees to watch this video about how a former employee made $400K trading company stock w indsider info. Got 2 years of jail. The message was to never inside trade company stock. Crypto is not company stock though, is it?

0

2

Clayton Thorrez

@cthorrez

7 days

LLMs are so RLs by errors in their sandboxes that they explicitly disobey direct instructions

0

2

Clayton Thorrez

@cthorrez

7 days

original on the left: https://t.co/QvX0etlKCv clone on the right:

contra.com

Connect with next-gen talent and tools to get work underway. Hire more independents. Start more projects. Get more creative. Tap into $120M+ in commission-free projects on Contra.

0

1

Clayton Thorrez

@cthorrez

7 days

Look I know it's popular these days to jump on the arena bandwagon but this is wild insane to brand this as "creative"

John Avent

@johnnnavent

7 days

so it kinda looks like Contra the "creative network" basically ripped off @designarena_ai's entire concept and their website design

1

0

5

Anastasios Nikolas Angelopoulos

@ml_angelopoulos

7 days

We'll be hosting an @arena x @felicis happy hour at @PyTorch conference this year! If you're interested in learning more about LMArena, or chatting with us about evaluating AI systems for real world usage, please stop by. I'll be there! You can request to join the event here:

1

3

19

Clayton Thorrez

@cthorrez

7 days

me: writes some code and starts to write a comment LLM: # this is a bit of a hack 😭😭😭

0

1

Clayton Thorrez

@cthorrez

8 days

Oh sweet! @glicko is publishing the videos from NESSIS 2025 on youtube ! https://t.co/0UkaoMKvWy Check them out if you are interested in sports statistics stuff

0

1

Clayton Thorrez

@cthorrez

8 days

My side project of ranking esports players and teams is how I built the skills to do the side project I used to get hired at LMArena Now my full time job is rating and ranking :D

maharshi

@mrsiipa

8 days

my side projects that i did in 2024 gave me a full time role at FAL where i get to do the thing which excites me the most i.e. optimizing ML inference: people underestimate the power of side projects

1

7

Clayton Thorrez

@cthorrez

14 days

very neck and neck

lmarena.ai

@arena

14 days

🚨 Leaderboard Update: we have a four-way tie for #1 in the Arena! 🏆 The very top tier is now tied across the strongest models in the world: 🏆 Claude Sonnet 4.5 32k Thinking 🏆 Claude Sonnet 4.5 standard 🏆 Claude Opus 4.1 🏆 Gemini 2.5 Pro All separated by just a few Arena

0

4

lmarena.ai

@arena

14 days

🎉 Re-introducing Categories in Vision Arena! Since we first introduced categories over two years ago (and Vision Arena last year), the AI evaluation landscape has grown rapidly. Categories let us zoom in on model performance for specific areas, from captioning to diagrams. 🧵

1

9

101

Clayton Thorrez

@cthorrez

17 days

everyone say thank you Mr. 🍍

Armando Ricci

@LancelotRicci

17 days

🍍 has spoken again New model added to LMArena. - glm-4.6

0

3

lmarena.ai

@arena

17 days

🚨 Big leaderboard update on the toughest Arena to crack: Text 📝 Seven new models landed today, and five broke straight into the Top 10 🏎️ 💨 🔹#8: Qwen3-VL-235B-a22b-Instruct & Qwen3-Max-2025-09-23 (tied) by @alibaba_qwen 🔹#9: DeepSeek V3.1 Terminus (Standard & Thinking

11

26

197

Clayton Thorrez

@cthorrez

17 days

https://t.co/ebVE1emnI8

roon

@tszzl

17 days

enders game (&speaker series) is significantly better than anything asimov clark or heinlein have written

0