
Matt Henderson
@matthen2
Followers
80K
Following
8K
Media
881
Statuses
6K
maths, visualisations, conversational AI. VP Research @polyaivoice prev: @RekaAILabs, @Apple AI/ML, @GoogleAI, PhD @Cambridge_Eng
Edinburgh, UK
Joined February 2010
a thread with some of my top animations 📍🧵.
give each pixel a random Pokemon type, and then battle pixels against their neighbors, updating each pixel with the winning type (using the Pokemon type chart). we quickly see areas of fire > water > grass > fire, electric sweeping over, ground frontiers taking over etc etc
29
493
7K
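A minimal sketch of the pixel battle described above, in plain Python. The exact update rule behind the animation isn't given, so the rule here is an assumption: each pixel fights one random neighbour and takes that neighbour's type only if it is super effective against the pixel. `BEATS` is a five-type subset of the type chart, and `random_grid` and `step` are illustrative names.

```python
import random

# Subset of the type chart: attacker -> types it is super effective against.
# (Assumption: the real animation uses the full chart.)
BEATS = {
    "fire": {"grass"},
    "water": {"fire", "ground"},
    "grass": {"water", "ground"},
    "electric": {"water"},
    "ground": {"fire", "electric"},
}
TYPES = list(BEATS)
NEIGHBOUR_OFFSETS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def random_grid(width, height, rng):
    """Give each pixel a random type."""
    return [[rng.choice(TYPES) for _ in range(width)] for _ in range(height)]


def step(grid, rng):
    """Each pixel battles one random neighbour; a winning attacker takes the pixel."""
    height, width = len(grid), len(grid[0])
    new_grid = [row[:] for row in grid]  # update from a snapshot of the old grid
    for y in range(height):
        for x in range(width):
            dx, dy = rng.choice(NEIGHBOUR_OFFSETS)
            ny, nx = (y + dy) % height, (x + dx) % width  # wrap at the edges
            attacker = grid[ny][nx]
            if grid[y][x] in BEATS[attacker]:
                new_grid[y][x] = attacker
    return new_grid


rng = random.Random(0)
grid = random_grid(32, 32, rng)
for _ in range(50):
    grid = step(grid, rng)
```

Rendering each type as a colour after every `step` gives the animation frames. A uniform grid never changes, since no type beats itself.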
nice library for defining multi-step RL environments.
To push the open source frontier for RL + LLMs, we need scalable, modular environments with real-world complexity, beyond math benchmarks. Today, we’re releasing *benchmax*. An open-source framework to build, run, & scale useful RL envs for LLM fine-tuning, with integrations to
0
0
7
RT @RekaAILabs: 🎉 Big news! We've raised $110M from new and existing investors, including @nvidia & @Snowflake. This funding reinforces our…
reka.ai
Latest News & Articles
0
19
0
weirdest @vllm_project bug: model accuracy is fine with 10 or fewer parallel requests, and with 16 or more, but degrades badly with 11 to 15. ??
2
0
8
the original animation in @standupmaths' video.
construction of a Julia set through repeated twists, reflections, and shifts. The twists correspond to the complex square root.
2
0
25
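One way to read the construction above is as inverse iteration of z → z² + c: shift by −c, take the complex square root (the twist), and sometimes flip the sign to pick the other root (the reflection). The sketch below makes that assumption; it is not the code behind the animation, and `julia_points` and its parameters are illustrative names.

```python
import cmath
import random


def julia_points(c, n_points=2000, burn_in=50, seed=0):
    """Sample points near the Julia set of z -> z*z + c by inverse iteration."""
    rng = random.Random(seed)
    z = complex(1.0, 1.0)  # arbitrary starting point
    points = []
    for i in range(n_points + burn_in):
        z = cmath.sqrt(z - c)   # shift by -c, then the square-root "twist"
        if rng.random() < 0.5:  # reflect through the origin: the other root
            z = -z
        if i >= burn_in:        # early iterates have not yet reached the set
            points.append(z)
    return points


points = julia_points(complex(-0.8, 0.156))
```

Plotting the real and imaginary parts of `points` as a scatter gives a rough picture of the set. For c = 0 the points land on the unit circle, which is a quick sanity check.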
cool! I got a shoutout on @standupmaths' latest video on Julia sets. sharing some of my other Julia set stuff in the thread 🧵
2
2
82
very cool application of RL tuning. fun to see the reward hacking and how it was solved.
1/ Can codebase-specific RL push the frontier for code LLMs? At @cgftlabs, we helped a client RL-tune Qwen-2.5-7B on their internal codebase for unit test creation, with coverage-guided GRPO. The result? It beats o4-mini & o3. Here’s how it works (link to full blog in bio) 🧵
0
0
5
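For context on "coverage-guided GRPO": GRPO scores each sampled completion relative to the others generated for the same prompt. The sketch below shows only that group-relative advantage step, with test coverage as the reward. The coverage numbers are made up for illustration, `group_relative_advantages` is a hypothetical helper, and the actual reward design (including the reward-hacking fixes mentioned) is in the linked blog, not here.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalise each reward against its own group: (r - mean) / (std + eps)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical coverage achieved by four generated test files for one prompt.
coverages = [0.10, 0.40, 0.40, 0.70]
advantages = group_relative_advantages(coverages)
```

Completions with above-average coverage get a positive advantage and are reinforced; below-average ones are pushed down. If every completion scores the same, all advantages are zero and the group carries no learning signal.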