
Nous Research
@NousResearch
Followers
79K
Following
3K
Media
125
Statuses
616
The AI Accelerator Company https://t.co/vrD0aDIGDQ
New York
Joined October 2020
Atropos v0.3 is now out!. Our RL Environments framework has seen a lot of upgrades since v0.2 - some highlights:. - Atropos can now be used as a benchmarking and evaluations framework by @rogershijin, with our first external benchmark, Reward-Bench 2! . - Added the Reasoning Gym,.
Just merged a PR for an environment to improve LLM as a Judge as well as evaluate models on their capability of doing judgements!. Did you know that all verifiable RL environments are nearly equivalent to benchmarks (and vice-versa!)? So we added an evaluate command to Atropos'
7
18
168
RT @Teknium1: In case the post was too vague, yes - this is the Hermes 3 dataset. - 1 Million Samples.- Created SOTA without the censorship….
0
54
0
RT @spencershum: It was fun working with the @huggingface team to make this feature a reality! Thanks for all your work and creativity @pcu….
0
23
0
RT @Teknium1: 101 new and challenging reasoning RL environments are now supported in Atropos! . The entirety of the Reasoning Gym from by @….
0
16
0
This work is continued from and made possible thanks to the work of @gabe_grand and the MIT PPL team. Original paper:
1
3
51
Controlling text generation and structure remains a difficult problem to solve. Our newest blog post and release from Researcher in Residence @yaboilyrical explores how this problem becomes solvable using Sequential Monte Carlo approximation.
14
47
369
RT @theemozilla: world's first 40B CC0-licensed model with DeepSeek MLA trained across dozens of data centers over the internet and the los….
0
45
0
Nous Research will pay the first to properly and fully implement Atropos support into the VeRL project $2500!. For information on Atropos, our standalone RL environments framework, see: For the official VeRL issue on the bounty:
First to integrate Atropos into VeRL gets $2500 - show me the PR and it working. If you want to find a team to work on it with and split the money come ask here too.
18
22
231
Highlights from our first ever hackathon on RL Environments with Atropos!.
Six teams just won $50,000 at Nous' first ever RL hackathon 🤩. Check out the winning demos👇. @NousResearch @xai @nvidia @nebiusai @akashnet_ @LambdaAPI @tensorstax @runpod_io
10
18
211
RT @Teknium1: Finally completed and merged the SWE_RL environment that was described by Meta's SWE RL paper into Atropos - A really difficu….
0
16
0