snow

@snowclipsed

Followers
5K
Following
55K
Media
687
Statuses
11K

latent space surfer, cache-miss eliminator. https://t.co/h8rzm8QyZc

United States
Joined June 2017
@snowclipsed
snow
2 hours
new blogpost after a long time! in this series i will talk about how to solve reinforcement learning for long-horizon tasks, incrementally from the most straightforward approaches. (link in replies!) in part I of this series, we throw RL at the cube in its most direct,
8
14
88
@snowclipsed
snow
2 hours
i want a jedi language model
0
0
1
@JustHackingHQ
Just Hacking Training (JHT)
2 days
Black Friday Comes Early 🦃 Code "BlackFriday25" active NOW for 25% off ALL courses on Just Hacking Training including Constructing Defense 2025! Excludes already discounted Bundles. Expires Nov 30 at Midnight ET.
3
13
28
@snowclipsed
snow
2 hours
Sinatras is doing amazing RL work!!
@myainotez
Sinatras
2 hours
PMPP-Eval Update! Upon release of K2-Thinking, i have evaluated it and a couple of other models that were requested, such as R1 and Qwen3 235B, over the pmpp-eval coding subset. K2-Thinking is now the best open model available, according to the results surpassing sonnet 4.5 for cuda tasks.
0
0
8
@TheAhmadOsman
Ahmad
2 hours
a banger of an intro to RL teaching llm to solve rubik's cubes
@snowclipsed
snow
2 hours
new blogpost after a long time! in this series i will talk about how to solve reinforcement learning for long-horizon tasks, incrementally from the most straightforward approaches. (link in replies!) in part I of this series, we throw RL at the cube in its most direct,
1
1
10
@myainotez
Sinatras
2 hours
Great read to close your weekend off: rubik's, grpo, creating rl envs. it really is pretty dense on information, check it out
@snowclipsed
snow
2 hours
new blogpost after a long time! in this series i will talk about how to solve reinforcement learning for long-horizon tasks, incrementally from the most straightforward approaches. (link in replies!) in part I of this series, we throw RL at the cube in its most direct,
0
2
7
@snowclipsed
snow
2 hours
also shoutout to @OccupyingM for doing preliminary research on their end about the same as well, their findings highly correlate with mine :) link to their post :
@OccupyingM
krishna
27 days
can your llm rotate a shape inside its head? i found out yes but it's a fucking idiot when it comes to the upper layer... why? non uniform spatial reasoning.... here's an eval to test the internal latent reasoning of your models.
0
0
3
@snowclipsed
snow
2 hours
many, many blogposts to come! i have another 3 queued already :) exciting times.
0
0
2
@RalstonCollege
Ralston College
5 days
In the final 2025 Sophia Lecture Dr Bret Weinstein @BretWeinstein explores the deep interplay of genes, culture and consciousness in shaping humanity’s path: consciousness, he argues, is an evolutionary tool for novelty, enabling us to build civilizations that outlive each of us.
1
4
16
@snowclipsed
snow
2 hours
also thanks to @fujikanaeda @secemp9 @_ueaj @tokenbender @_vatsadev @myainotez @nyxkrage for nourishing an introduction to RL for me :)
0
0
8
@snowclipsed
snow
6 hours
now I'd listen to this banger
@code_star
Cody Blakeney
6 hours
I saw a cool bodega cat
1
0
12
@snowclipsed
snow
12 hours
the one dimensional flow of a conversation with language models can be severely limiting for many usecases that require deep-end reasoning and have a high error/deviation rate, because you can't have detour conversations easily and context rot exists. i think comfyui-like UI or
1
0
5
@Spacepointorg
Spacepoint
3 days
As we wait for @nasa @RocketLab @blueorigin EsCAPADE / New Glenn-2 Launch, don't forget, our 2025 GSPC Photo Contest is closing 12/31/25..! Will the EsCAPADE launch win? Stay tuned! 5 Categories... In the meantime, one of 2024's best by Max Evans @_mgde_
2
9
56
@snowclipsed
snow
15 hours
if you think about it, a language model is just a really good prefix tree pruner
0
0
2
@AlpinDale
Alpin
1 day
New weekend blogpost. Some light PTX exploration, and a simple Top-K kernel.
9
47
490
@snowclipsed
snow
17 hours
it is time for the monthly bookmark clearing spell
1
0
3
@snowclipsed
snow
17 hours
what an incredible day
2
0
1
@AJ_Dunkentell
Alonzo Dunkentell Jr.
4 hours
0
4
11
@tenderizzation
tender
2 days
[ENG SUB] how it feels to use eager pytorch in 2025
24
52
381
@kalomaze
kalomaze
18 hours
RL LEARNING WITH LORA: A DIVERSE DEEP DIVE
18
54
646
@snowclipsed
snow
2 days
be afraid of nothing
0
0
2
@snowclipsed
snow
2 days
can confirm
@1thousandfaces_
Hero Thousandfaces
2 days
*australian person seeing snow* it’s snauring
0
0
0
@willccbb
will brown
2 days
if you want the tweet version and not the 10min video version: this is now all it takes to train with prime-rl after installing verifiers
@willccbb
will brown
2 days
verifiers v0.1.7 is released 🚀 this one's all about making RL training and experimentation waaaay easier:
- single-command installation for prime-rl
- single-command training w/ unified configs
- overhauled vf.RLTrainer for hacking on new algorithms
quick demo + links below :)
6
7
78