Spencer Cheng Profile
Spencer Cheng

@spenccheng

Followers
2K
Following
638
Media
7
Statuses
212

2x founder | AI + Construction | I build insanely fast simulators for reinforcement learning at https://t.co/JuTqEHQX4O

Dallas, TX
Joined May 2013
Don't wanna be here? Send us removal request.
@spenccheng
Spencer Cheng
3 months
7
12
111
@jsuarez5341
Joseph Suarez 🐡
4 days
Happy Black Friday. Are you using PufferLib for RL professionally? You can now subscribe right here on X for entry level support!
4
1
34
@spenccheng
Spencer Cheng
5 days
Want to know how to waste weeks of research? Start testing a new idea on a messy, unstable environment. Better approach: test complex methods on simple envs first. This helps build your intuition of what good performance looks like. Ask me how I know.
2
0
11
@spenccheng
Spencer Cheng
6 days
RL is so stable at puffer that I almost always assume that the reason an environment does not solve is because there's an env bug.
3
0
51
@fchollet
François Chollet
27 days
ML research is an engineering discipline, not a philosophy seminar. You build, you test, you learn. Untested ideas are just speculation.
109
246
3K
@_trish_xD
trish
29 days
C is NOT a hard language. Most people just don’t have the patience to learn pointers properly.
344
188
4K
@yacineMTB
kache
29 days
pufferlib actually runs the neural networks it trains in your browser btw
@jsuarez5341
Joseph Suarez 🐡
29 days
All these cool in-progress indie games on the timeline need a "play demo now" button that takes you to a browser build
5
1
95
@jsuarez5341
Joseph Suarez 🐡
29 days
This was true for small-model RL. The most widely used libraries were training standard baselines at 500-5k steps/second. With PufferLib, we're training 500k-5M steps/second and faster every update!
@jarredsumner
Jarred Sumner
30 days
Can someone explain to me why ~500 tok/s is fast and what in-the-weeds technical constraints prevent 100,000 tok/s at same quality? My gut is there’s incredible waste due to infinite money and in a world w/ 1/10000th of the capital models would be orders of magnitude better
1
3
77
@yacinelearning
Yacine Mahdid
1 month
we're going live to explore what this puffer has to offer 🐡
4
4
84
@spenccheng
Spencer Cheng
1 month
It is incredibly satisfying to see an agent crush in a new sim.
1
0
6
@jsuarez5341
Joseph Suarez 🐡
1 month
PufferLib lets you train agents in seconds on your laptop. Knowledge kept behind closed doors is easily lost. I'm not going to let that happen to RL.
6
7
138
@jsuarez5341
Joseph Suarez 🐡
1 month
Lonely fish in your area! See all of me at https://t.co/Vo5QDvKMxO. PR your agents to fill the tank. Thanks for 20k followers! This is going to be a good year for RL.
5
5
90
@jsuarez5341
Joseph Suarez 🐡
1 month
RL really sucks. It takes 10 hours just to learn breakout. ... a few years ago. It's <30 seconds on 1 GPU now in PufferLib and still dropping. Write faster code.
54
57
1K
@spenccheng
Spencer Cheng
1 month
When training RL policies on competitive domains, it is often quite useful to have it fight some easy opponent such as random actions or a scripted bot first. Don't try to slay the dragon on day one. Go farm some noobs to make sure your bot can learn.
1
1
22
@spenccheng
Spencer Cheng
1 month
Priceless jewels were stolen from the Louvre apparently this morning. I bet there is a Hollywood producer who’s flying right now to try and make this a limited series.
0
0
4
@spenccheng
Spencer Cheng
2 months
I don’t think people realize that Joseph’s streams are literally better than any class you can take in RL. Frontier lab dev. Access to an expert. Will answer all questions of any skill level.
@jsuarez5341
Joseph Suarez 🐡
2 months
I just stream everything except private sim dev for clients. This is all from this week
5
47
915
@spenccheng
Spencer Cheng
2 months
Engineers can’t fathom the value of good and simple UX. If it’s not complicated, it must be garbo obviously.
@levelsio
@levelsio
2 months
An entire generation that is unaware just serving a combination of AI models in a user friendly interface to regular people is a million to billion dollar business
0
0
7
@spenccheng
Spencer Cheng
2 months
This is my exact story with PufferLib. I found it while waiting for a delivery to my job site. I thought it was cool and asked a bunch of questions. I then tried to be as helpful to @jsuarez5341 as I could be. Now I get paid to build RL solutions for Puffer clients.
@thdxr
dax
2 months
so @rekram11 is joining our team full time - backstory is interesting he heard me say the thing i always say - find an early open source project with potential, contribute, answer questions, be helpful except unlike 99% of people who hear that he actually went and did it so
0
0
21
@spenccheng
Spencer Cheng
2 months
My most technical article on autonomous driving with RL led to new client contracts and adoption of my work in other self driving labs, but it received 10% of the views of my most viral post.
@im_roy_lee
Roy
2 months
the best organic content is not a good ad. the best ad will never be viral.
1
0
7
@spenccheng
Spencer Cheng
2 months
For 99% of RL envs, you can just debug your work by having a good renderer. That last 1% you gotta bite the bullet and write tests.
1
0
11
@jsuarez5341
Joseph Suarez 🐡
2 months
14
12
182