jasonyo Profile Banner
Jason Yosinski Profile
Jason Yosinski

@jasonyo

Followers
7K
Following
2K
Media
57
Statuses
2K

Running experiments @OpenAI + @ml_collective. Prev: @Windscape_AI, Uber AI Labs founding team, adviser @RecursionPharma, Cornell, Montreal, Caltech 🌻

San Francisco, CA
Joined February 2008
Don't wanna be here? Send us removal request.
@jasonyo
Jason Yosinski
14 days
@techreview
MIT Technology Review
14 days
OpenAI has trained its LLM to confess to bad behavior
0
2
6
@jasonyo
Jason Yosinski
14 days
This was a fun project (that I jumped into halfway through) with @ManasJoglekar, Jeremy Chen, @GabrielDWu1, @j_asminewang, @boazbaraktcs, and @mia_glaese. Boaz wrote a good casual summary: https://t.co/IqWSkhCYQD
@boazbaraktcs
Boaz Barak
14 days
1/5 Excited to announce our paper on confessions! We train models to honestly report whether they “hacked”, “cut corners”, “sandbagged” or otherwise deviated from the letter or spirit of their instructions. @ManasJoglekar Jeremy Chen @GabrielDWu1 @jasonyo @j_asminewang
1
3
10
@jasonyo
Jason Yosinski
14 days
We just posted a blog + paper on a a simple but effective approach to model honesty called "Confessions" TL; DR: normal RL training rewards for high performance on a task. Confession training is a separate phase that rewards only for honesty. Test look promising! More:
@OpenAI
OpenAI
14 days
In a new proof-of-concept study, we’ve trained a GPT-5 Thinking variant to admit whether the model followed instructions. This “confessions” method surfaces hidden failures—guessing, shortcuts, rule-breaking—even when the final answer looks correct. https://t.co/4vgG9wS3SE
1
2
18
@j_asminewang
Jasmine Wang
16 days
Today, OpenAI is launching a new Alignment Research blog: a space for publishing more of our work on alignment and safety more frequently, and for a technical audience. https://t.co/n3oIhyDZHd
39
137
1K
@wlthxyz
WLTH
5 days
xAI could be the next Elon Musk company to go public. What separates it from the rest: • $230B pre-money valuation talks after merging with X, more than 4× in under a year, driven by institutional demand and strategic insiders. • 200,000 Nvidia GPUs deployed in ~120 days,
1
5
48
@ml_collective
ML Collective
4 months
A little while ago, many of you gave generously to support a number of MLC-Nigeria researchers in attending Deep Learning Indaba #DLI2025. Here's the crew 👇that attended; from what we hear it was a bustle of talks, posters, mentorship, and sparks of collaboration!
@hemhemoh
Mardiyyah
4 months
Today we say goodbye to @DeepIndaba after six inspiring days in Kigali rich with keynotes, tutorials, workshops, mentorship circles, and insightful posters that kept us learning non-stop. Some of us were only able to make it down to #DLI2025 because of your generous support.
1
15
67
@hemhemoh
Mardiyyah
4 months
Today we say goodbye to @DeepIndaba after six inspiring days in Kigali rich with keynotes, tutorials, workshops, mentorship circles, and insightful posters that kept us learning non-stop. Some of us were only able to make it down to #DLI2025 because of your generous support.
@savvyRL
Rosanne Liu
5 months
The opportunity gap in AI is more striking than ever. We talk way too much about those receiving $100M or whatever for their jobs, but not enough those asking for <$1k to present their work. For 3rd year in a row, @ml_collective is raising funds to support @DeepIndaba attendees.
1
17
94
@jasonyo
Jason Yosinski
5 months
Help send a bunch of researchers to DL Indaba this year! For less than one H100 we can send 25 people!
@savvyRL
Rosanne Liu
5 months
The opportunity gap in AI is more striking than ever. We talk way too much about those receiving $100M or whatever for their jobs, but not enough those asking for <$1k to present their work. For 3rd year in a row, @ml_collective is raising funds to support @DeepIndaba attendees.
0
20
37
@AEGworldwide
AEG
14 days
At AEG, we don’t just create events—we create memories. Join the team behind unforgettable music and sports moments
0
5
31
@ml_collective
ML Collective
6 months
Starting in 1 hour: @thebasepoint presents Anthropic's "Biology of a Large Language Model" work at the DLCT reading group. Paper: https://t.co/gxg8Ixlud2 Come for the chain of thought, stay for the rabbits and habbits. Zoom info below 👇
3
4
13
@jasonyo
Jason Yosinski
7 months
Starting in 30 min!
@ml_collective
ML Collective
7 months
Next Research Jam is in 14 hours, tomorrow morning at 8am PT. Stop by this virtual lab meeting to hear research ideas and updates on projects in progress! Zoom info at https://t.co/bmwCfHTkJR
0
1
4
@jasonyo
Jason Yosinski
7 months
Next MLC Research Jam is tomorrow; sharing two ideas myself to mix things up :)
@ml_collective
ML Collective
7 months
Next Research Jam is in 14 hours, tomorrow morning at 8am PT. Stop by this virtual lab meeting to hear research ideas and updates on projects in progress! Zoom info at https://t.co/bmwCfHTkJR
1
1
6
@jasonyo
Jason Yosinski
7 months
Starting in 15 min!
@ml_collective
ML Collective
7 months
This week at Deep Learning: Classics and Trends we're kicking off a new five part mini-series on LLM Interpretability. Up first: @thesubhashk shows how LLMs represent numbers on a helix and use it to add! Join Friday at 10am PT, zoom here: https://t.co/f5VxrVu4Mg
0
1
4
@MedievalTimes
Medieval Times
24 hours
Our equine entertainers are the true stars of the show! 🐴
0
1
26
@jasonyo
Jason Yosinski
7 months
I am sitting here watching my HF smolagent slowly reason about and click on Captcha squares one a time 🙈. Is this general AI?
1
0
6
@ml_collective
ML Collective
8 months
Tomorrow at 10am PT we'll have our next MLC OpenClubHouse, our 25th 🎉! Stop by to hang out, catch up with friends, and chat about ML or anything else. We'll meet in the MLC Discord #openclubhouse channel: https://t.co/gVKOZbNCVh
Tweet card summary image
discord.com
Discord is great for playing games and chilling with friends, or even building a worldwide community. Customize your own space to talk, play, and hang out.
0
2
6
@ml_collective
ML Collective
8 months
If you're at #ICLR2025, stop by the ML Collective Picnic Lunch on Monday at 12:30, graciously hosted by Alex Bezzubov! All welcome, bring your own lunch and meet new research friends and collaborators. 🥰 https://t.co/wDqKTB2r99
Tweet card summary image
luma.com
ML Collective open picnic lunch at ICLR 2025 It's been a conference or few since our last in-person lunch, so let's get together and eat! Everyone welcome, no…
0
1
10
@ml_collective
ML Collective
1 year
Rajat Modi presenting his work right now on getting Glom to work, poster #6304. Asynchronous Perception Machine for Efficient Test Time Training Rajat Modi · Yogesh Rawat West Ballroom A-D #6304
0
2
12
@JainRohan16
Rohan Jain
1 year
✨Our new @unireps paper tries to answer why the Lottery Ticket Hypothesis (LTH) fails to work for different random inits through the lens of weight-space symmetry. We improve the transferability of LTH masks to new random inits leveraging weight symmetries. 🧵(1/6)
7
26
85
@ml_collective
ML Collective
1 year
And...there we go! TL;DR: we are launching a new event series called "Industry Round Tables" with its first instance on Thursday, August 22! Register here if interested: https://t.co/vPn7Oy6yAX
@ml_collective
ML Collective
1 year
Get ready for our first LinkedIn post!
1
4
18
@paces_ai
Paces
1 year
We speed up renewable energy site selection & due diligence, reducing months of work to minutes. With $11M in new funding led by @navitascapital, we'll improve our software tools to allow quick, informed decisions that accelerate the energy transition. https://t.co/7qGkIho9XX
2
6
13
@useTria
Tria
2 days
Base assets are now usable in everyday life. Top up your Tria card, tap to pay globally, and keep full custody. Use creator coins anywhere Visa or Mastercard are accepted. Onchain meet real world.
414
243
746
@jasonyo
Jason Yosinski
2 years
I had a pretty fun conversation with @JonKrohnLearns the other day on startups, wind energy, the electrical grid in the US, ML, and (of course) how neural networks really work :)
@JonKrohnLearns
Jon Krohn
2 years
One of my all-time favorite A.I. researchers, Dr. Jason Yosinski (@jasonyo), is my guest today! He details how his startup is using ML to collect wind energy more efficiently and digs into visualizing/understanding deep neural networks. Watch here: https://t.co/esAWRX44MZ
1
3
14
@jlfwong
Jamie Wong
2 years
Canadians, if you‘re considering switching to a heat pump to heat and cool your home, and you’re curious about utility bill costs, pay back period, and reduction in greenhouse gas emissions, I made a thing for you. Link 👇
1
5
33