yacinelearning Profile Banner
Yacine Mahdid Profile
Yacine Mahdid

@yacinelearning

Followers
12K
Following
25K
Media
1K
Statuses
8K

(neuro/ai) I make technical deep learning tutorials 👺

Montreal Canada
Joined January 2019
Don't wanna be here? Send us removal request.
@yacinelearning
Yacine Mahdid
2 months
if there is one thing that you must not do is surrender. don’t surrender your dreams, your passion, your curiosity or your freedom. never
Tweet media one
6
19
230
@yacinelearning
Yacine Mahdid
30 minutes
this weekend we are going to figure if the clankers have a theory of mind or not
Tweet media one
2
1
30
@grok
Grok
18 days
Join millions who have switched to Grok.
641
733
5K
@yacinelearning
Yacine Mahdid
9 hours
oh this ties up so well to that neuro paper review we did 3 weeks ago
Tweet media one
@emollick
Ethan Mollick
2 days
This paper finds LLMs' ability to understand that others have different beliefs (Theory of Mind) comes from 0.001% of their parameters. Break those specific weights & the model loses both its ability to track what others know AND language comprehension. Interesting implications.
Tweet media one
2
3
25
@yacinelearning
Yacine Mahdid
10 hours
total RL victory.
@jyo_pari
Jyo Pari
22 hours
For agents to improve over time, they can’t afford to forget what they’ve already mastered. We found that supervised fine-tuning forgets more than RL when training on a new task! . Want to find out why? 👇
Tweet media one
0
0
11
@yacinelearning
Yacine Mahdid
12 hours
me giving the water gun to my 6 yo:
Tweet media one
2
1
24
@yacinelearning
Yacine Mahdid
12 hours
tldw on muon-clip in kimi k2:. - regular muon with weight decay.- save the max attention logit per attention head (s_max in the thumbnail).- for each head calculate the scaling factor mu (at most 1).- scale the weights of Q and K for the next iteration with mu and alpha.- profit.
@yacinelearning
Yacine Mahdid
21 hours
we're going to figure out that muon-clip TODAY no excuse
Tweet media one
0
0
26
@yacinelearning
Yacine Mahdid
21 hours
join in over here folks if you are a youtube gal:
0
1
2
@yacinelearning
Yacine Mahdid
21 hours
we're going to figure out that muon-clip TODAY no excuse
Tweet media one
@yacinelearning
Yacine Mahdid
21 hours
for teacher day we are going to figure out what the heck is muon-clip today live. so hop in and do drop your deep learning questions in the chat
4
5
113
@yacinelearning
Yacine Mahdid
21 hours
for teacher day we are going to figure out what the heck is muon-clip today live. so hop in and do drop your deep learning questions in the chat
4
0
33
@yacinelearning
Yacine Mahdid
22 hours
TO ALL THE HATERS OF THE WAFFLEHOUSE ON HACKERNEWS MICHAEL PLANNED HIS WHOLE LIFE TOO OK THIS IS 100% NORMAL BEHAVIOR
Tweet media one
0
0
14
@yacinelearning
Yacine Mahdid
23 hours
a lot of folks are confused about linkedin but it’s not hard:. its a year long conference with networking event. yeah there’s the cringe talk on stage about this or that product. yeah everyone is boasting about their career. but you’re there to meet people and get their dm info.
9
1
43
@yacinelearning
Yacine Mahdid
2 days
that make so much sense.
@cloneofsimo
Simo Ryu
11 months
One might think shampoo is that weird radical-ass-new-optimizer that promises wild performance. let me tell you something reassuring. shampoo (with blocks), is *generalization* of adam. That is, shampoo with specific hyperparameter is adam. It is logically impossible for.
0
0
4
@yacinelearning
Yacine Mahdid
2 days
the only bits of info we have from ilya is the godamn meme hairline merch?. THE MEME HAIRLINE MERCH?????.
5
0
30
@yacinelearning
Yacine Mahdid
2 days
that type of work for free whaaaaaat.
@andimarafioti
Andi Marafioti
2 days
Fuck it. Today, we open source FineVision: the finest curation of datasets for VLMs, over 200 sources!. > 20% improvement across 10 benchmarks.> 17M unique images.> 10B answer tokens.> New capabilities: GUI navigation, pointing, counting. FineVision 10x’s open-source VLMs.
Tweet media one
1
0
17
@yacinelearning
Yacine Mahdid
2 days
here it is folks: . btw we’ll cover muon-clip soon so do not despair.
0
3
16
@yacinelearning
Yacine Mahdid
2 days
here are 7min of semi-rough talk to beginners in deep learning feeling stuck right now. the tldr my guys is that deep learning is just a tool, you gotta figure out what you want to apply it to. but also be pragmatic about your current situation and the market environment . đź«‚
Tweet media one
12
16
383
@yacinelearning
Yacine Mahdid
2 days
RT @yacinelearning: @_rynkhn_ *inhale*
Tweet media one
0
1
0
@yacinelearning
Yacine Mahdid
2 days
RT @yacinelearning: *coffee spilling all over the table*.*barista look at us aghast*.*sirens in the distance*
Tweet media one
0
2
0
@yacinelearning
Yacine Mahdid
2 days
the one I hate the most is “we are only using 10% of our brain capacity”. my sweet dear child why would you want to have generalized seizure.
@kalomaze
kalomaze
2 days
there are few pop science neuroscience theories i hate more than "the brain is actually quantum", it's surface level, reddit-tier neil degrasse tyson fan kinda bullshit.there's no selection pressure for that kind of complexity, if anything, there was selection against it.
4
2
57
@yacinelearning
Yacine Mahdid
2 days
*coffee spilling all over the table*.*barista look at us aghast*.*sirens in the distance*
Tweet media one
3
2
26
@yacinelearning
Yacine Mahdid
2 days
I met a business analyst yesterday and he asked me what I thought the next frontier for LLMs is. you will never guess what I said.
5
0
39