πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau Profile
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau

@DBahdanau

Followers
9K
Following
422
Media
22
Statuses
484

Team member at something young. Adjunct Prof @ McGill. Member of Mila, Quebec AI Institute. Stream of consciousness is my own.

Joined August 2017
Don't wanna be here? Send us removal request.
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
2 days
At this point I view NeurIPS as a mix of IMO, IOI, school essay context and debate club. Just a ranking mechanism to separate allegedly smarter kids from allegedly less smart. The only problem is the smartest kids just skip the whole ordeal and join fun startups instead.
6
3
243
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
2 days
. we wanted flying cars and we will get dumb generations.
@mrpxssy
mr pussy
4 days
AI companies are literally being kept afloat by cheating students.
0
0
3
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
11 days
what's is the scheduler that you use on top of Kubernetes in your small startup? do you like it? paid options with great support are especially welcome.
4
0
6
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
17 days
Did you know that nature chooses the microstate with a softmax layer?.
1
0
7
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
20 days
when you book a hotel near the office in SF
Tweet media one
0
0
10
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
21 days
Great comment. LLMs feel like a 1000 year old robot who worked all jobs, talked to everyone, learned everything, and yet can't cross the uncanny valley of actually *getting* what I want, having agency, being creative. But OTOH that's enough to shake the economy.
@ErnestRyu
Ernest Ryu
22 days
7. However, LLMs will become exceedingly powerful for problems that *someone* knows how to solve (in-distribution, in training data). In math research, you combine existing techniques with new creative ideas. LLMs will significantly accelerate the former part. (7/10).
0
0
17
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
23 days
l love my new job!.
8
1
73
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
26 days
RT @GabrielHuang9: As #ICML2025 kicks off in Vancouver, our AI talent is being quietly pushed out. πŸ‡¨πŸ‡¦. We've been waiting 28 months for per….
0
10
0
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
28 days
There's one idea as old as humanity. If I am in control, I can make it better. Hence, I should fight for control and win. At any cost. Ends justify the means. And this is how people with big egos abandon all principles and become evil.
7
11
148
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
1 month
So long, @ServiceNowRSRCH ! It's been great 4 years. I look forward to cheer for more great open-source AI releases from the talented ServiceNow AI people!. I will tell you what's next in due time πŸ˜‰
Tweet media one
8
1
151
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
1 month
so nice to have a few actual Scientists in our community that respect empirical results even when they are go counter their intuition!.
@boazbaraktcs
Boaz Barak
1 month
This study surprised me! The conclusion is opposite to what I would expect. It is tempting to try to find a reason it's bogus but I think it's well executed and solid work. As the authors say, there are a number of potential caveats for this setting that may not generalize.
0
0
6
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
1 month
exactly what I feel about using AI to make advanced modifications in RL code. a lot of busy and chaotic activity but slower than thinking carefully on one's own.
@METR_Evals
METR
1 month
We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.
Tweet media one
1
0
10
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
2 months
knowledge for the sake of knowledge is useless. we need knowledge that informs action and creates impact. scaling up NeurIPS to 20K submissions will only further lower the ratio of impactful knowledge in the proceedings. you can guess, I'm reviewing now. .
5
3
65
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
2 months
A lot of great ideas on how to remedy training instabilities in @MiniMax__AI tech report. Check it out!.
@MiniMax__AI
MiniMax (official)
2 months
Day 1/5 of #MiniMaxWeek: We’re open-sourcing MiniMax-M1, our latest LLM β€” setting new standards in long-context reasoning. - World’s longest context window: 1M-token input, 80k-token output.- State-of-the-art agentic use among open-source models.- RL at unmatched efficiency:
Tweet media one
0
0
21
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
3 months
finally a voice of reason about mechanistic interpretability!.
@DanHendrycks
Dan Hendrycks
3 months
I wrote about why efforts to understand the inner workings of AI keep falling short.
0
1
17
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
3 months
200%. grit grit grit and 50 w&b curves and everything will work eventually.
@_arohan_
rohan anil
3 months
Someone passed this wisdom to me today. Deep learning techniques working vs not working is two devils .- your prior about the technique.- your attention to details about implementation of the technique . Need both to make it work.
3
0
16
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
3 months
ask Claude 3.7 if your code has any obvious bugs. watch it invent issues in your code when there aren't any in 90% cases, leading you astray. make your judgement about how soon decent software engineers become useless.
3
1
34
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
3 months
I really don't care about reviewers' opinion. But last few weeks before NeurIPS is the only time when it is socially acceptable to neglect almost all your responsibilities and focus on research. πŸ€ͺπŸ€ͺπŸ€ͺ.
0
0
16
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
3 months
I'm embarassed to admit that I have just grokked how amazing Python coroutines and asyncio are. I want to rewrite every single piece of code with threads I have every written! . But the learning curve is steep. This great blog opened my eyes:
10
36
367
@DBahdanau
πŸ‡ΊπŸ‡¦ Dzmitry Bahdanau
3 months
nicely done, team!!.
@tscholak
Torsten Scholak
3 months
🚨🀯 Today Jensen Huang announced SLAM Lab's newest model on the @HelloKnowledge stage: Apriel‑Nemotron‑15B‑Thinker 🚨.A lean, mean reasoning machine punching way above its weight class πŸ‘Š.Built by SLAM Γ— NVIDIA. Smaller models, bigger impact. πŸ§΅πŸ‘‡
0
0
10