NandoDF Profile Banner
Nando de Freitas Profile
Nando de Freitas

@NandoDF

Followers
107K
Following
20K
Media
402
Statuses
12K

MAI Superintelligence Team. Past: AlphaGo tuning, AlphaCode, Gato, ReST, learning to learn, awareness models, recurrent Gemma, Imagen3, Veo, Genie, MAI models

London, England
Joined April 2009
Don't wanna be here? Send us removal request.
@NandoDF
Nando de Freitas
7 days
The AI Lectures of @50cent are out!
@50cent
50cent
7 days
My doc is still #1 on Netflix in 51 countries, 😟you have a right to be upset about this. LOL @50CentAction247 https://t.co/WtNREs3AKy
0
1
23
@NandoDF
Nando de Freitas
9 days
To choose freedom for oneself and for others is the noblest act of love. Every good parent knows this truth. Thank you @MariaCorinaYA for risking your life to remind us all of this.
@NobelPrize
The Nobel Prize
10 days
The 2025 peace laureate Maria Corina Machado arrived safely in Oslo, Norway in the early morning of 11 December. It was the first time in two years that she was able to embrace her daughter Ana (depicted in the first image) and the rest of her family. She was welcomed by an
4
3
34
@mustafasuleyman
Mustafa Suleyman
11 days
We just dropped what we believe is the world's largest study of AI conversations + it found what you talk to AI about has a lot to do with what time it is. @MicrosoftAI researchers found 3 different trends by day, time, and month and 1 rock solid constant
Tweet card summary image
microsoft.ai
At MAI, we don’t just build AI tools, we care about how real people interact with them.
34
76
512
@daveg
David Galbraith
10 days
That @demishassabis decided to remain in the UK rather than move to the US was the single most important event in the UK's future. An entire ecosystem is crystallizing around him.
@jujulemons
Julia Willemyns
10 days
imo this is a HUGE deal Google DeepMind and the UK govt have just signed a partnership w/ 3 pillars: 1. transforming public services 2. accelerating scientific discovery 3. advancing AI security & resilience Concrete bits that matter: - a Gemini model trained on the UK
52
195
3K
@NordaceOfficial
Nordace
26 days
I love this crossbody bag. It is large enough and small enough at the same time. It holds my full size wallet and sunglasses along with anything else that I might need. It isn’t bulky or heavy. It also fits in the Siena Rivera Tote.
2
8
76
@NandoDF
Nando de Freitas
9 days
Reflection on @dwarkesh_sp's reflections on his interview with Rich Sutton, and why LLMs are exciting because they are Skinnerian, Popperian and Gregorian creatures. Minds with finite capacity cannot adapt forever without having to forget previous knowledge. This is true of
Tweet card summary image
dwarkesh.com
Watch now (66 mins) | LLMs aren’t Bitter-Lesson-pilled
11
26
176
@NandoDF
Nando de Freitas
9 days
One of the bravest and smartest leaders in the world, @MariaCorinaYA. May Venezuelans be united with their loved ones soon 💛💙❤️ Freedom is a choice, an act of love.
@SkyNews
Sky News
10 days
"I am very hopeful Venezuela will be free" Nobel Peace Prize winner Maria Corina Machado delivers an address in Oslo after making her return to the public eye following months in hiding https://t.co/DdKUfyLhbo 📺 Sky 501, Virgin 602, Freeview 233 and YouTube
3
1
14
@natolambert
Nathan Lambert
15 days
Good researchers obsess over evals The story of Olmo 3 (post-training), told through evals NeurIPS Talk tomorrow. Upper Level Room 2, 10:35AM.
12
48
597
@NandoDF
Nando de Freitas
14 days
Some really interesting comments below, but the question is still open and requires investigation. I hope a few students pick it up. I liked the discussions on context length generalisation, the fact that we typically train these models as bandits (even when we do RL, which is
2
1
15
@NandoDF
Nando de Freitas
15 days
Why is it that with ChatGPT, Gemini, Claude, Copilot and other LLMs we have to always start new chats for them to work well? What is the scientific explanation? What are the hypotheses? What is the evidence for each?
116
14
210
@infobeautiful
Information is Beautiful
16 days
Kinda amazing how we can model this. The tectonic movement of the landmasses over 1,000 million years until today (by @nytimes)
87
615
3K
@TencentHunyuan
Tencent HY
16 days
Tencent HY 2.0 is officially released. We are rolling out a major performance upgrade to our foundation model, now available via Tencent Cloud API. Built on a Mixture-of-Experts (MoE) architecture (406B total, 32B active parameters) and featuring a 256K context window, HY 2.0
36
53
381
@askalphaxiv
alphaXiv
17 days
New paper from Qwen team! They showed that because token-level updates are just fragile approximations of sequence rewards, you must use Routing Replay and Clipping to minimize the gap between training & inference for stable RL training in LLMs now trending on AlphaXiv 📈
7
51
427
@burkov
BURKOV
17 days
NeurIPS 2025 Best Paper Award: Attention lets language models decide which tokens matter at each position, but it has limitations—for example, a tendency to over-focus on early tokens regardless of their relevance. Gating mechanisms, which selectively suppress or amplify
16
185
1K
@NandoDF
Nando de Freitas
16 days
I’m looking forward to reading this paper this weekend! @jaseweston and @j_foerst are as creative as it gets.
@jaseweston
Jason Weston
16 days
🤝 New Position Paper !!👤🔄🤖 @j_foerst and I wrote a position piece on what we think is the path to safer superintelligence: co-improvement. Everyone is focused on self-improving AI, but (1) we don't know how to do it yet, and (2) it might be misaligned with humans.
3
10
115
@jaseweston
Jason Weston
16 days
🤝 New Position Paper !!👤🔄🤖 @j_foerst and I wrote a position piece on what we think is the path to safer superintelligence: co-improvement. Everyone is focused on self-improving AI, but (1) we don't know how to do it yet, and (2) it might be misaligned with humans.
25
93
503
@NandoDF
Nando de Freitas
16 days
If you’re into triage and AI to improve healthcare, please check out my brother (⁦@jdefreit⁩) and nephew’s startup.
Tweet card summary image
goclinic.ai
Leading MSK clinics gain 15+ minutes per assessment with GoClinic AI. 90% of first sessions can now include treatment.
2
2
9
@rasbt
Sebastian Raschka
18 days
This interesting week started with DeepSeek V3.2! I just wrote up a technical tour of the predecessors and components that led up to this: 🔗 https://t.co/JSAd9cx2s6 - Multi-Head Latent Attention - RLVR - Sparse Attention - Self-Verification - GRPO Updates
33
239
1K
@NandoDF
Nando de Freitas
17 days
Amazing to see AI tools empowering more people to solve complex tasks 👏
@satyanadella
Satya Nadella
17 days
With the Excel World Championship underway, I decided to take the M365 Copilot digital challenge. I’m no World Champ… but thanks to Agent Mode, I held my own!
0
0
6
@satyanadella
Satya Nadella
17 days
With the Excel World Championship underway, I decided to take the M365 Copilot digital challenge. I’m no World Champ… but thanks to Agent Mode, I held my own!
248
365
3K
@askalphaxiv
alphaXiv
20 days
DeepSeek just announced DeepSeek-V3.2 2 open weights LLMs that go head-to-head with SoTA like GPT-5 high, Claude 4.5 Sonnet and Gemini 3.0 Pro With the highlight being their new DeepSeek Sparse Attention SoTA intelligence is now available for everyone to download
2
31
197