Nando de Freitas @NandoDF X Profile

Nando de Freitas

@NandoDF

Followers

107K

Following

20K

Media

402

Statuses

12K

MAI Superintelligence Team. Past: AlphaGo tuning, AlphaCode, Gato, ReST, learning to learn, awareness models, recurrent Gemma, Imagen3, Veo, Genie, MAI models

https://t.co/uOtKxiIIak

London, England

Joined April 2009

Don't wanna be here? Send us removal request.

Nando de Freitas

@NandoDF

7 days

The AI Lectures of @50cent are out!

50cent

@50cent

7 days

My doc is still #1 on Netflix in 51 countries, 😟you have a right to be upset about this. LOL @50CentAction247 • https://t.co/WtNREs3AKy

0

1

23

Nando de Freitas

@NandoDF

9 days

To choose freedom for oneself and for others is the noblest act of love. Every good parent knows this truth. Thank you @MariaCorinaYA for risking your life to remind us all of this.

The Nobel Prize

@NobelPrize

10 days

The 2025 peace laureate Maria Corina Machado arrived safely in Oslo, Norway in the early morning of 11 December. It was the first time in two years that she was able to embrace her daughter Ana (depicted in the first image) and the rest of her family. She was welcomed by an

4

3

34

Mustafa Suleyman

@mustafasuleyman

11 days

We just dropped what we believe is the world's largest study of AI conversations + it found what you talk to AI about has a lot to do with what time it is. @MicrosoftAI researchers found 3 different trends by day, time, and month and 1 rock solid constant

microsoft.ai

At MAI, we don’t just build AI tools, we care about how real people interact with them.

34

76

512

David Galbraith

@daveg

10 days

That @demishassabis decided to remain in the UK rather than move to the US was the single most important event in the UK's future. An entire ecosystem is crystallizing around him.

Julia Willemyns

@jujulemons

10 days

imo this is a HUGE deal Google DeepMind and the UK govt have just signed a partnership w/ 3 pillars: 1. transforming public services 2. accelerating scientific discovery 3. advancing AI security & resilience Concrete bits that matter: - a Gemini model trained on the UK

52

195

3K

Nordace

@NordaceOfficial

26 days

I love this crossbody bag. It is large enough and small enough at the same time. It holds my full size wallet and sunglasses along with anything else that I might need. It isn’t bulky or heavy. It also fits in the Siena Rivera Tote.

2

8

76

Nando de Freitas

@NandoDF

9 days

Reflection on @dwarkesh_sp's reflections on his interview with Rich Sutton, and why LLMs are exciting because they are Skinnerian, Popperian and Gregorian creatures. Minds with finite capacity cannot adapt forever without having to forget previous knowledge. This is true of

dwarkesh.com

Watch now (66 mins) | LLMs aren’t Bitter-Lesson-pilled

11

26

176

Nando de Freitas

@NandoDF

9 days

One of the bravest and smartest leaders in the world, @MariaCorinaYA. May Venezuelans be united with their loved ones soon 💛💙❤️ Freedom is a choice, an act of love.

Sky News

@SkyNews

10 days

"I am very hopeful Venezuela will be free" Nobel Peace Prize winner Maria Corina Machado delivers an address in Oslo after making her return to the public eye following months in hiding https://t.co/DdKUfyLhbo 📺 Sky 501, Virgin 602, Freeview 233 and YouTube

3

1

14

Nathan Lambert

@natolambert

15 days

Good researchers obsess over evals The story of Olmo 3 (post-training), told through evals NeurIPS Talk tomorrow. Upper Level Room 2, 10:35AM.

12

48

597

Nando de Freitas

@NandoDF

14 days

Some really interesting comments below, but the question is still open and requires investigation. I hope a few students pick it up. I liked the discussions on context length generalisation, the fact that we typically train these models as bandits (even when we do RL, which is

2

1

15

Nando de Freitas

@NandoDF

15 days

Why is it that with ChatGPT, Gemini, Claude, Copilot and other LLMs we have to always start new chats for them to work well? What is the scientific explanation? What are the hypotheses? What is the evidence for each?

116

14

210

Information is Beautiful

@infobeautiful

16 days

Kinda amazing how we can model this. The tectonic movement of the landmasses over 1,000 million years until today (by @nytimes)

87

615

3K

Tencent HY

@TencentHunyuan

16 days

Tencent HY 2.0 is officially released. We are rolling out a major performance upgrade to our foundation model, now available via Tencent Cloud API. Built on a Mixture-of-Experts (MoE) architecture (406B total, 32B active parameters) and featuring a 256K context window, HY 2.0

36

53

381

alphaXiv

@askalphaxiv

17 days

New paper from Qwen team! They showed that because token-level updates are just fragile approximations of sequence rewards, you must use Routing Replay and Clipping to minimize the gap between training & inference for stable RL training in LLMs now trending on AlphaXiv 📈

7

51

427

BURKOV

@burkov

17 days

NeurIPS 2025 Best Paper Award: Attention lets language models decide which tokens matter at each position, but it has limitations—for example, a tendency to over-focus on early tokens regardless of their relevance. Gating mechanisms, which selectively suppress or amplify

16

185

1K

Nando de Freitas

@NandoDF

16 days

I’m looking forward to reading this paper this weekend! @jaseweston and @j_foerst are as creative as it gets.

Jason Weston

@jaseweston

16 days

🤝 New Position Paper !!👤🔄🤖 @j_foerst and I wrote a position piece on what we think is the path to safer superintelligence: co-improvement. Everyone is focused on self-improving AI, but (1) we don't know how to do it yet, and (2) it might be misaligned with humans.

3

10

115

Jason Weston

@jaseweston

16 days

🤝 New Position Paper !!👤🔄🤖 @j_foerst and I wrote a position piece on what we think is the path to safer superintelligence: co-improvement. Everyone is focused on self-improving AI, but (1) we don't know how to do it yet, and (2) it might be misaligned with humans.

25

93

503

Nando de Freitas

@NandoDF

16 days

If you’re into triage and AI to improve healthcare, please check out my brother (⁦@jdefreit⁩) and nephew’s startup.

goclinic.ai

Leading MSK clinics gain 15+ minutes per assessment with GoClinic AI. 90% of first sessions can now include treatment.

2

9

Sebastian Raschka

@rasbt

18 days

This interesting week started with DeepSeek V3.2! I just wrote up a technical tour of the predecessors and components that led up to this: 🔗 https://t.co/JSAd9cx2s6 - Multi-Head Latent Attention - RLVR - Sparse Attention - Self-Verification - GRPO Updates

33

239

1K

Nando de Freitas

@NandoDF

17 days

Amazing to see AI tools empowering more people to solve complex tasks 👏

Satya Nadella

@satyanadella

17 days

With the Excel World Championship underway, I decided to take the M365 Copilot digital challenge. I’m no World Champ… but thanks to Agent Mode, I held my own!

0

6

Satya Nadella

@satyanadella

17 days

With the Excel World Championship underway, I decided to take the M365 Copilot digital challenge. I’m no World Champ… but thanks to Agent Mode, I held my own!

248

365

3K

alphaXiv

@askalphaxiv

20 days

DeepSeek just announced DeepSeek-V3.2 2 open weights LLMs that go head-to-head with SoTA like GPT-5 high, Claude 4.5 Sonnet and Gemini 3.0 Pro With the highlight being their new DeepSeek Sparse Attention SoTA intelligence is now available for everyone to download

2

31

197