Prof. Anima Anandkumar

@AnimaAnandkumar

Followers
33K
Following
6K
Media
171
Statuses
2K

Bren Professor @caltech, Time100, Fmr Sr Director of #AI research @nvidia Fmr Principal Scientist @awscloud

Joined May 2021
@AnimaAnandkumar
Prof. Anima Anandkumar
2 days
There is a mistaken notion that precision is sacrificed in UE8M0. That is not the case. You can retain the original accuracy even when training directly in that format, provided you use our Madam algorithm.
@jenzhuscott
Jen Zhu
3 days
Looking into DeepSeek X UE8M0, a radical FP8 format (all exponent, no mantissa). It trades precision for range & simpler hardware. This "software-first" move pushes domestic chips (Huawei, Cambricon) to adapt, accelerating China's integrated AI-semiconductor ecosystem strategy.
1
3
15
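For readers unfamiliar with the format discussed in the two posts above, here is a minimal sketch of an exponent-only 8-bit encoding. It assumes a bias of 127 and the code 0xFF reserved for NaN, as in the OCP Microscaling E8M0 scale type. The function names are hypothetical, and this is not DeepSeek's or NVIDIA's implementation.

```python
import math

E8M0_BIAS = 127   # assumed exponent bias (as in the OCP Microscaling E8M0 type)
E8M0_NAN = 0xFF   # assumed NaN code; there is no sign bit and no zero

def encode_ue8m0(x: float) -> int:
    """Round a positive float to the nearest power of two (in the log domain)."""
    if not (x > 0) or math.isinf(x):
        return E8M0_NAN  # zero, negatives, NaN and inf are not representable
    exponent = round(math.log2(x))
    exponent = max(-E8M0_BIAS, min(E8M0_BIAS, exponent))  # clamp to the code range
    return exponent + E8M0_BIAS

def decode_ue8m0(code: int) -> float:
    """Return the power of two that an 8-bit code stands for."""
    if code == E8M0_NAN:
        return math.nan
    return 2.0 ** (code - E8M0_BIAS)

def multiply_ue8m0(a: int, b: int) -> int:
    """Multiply two encoded values by adding their unbiased exponents."""
    if a == E8M0_NAN or b == E8M0_NAN:
        return E8M0_NAN
    exponent = (a - E8M0_BIAS) + (b - E8M0_BIAS)
    exponent = max(-E8M0_BIAS, min(E8M0_BIAS, exponent))
    return exponent + E8M0_BIAS

# With no mantissa bits, 3.0 can only be stored as a neighbouring power of two.
stored_three = decode_ue8m0(encode_ue8m0(3.0))
product = decode_ue8m0(multiply_ue8m0(encode_ue8m0(4.0), encode_ue8m0(0.5)))
```

The sketch shows both sides of the exchange: a single stored value carries only a power of two, while multiplication reduces to integer addition. Whether training accuracy is retained depends on the optimizer and scaling scheme, which this sketch does not model.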
@AnimaAnandkumar
Prof. Anima Anandkumar
5 days
RT @jiawzhao: Excited to see Logarithmic format (LNS, UE8M0 FP8) used in production by @deepseek_ai! LNS enables efficient multi (just addi….
0
7
0
@AnimaAnandkumar
Prof. Anima Anandkumar
5 days
It is interesting that the new @deepseek_ai v3.1 is trained using the UE8M0 FP8 scale data format, which is a logarithmic number system. Our multiplicative weights update (Madam) for training in that format was developed several years ago while at @nvidia. It yields maximum hardware
14
106
608
@AnimaAnandkumar
Prof. Anima Anandkumar
8 days
RT @Caltech_LHC: Excellent keynote lecture closing the first day of #ml4jets2025 @caltech by Prof. @AnimaAnandkumar 👏🏽 👏🏽 https://t.co/….
0
1
0
@AnimaAnandkumar
Prof. Anima Anandkumar
14 days
How do we build AI for science? Augment with AI or replace with AI? The popular prescription is to augment existing workflows with AI rather than replace them, e.g., keep the approximate numerical solver for simulations, and use AI only to correct its errors
2
35
194
@AnimaAnandkumar
Prof. Anima Anandkumar
15 days
Major update of LeanDojo: Lean + LLM for verified math reasoning. Lean4Code v1.0.0 - A specialized integrated development environment built as a fork of VS Code, designed specifically for Lean theorem proving. The IDE features automatic Lean installation,
0
19
85
@AnimaAnandkumar
Prof. Anima Anandkumar
19 days
RT @JeffDean: My longtime collaborator Dave Patterson (long-time faculty at @UCBerkeley, @TheOfficialACM Turing Award winner, and fellow @L….
0
124
0
@AnimaAnandkumar
Prof. Anima Anandkumar
22 days
RT @Azizzadenesheli: #NeuralOperators learn physics through data. We study long term prediction capability of #NeuralOperator on a hard t….
0
2
0
@AnimaAnandkumar
Prof. Anima Anandkumar
22 days
Excited to share our recently published paper in @WileyGlobal on "Ocean Emulation With Fourier Neural Operators: Double Gyre". We used Fourier Neural Operators to build the first high-resolution weather model, FourCastNet. Since it works so well for
0
2
17
@AnimaAnandkumar
Prof. Anima Anandkumar
23 days
My @MLSysConf keynote is now online. The scaling of large language models has led to impressive gains in language understanding, but at a cost of insatiable memory and bandwidth requirements. I advocated a principled approach of designing optimization.
4
20
101
@AnimaAnandkumar
Prof. Anima Anandkumar
26 days
RT @Azizzadenesheli: FALCON: built on centuries of knowledge from fluid dynamics, turbulent flows, control, RL, and ML, to deliver foundati….
0
1
0
@AnimaAnandkumar
Prof. Anima Anandkumar
28 days
RT @Caltech: To help capture the impact of Caltech research and provide information on the federal funding that helps support it, the Offic….
researchimpact.caltech.edu
0
7
0
@AnimaAnandkumar
Prof. Anima Anandkumar
30 days
RT @guohao_li: 🚨 [Call for Papers] SEA Workshop @ NeurIPS 2025 🚨 📅 December 6, 2025 | 📍 San Diego, USA 🌐: Environm….
0
19
0
@AnimaAnandkumar
Prof. Anima Anandkumar
1 month
RT @ShuiwangJi: Our 500+ page AI4Science paper is finally published: Artificial Intelligence for Science in Quantum, Atomistic, and Contin….
0
21
0
@AnimaAnandkumar
Prof. Anima Anandkumar
1 month
.@shoyer We admire neuralGCM and all the contributions you are making for AI+climate modeling. Social media doesn't allow for too much nuance - what I meant to say was FourCastNet 3 is unprecedented in offering competitive skill at 6-hour resolution, with probabilistic estimates.
@shoyer
Stephan Hoyer
1 month
@AnimaAnandkumar @nvidia @Caltech FourCastNet3 is very impressive, great work! It's certainly not unprecedented in terms of speed for probabilistic AI-weather prediction, though. E.g., NeuralGCM makes a 15 day probabilistic weather forecast in under 20 seconds.
1
0
9
@AnimaAnandkumar
Prof. Anima Anandkumar
1 month
I led the creation of the very first high-resolution AI-based weather model, FourCastNet, at @NVIDIA @Caltech in 2021. Instead of bottom-up physics-based weather forecasting, for the first time, we were able to show that AI-based models are accurate and tens of thousands of times
7
16
106
@AnimaAnandkumar
Prof. Anima Anandkumar
1 month
Tensors are all you need.
@goyal__pramod
Pramod Goyal
1 month
Einsum is all you need. I have said it before and I will say it again: if you truly understand how einsum works, you will never need to worry about reshape, permute, transpose, matmul, dot product and so much more
4
5
37
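As a small illustration of the einsum point in the quoted post, the sketch below expresses a transpose, a matrix product, a dot product, and a batched matrix product with `numpy.einsum`. The array names and shapes are made up for the example, and NumPy is an assumed choice of library; the post does not name one.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 4))     # hypothetical example arrays
B = rng.standard_normal((4, 5))
v = rng.standard_normal(4)
w = rng.standard_normal(4)
X = rng.standard_normal((2, 3, 4))  # a batch of two 3x4 matrices
Y = rng.standard_normal((2, 4, 5))  # a batch of two 4x5 matrices

# Transpose: swap the two index labels on the output side.
transposed = np.einsum("ij->ji", A)

# Matrix product: the repeated index k is summed over.
matmul = np.einsum("ik,kj->ij", A, B)

# Dot product: the shared index is summed and no output index remains.
dot = np.einsum("i,i->", v, w)

# Batched matrix product: b is carried through, k is summed over.
batched = np.einsum("bik,bkj->bij", X, Y)
```

Each subscript string names the axes of the inputs and the output; any index that appears in the inputs but not in the output is summed away.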
@AnimaAnandkumar
Prof. Anima Anandkumar
1 month
RT @BTolooshams: I am giving a talk on "Neural Operators and Biologically-informed Latent Embeddings for Foundation Models in NeuroAI" at t….
0
13
0