infoxiao Profile Banner
Xiao Ma Profile
Xiao Ma

@infoxiao

Followers
12K
Following
15K
Media
340
Statuses
4K

gemini post-training @googledeepmind. views are mine.

New York
Joined August 2014
Don't wanna be here? Send us removal request.
@infoxiao
Xiao Ma
4 months
🎨 Gemini 2.5 tech report just dropped! So proud to have led the development of RL*F (Reinforcement Learning from Human and Critic Feedback) - our breakthrough in AI training inspired by... art school crits? Here's the thing: How do you teach taste? Style? Things without clear
13
32
356
@johnschulman2
John Schulman
22 hours
We're happy to support the Human Centered LLMs course, on topics close to our hearts. We'd like to support more classes with free credits for students to use on assignments and projects. If you're an instructor interested in using Tinker in your course, please reach out to
@Diyi_Yang
Diyi Yang
1 day
Thanks @thinkymachines for supporting Tinker access for our CS329x students on Homework 2 😉
11
45
519
@THINCfdn
THINC Foundation
2 days
Concerned about biases and politics in your children's education? Be an advocate with these 6 simple steps. Follow THINC to learn more about what you can do as a parent.
0
1
5
@infoxiao
Xiao Ma
24 hours
code should be beautiful
13
0
70
@johnschulman2
John Schulman
2 days
Fine-tuning APIs are becoming more powerful and widespread, but they're harder to safeguard against misuse than fixed-weight sampling APIs. Excited to share a new paper: Detecting Adversarial Fine-tuning with Auditing Agents ( https://t.co/NqMeGSCQIF). Auditing agents search
Tweet card summary image
arxiv.org
Large Language Model (LLM) providers expose fine-tuning APIs that let end users fine-tune their frontier LLMs. Unfortunately, it has been shown that an adversary with fine-tuning access to an LLM...
10
45
449
@infoxiao
Xiao Ma
1 day
it is underrated how funny @karpathy is
@karpathy
Andrej Karpathy
1 day
@LucasAtkins7 This code is extremely dangerous. Here, I improved it.
13
21
936
@Polymarket
Polymarket
8 hours
We're honored to be the Exclusive Prediction Market Partner for the fastest growing sports league in the world. Polymarket is proud to be partnering with the PPA & MLP to bring pickleball polymarkets to the masses.
55
20
196
@karpathy
Andrej Karpathy
1 day
@LucasAtkins7 This code is extremely dangerous. Here, I improved it.
176
136
5K
@infoxiao
Xiao Ma
2 days
that was way too fast
@brian_lovin
Brian Lovin
3 days
Claude Code on iOS! Finally!
1
0
4
@infoxiao
Xiao Ma
3 days
the way i thought this is the new @AnthropicAI ad
@tim_cook
Tim Cook
3 days
It all begins with a great idea.
0
0
11
@infoxiao
Xiao Ma
4 days
so hm how many aura points do i lose if I implement a gmail 'unsubscribe' mcp by just marking it as spam
3
0
6
@DrCHuber
Dr. Colleen Huber
2 days
Even one of the top imaging companies in the US admits how much better ultrasound is than mammography for breast imaging. What is not said is the radiation risk of mammograms is worse than the benign sound waves of U.S. I never Rx'd mammograms, choosing ultrasounds instead.
3
14
73
@infoxiao
Xiao Ma
4 days
i've had enough
1
0
5
@infoxiao
Xiao Ma
4 days
these pilate girls must have minds of steel
4
1
16
@infoxiao
Xiao Ma
4 days
The ultimate test of code maintainability? Let an agent run free and see if it descends into unmanageable chaos.
0
1
4
@infoxiao
Xiao Ma
5 days
galaxy brain is realizing you can use cli to access reminders https://t.co/WXFmtEnyJe and give all the coding clis access to it 🤯
Tweet card summary image
github.com
A simple CLI for interacting with macOS reminders. Contribute to keith/reminders-cli development by creating an account on GitHub.
0
0
9
@RhysSullivan
Rhys
6 days
no one: absolutely no one: literally no one on the planet earth: claude:
171
44
1K
@PRDT_Finance
PRDT | Predictions
2 days
Mark your calendars. After 4 years of building. $PRDT launches November 1st, 2025 - 12PM CET. Let’s make history together. 💚
266
327
1K
@infoxiao
Xiao Ma
6 days
very funny that when we watched matrix before: fiction and now: product roadmap
0
0
9
@infoxiao
Xiao Ma
6 days
out: context switching in: agent switching more in: skill maxing
@johnjhorton
John Horton
6 days
The number of coding agent's you can manage at one time, productively, is your Ma number. It is decreed. I'm at 2 - maybe 3 :(
1
0
3
@infoxiao
Xiao Ma
6 days
@johnjhorton new study when
0
0
2
@infoxiao
Xiao Ma
6 days
Dunbar's number: but for the number of agents you can manage at the same time. can we name it the Ma's number
1
0
9
@CarShield
CarShield
13 days
That strange noise under the hood? You don’t have to ignore it anymore.
0
1
18
@barry_zyj
Barry Zhang
7 days
Let me pitch you Skills: - They allow anyone to customize agents with a simple primitive — files - They give us continuous learning until continuous learning arrives - They are powerful, composable, and sufficiently AGI-pilled - you can now say “probably a skill issue” at work
@claudeai
Claude
7 days
Claude can now use Skills. Skills are packaged instructions that teach Claude your way of working.
49
70
1K
@infoxiao
Xiao Ma
7 days
😂 not me using the exact clip at work
@alexalbert__
Alex Albert
7 days
At a high level, the best analogy I've heard for Skills is something like Neo learning Kung Fu in seconds in the Matrix. We're "loading in" specialized knowledge to our general agents at runtime.
0
0
9
@askalphaxiv
alphaXiv
8 days
Introducing NotebookLM for arXiv papers 🚀 Transform dense AI research into an engaging conversation With context across thousands of related papers, it captures motivations, draws connections to SOTA, and explains key insights like a professor who's read the entire field
57
529
3K