Zachary Huang

@ZacharyHuang12

Followers
4K
Following
853
Media
55
Statuses
923

Researcher @MSFTResearch AI Frontiers. LLM Agents and Systems. | PhD @ColumbiaCompSci | Prev: @GraySystemsLab @databricks | Fellowship: @GoogleAI | New YouTuber

Joined October 2019
@ZacharyHuang12
Zachary Huang
2 days
Just hit 10K subscribers on YouTube! 🎉 My early videos teach best practices for building LLM agents. Then I realized: why not automate making the videos themselves? So my latest videos are made by an LLM agent I built to teach various technical topics. BEST testimony for my
Tweet media one
7
0
20
@ZacharyHuang12
Zachary Huang
3 days
RT @Hesamation: A banger on how to publish the ultimate journal approved research paper in 2025:
Tweet media one
0
460
0
@ZacharyHuang12
Zachary Huang
5 days
RT @zoink: tell me again about how locked in you are
Tweet media one
0
782
0
@ZacharyHuang12
Zachary Huang
6 days
RT @quantbeckman: This is the reason why Renaissance hires PhDs to clean data 😬
Tweet media one
0
136
0
@ZacharyHuang12
Zachary Huang
11 days
RT @DimitrisPapail: Why is cross-entropy a good loss for language pretraining? caveat: this is all known btw; interestingly, even though t…
0
21
0
@ZacharyHuang12
Zachary Huang
13 days
Claude Code is working for me 24/7, even when I sleep. Here is the command I just sent to it. I'm doing some grunt work of data processing. It has already processed a few to my satisfaction. So I just ask it to keep grinding while I'm going to sleep.
Tweet media one
1
0
15
@ZacharyHuang12
Zachary Huang
14 days
RT @Guangxuan_Xiao: I've written the full story of Attention Sinks - a technical deep-dive into how the mechanism was developed and how our…
0
267
0
@ZacharyHuang12
Zachary Huang
20 days
I've switched to Claude Code from Cursor, but I bet the Gemini CLI will catch up. Here's my take: For coding agents, two things matter most: (a) Solid agent design (b) Generous token usage for a top-tier LLM. Everything else (codebase indexing, specialized models that applies.
7
1
21
@ZacharyHuang12
Zachary Huang
23 days
RT @joecarlsonshow: This is why I don't trim Microsoft.
0
24
0
@ZacharyHuang12
Zachary Huang
23 days
RT @random_walker: Many people have told us that this is one of the most memorable opening paragraphs they've come across in a book. 🙏 The…
0
41
0
@ZacharyHuang12
Zachary Huang
23 days
Tweet media one
0
801
0
@ZacharyHuang12
Zachary Huang
23 days
RT @techdroider: Google Cooked 🤫
0
236
0
@ZacharyHuang12
Zachary Huang
23 days
RT @Kangwook_Lee: @DimitrisPapail (financial) risks.
0
1
0
@ZacharyHuang12
Zachary Huang
24 days
RT @DimitrisPapail: We don't even understand a two layer perceptron.
0
81
0
@ZacharyHuang12
Zachary Huang
25 days
Just tried out Grok 4 after paying $30. Overall disappointed: coding's not bad but slightly worse than Gemini 2.5 and Claude 4. Doc writing feels terse and keeps running into weird access issues.
3
0
8
@ZacharyHuang12
Zachary Huang
25 days
RT @mayfer: why are they like this
Tweet media one
0
82
0
@ZacharyHuang12
Zachary Huang
25 days
RT @GithubProjects: $ git push origin master --force
Tweet media one
0
929
0
@ZacharyHuang12
Zachary Huang
25 days
RT @SimonXinDong: It turns out, > GRPO is performing the arithmetic mean --> token-level scaling. > GSPO is performing the geometric mean -…
0
68
0
@ZacharyHuang12
Zachary Huang
26 days
The video I posted 2 weeks ago:
0
0
2