Tu Vu @tuvllms X Profile

Tu Vu

@tuvllms

Followers

4K

Following

3K

Media

54

Statuses

1K

Research Scientist @GoogleDeepMind & Assistant Professor @VT_CS. PhD from @UMass_NLP. Google FLAMe/FreshLLMs/Flan-T5 Collection/SPoT #NLProc

https://t.co/NQb78ET2gx

California, USA

Joined April 2017

Tu Vu

@tuvllms

1 year

🚨 New @GoogleDeepMind paper 🚨 We trained Foundational Large Autorater Models (FLAMe) on extensive human evaluations, achieving the best RewardBench perf. among generative models trained solely on permissive data, surpassing both GPT-4 & 4o. 📰: https://t.co/FIPFiHwXyt 🧵:👇

30

102

566

Parallel Web Systems

@p0

5 days

The Task API makes real-world knowledge work possible for AI agents in production, changing the way teams, businesses, and someday the whole online world operate. Learn more: https://t.co/x14sBeQUXw

0

2

9

Hamish Ivison

@hamishivi

4 days

to continue the PipelineRL glazing, @finbarrtimbers implemented PipelineRL for open-instruct a little bit ago and it ended up being probably the single biggest speedup to our overall pipeline. We went from 2-week long RL runs to 5-day runs, without sacrificing performance

Rishabh Agarwal

@agarwl_

4 days

Don't sleep on PipelineRL -- this is one of the biggest jumps in compute efficiency of RL setups that we found in the ScaleRL paper (also validated by Magistral & others before)! What's the problem PipelineRL solves? In RL for LLMs, we need to send weight updates from trainer to

6

34

229

Heavybit

@heavybit

3 days

Having multiple agents under the hood may turn engineering into a multiplayer discipline that requires simultaneous human-in-the-loop oversight of AI agents. Thinking of experimenting with multi-agentic orchestration? https://t.co/AkUSMYZwfT

0

1

3

The Sanghani Center at Virginia Tech

@SanghaniCtrVT

3 days

@therealthapa One more @SanghaniCtrVT paper at #EMNLP2025: Efficient Model Development through Fine-tuning Transfer Main proceedings @linusdd44804 @Sub_RBala @tuvllms (all VT) w/@fyliufengyuan, @kandpal_nikhil https://t.co/OGQa84ots4

aclanthology.org

Pin-Jie Lin, Rishab Balasubramanian, Fengyuan Liu, Nikhil Kandpal, Tu Vu. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025.

0

3

2

Tu Vu

@tuvllms

3 days

Kimi K2 Thinking has achieved a new state-of-the-art result (56.3%) on our Seal-0 benchmark (https://t.co/Fb7YS97clt).

arxiv.org

We introduce SealQA, a new challenge benchmark for evaluating SEarch-Augmented Language models on fact-seeking questions where web search yields conflicting, noisy, or unhelpful results. SealQA...

0

2

Kimi.ai

@Kimi_Moonshot

4 days

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built

561

1K

10K

Tu Vu

@tuvllms

4 days

Our paper https://t.co/IyHkhCIFhZ

Tu Vu

@tuvllms

8 months

🚨 New paper 🚨 Excited to share my first paper w/ my PhD students!! We find that advanced LLM capabilities conferred by instruction or alignment tuning (e.g., SFT, RLHF, DPO, GRPO) can be encoded into model diff vectors (à la task vectors) and transferred across model

0

Tu Vu

@tuvllms

4 days

I am not at EMNLP this year, but my student @linusdd44804 will be presenting our paper on efficient model development through fine-tuning transfer. The presentation is tomorrow 2-3:30 pm, A109 (session 15). Please come talk to him!

Linus Pin-Jie Lin

@linusdd44804

4 days

I’ll be presenting our fine-tuning transfer paper tomorrow! TLDR: Alignment tuning effects can be captured as transferable model diff vectors — no need to fine-tune from scratch for every new base model version. Come find me: 🕑 14:00–15:30 📍 A109 (Session 15) #EMNLP2025

1

2

9

Kyle Lo

@kylelostat

4 days

why intern at Ai2? 🐟interns own major parts of our model development, sometimes even leading whole projects 🐡we're committed to open science & actively help our interns publish their work reach out if u wanna build open language models together 🤝 links👇

14

45

700

Rui-Jie (Ridger) Zhu

@RidgerZhu

11 days

Thrilled to release a new paper: “Scaling Latent Reasoning via Looped Language Models.” TLDR: We scale up looped language models to 2.6 billion parameters and pretrain them on > 7 trillion tokens. The resulting model is on par with SOTA language models 2 to 3x its size.

20

137

627

Tuhin Chakrabarty

@TuhinChakr

5 days

I am recruiting 1-2 PhD students to work on how Generative AI dilutes Creative Labor Markets / AI and Copyright Law / Proliferation of AI slop at Stony Brook Computer Science @sbucompsc starting fall 2026! Come join us :) We are not far from NYC 🗽 (1 hr train to Queens) 🧵

8

41

132

Songyou Peng

@songyoupeng

5 days

Our bigger group at Google DeepMind is hiring interns for next summer! If you are interested in working with us, apply through the link below and also email us. step 1: https://t.co/DUQlKgwVMG (US-based) step 2: send an email to gdm-ct-internships@google.com

Jon Barron

@jon_barron

5 days

Our intern applications are open, instructions are below.

6

44

498

Sebastian Gehrmann

@sebgehr

6 days

We are looking for excellent PhD students across many topics for our Bloomberg CTO AI Research internship next summer. Link to apply below.

6

27

203

Nathan Lambert

@natolambert

10 days

I'm convinced to try it asap, we should all try fp16, look at this plot man. FP16 is like perfect in error reduction. "This is precisely why switching to FP16 provides a fundamental solution. With its 10 mantissa bits, FP16 offers 8 times more precision (2^10 values vs. 2^7

25

43

659

Virginia Tech

@virginia_tech

10 days

Happy Halloween, #Hokies! 👻🧡✨

0

4

114

dr. jack morris

@jxmnop

12 days

useful heuristic: the Best Research Problems are (A) non-obvious enough that no one else will solve in the next six months, (B) important enough that someone would solve eventually. stressed? pick a less obvious problem. laboring in obscurity? pick a more important problem.

4

16

264

OpenAI

@OpenAI

11 days

3. Vietnam

38

49

606

Jay Alammar

@JayAlammar

13 days

An Illustrated Guide to AI Agents, with @MaartenGr First 2 chapters now in Early Release! Drafts of the first two chapters of An Illustrated Guide to AI Agents are now available in Early Release on the O'Reilly platform! These will take you through the central concepts of

12

118

799

Russ Salakhutdinov

@rsalakhu

11 days

New work on Rethinking Thinking Tokens: LLMs as Improvement Operators: https://t.co/w51uLvX5dG Reasoning training encourages LLMs to produce long chains of thought (CoT), improving accuracy via self-checking but increasing context length, compute cost, and latency. This work

11

48

384

The Sanghani Center at Virginia Tech

@SanghaniCtrVT

13 days

Full agenda at Amazon-@virginia_tech Initiative for Efficient & Robust Machine Learning daylong AI Workshop included Lightning Talks on research projects from VT faculty @mali_gulzar Chang Tien-Lu @MingJin80233626 @tuvllms followed by a photo op. @AmazonScience @SanghaniCtrVT

0

1

6