Yizhong Wang @yizhongwyz X Profile

Yizhong Wang

@yizhongwyz

Followers

5K

Following

6K

Media

27

Statuses

704

Incoming assistant professor @UTCompSci, RS @BytedanceTalk, PhD from @uwcse, formerly @allen_ai @AIatMeta @MSFTResearch

Seattle

Joined April 2015

Don't wanna be here? Send us removal request.

Yizhong Wang

@yizhongwyz

1 month

Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! . I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! 🤠🤘

101

54

663

Yizhong Wang

@yizhongwyz

3 minutes

RT @valentina__py: 💡Beyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR. But the set of….

0

32

0

Yizhong Wang

@yizhongwyz

17 days

RT @ZEYULIU10: LLMs trained to memorize new facts can’t use those facts well.🤔. We apply a hypernetwork to ✏️edit✏️ the gradients for fact….

0

61

0

Yizhong Wang

@yizhongwyz

23 days

RT @jcqln_h: LMs often output answers that sound right but aren’t supported by input context. This is intrinsic hallucination: the generati….

0

18

0

Yizhong Wang

@yizhongwyz

29 days

RT @liujc1998: We enabled OLMoTrace for Tülu 3 models! 🤠. Matched spans are shorter than for OLMo models, bc we can only search in Tülu's p….

0

12

0

Yizhong Wang

@yizhongwyz

29 days

CONGRATS 🎉!.

Allen School

@uwcse

29 days

Congratulations to @UW #UWAllen Ph.D. grads @sharma_ashish_2 & @sewon__min, @TheOfficialACM Doctoral Dissertation Award honorees! Sharma won for #AI tools for mental health; Min received honorable mention for efficient, flexible language models. #ThisIsUW

1

0

11

Yizhong Wang

@yizhongwyz

1 month

RT @saumyamalik44: I’m thrilled to share RewardBench 2 📊— We created a new multi-domain reward model evaluation that is substantially harde….

0

49

0

Yizhong Wang

@yizhongwyz

1 month

RT @StellaLisy: 🤯 We cracked RLVR with. Random Rewards?!.Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:.- Rando….

0

338

0

Yizhong Wang

@yizhongwyz

1 month

RT @percyliang: What would truly open-source AI look like? Not just open weights, open code/data, but *open development*, where the entire….

0

193

0

Yizhong Wang

@yizhongwyz

2 months

RT @seungonekim: Glad to share that our AgoraBench paper has been accepted at @aclmeeting 2025 (main)! Special thanks to our coauthors @sco….

0

10

0

Yizhong Wang

@yizhongwyz

3 months

RT @goncalorafaria: Introducing 𝗤𝗔𝗹𝗶𝗴𝗻🚀, a 𝘁𝗲𝘀𝘁-𝘁𝗶𝗺𝗲 𝗮𝗹𝗶𝗴𝗻𝗺𝗲𝗻𝘁 𝗺𝗲𝘁𝗵𝗼𝗱 that improves language model performance using Markov chain Monte Car….

0

33

0

Yizhong Wang

@yizhongwyz

3 months

RT @liujc1998: Today we're unveiling OLMoTrace, a tool that enables everyone to understand the outputs of LLMs by connecting to their train….

0

46

0

Yizhong Wang

@yizhongwyz

3 months

RT @taoyds: 🚀After a year of development based on our OSWorld, Computer Use Agent Arena is LIVE! . Test top AI agents (Operator, Claude 3.7….

0

19

0

Yizhong Wang

@yizhongwyz

3 months

RT @BowenWangNLP: 🎮 Computer Use Agent Arena is LIVE! 🚀.🔥 Easiest way to test computer-use agents in the wild without any setup.🌟 Compare t….

0

104

0

Yizhong Wang

@yizhongwyz

3 months

RT @alisawuffles: We created SuperBPE🚀, a *superword* tokenizer that includes tokens spanning multiple words. When pretraining at 8B scale….

0

320

0

Yizhong Wang

@yizhongwyz

4 months

As the evaluation of LMs becomes increasingly complex and broad, how can we draw insights beyond simple metrics? Check out our latest work led by @ZhiyuanZeng_ !.

Zhiyuan Zeng

@ZhiyuanZeng_

4 months

Is a single accuracy number all we can get from model evals?🤔.🚨Does NOT tell where the model fails.🚨Does NOT tell how to improve it. Introducing EvalTree🌳.🔍identifying LM weaknesses in natural language.🚀weaknesses serve as actionable guidance. (paper&demo 🔗in🧵). [1/n]

0

1

11

Yizhong Wang

@yizhongwyz

4 months

RT @allen_ai: Announcing OLMo 2 32B: the first fully open model to beat GPT 3.5 & GPT-4o mini on a suite of popular, multi-skill benchmarks….

0

161

0

Yizhong Wang

@yizhongwyz

4 months

RT @HannaHajishirzi: Excited to drive innovation and push the boundaries of open, scientific AI research & development! 🚀 Join us at @allen….

0

15

0

Yizhong Wang

@yizhongwyz

4 months

RT @_awettig: 🤔 Ever wondered how prevalent some type of web content is during LM pre-training?. In our new paper, we propose WebOrganizer….

0

49

0

Yizhong Wang

@yizhongwyz

5 months

RT @thinkymachines: Today, we are excited to announce Thinking Machines Lab (, an artificial intelligence research….

0

513

0