yizhongwyz Profile Banner
Yizhong Wang Profile
Yizhong Wang

@yizhongwyz

Followers
5K
Following
6K
Media
27
Statuses
704

Incoming assistant professor @UTCompSci, RS @BytedanceTalk, PhD from @uwcse, formerly @allen_ai @AIatMeta @MSFTResearch

Seattle
Joined April 2015
Don't wanna be here? Send us removal request.
@yizhongwyz
Yizhong Wang
1 month
Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! . I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! 🤠🤘
Tweet media one
Tweet media two
101
54
663
@yizhongwyz
Yizhong Wang
3 minutes
RT @valentina__py: 💡Beyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR. But the set of….
0
32
0
@yizhongwyz
Yizhong Wang
17 days
RT @ZEYULIU10: LLMs trained to memorize new facts can’t use those facts well.🤔. We apply a hypernetwork to ✏️edit✏️ the gradients for fact….
0
61
0
@yizhongwyz
Yizhong Wang
23 days
RT @jcqln_h: LMs often output answers that sound right but aren’t supported by input context. This is intrinsic hallucination: the generati….
0
18
0
@yizhongwyz
Yizhong Wang
29 days
RT @liujc1998: We enabled OLMoTrace for Tülu 3 models! 🤠. Matched spans are shorter than for OLMo models, bc we can only search in Tülu's p….
0
12
0
@yizhongwyz
Yizhong Wang
29 days
CONGRATS 🎉!.
@uwcse
Allen School
29 days
Congratulations to @UW #UWAllen Ph.D. grads @sharma_ashish_2 & @sewon__min, @TheOfficialACM Doctoral Dissertation Award honorees! Sharma won for #AI tools for mental health; Min received honorable mention for efficient, flexible language models. #ThisIsUW
1
0
11
@yizhongwyz
Yizhong Wang
1 month
RT @saumyamalik44: I’m thrilled to share RewardBench 2 📊— We created a new multi-domain reward model evaluation that is substantially harde….
0
49
0
@yizhongwyz
Yizhong Wang
1 month
RT @StellaLisy: 🤯 We cracked RLVR with. Random Rewards?!.Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:.- Rando….
0
338
0
@yizhongwyz
Yizhong Wang
1 month
RT @percyliang: What would truly open-source AI look like? Not just open weights, open code/data, but *open development*, where the entire….
0
193
0
@yizhongwyz
Yizhong Wang
2 months
RT @seungonekim: Glad to share that our AgoraBench paper has been accepted at @aclmeeting 2025 (main)! Special thanks to our coauthors @sco….
0
10
0
@yizhongwyz
Yizhong Wang
3 months
RT @goncalorafaria: Introducing 𝗤𝗔𝗹𝗶𝗴𝗻🚀, a 𝘁𝗲𝘀𝘁-𝘁𝗶𝗺𝗲 𝗮𝗹𝗶𝗴𝗻𝗺𝗲𝗻𝘁 𝗺𝗲𝘁𝗵𝗼𝗱 that improves language model performance using Markov chain Monte Car….
0
33
0
@yizhongwyz
Yizhong Wang
3 months
RT @liujc1998: Today we're unveiling OLMoTrace, a tool that enables everyone to understand the outputs of LLMs by connecting to their train….
0
46
0
@yizhongwyz
Yizhong Wang
3 months
RT @taoyds: 🚀After a year of development based on our OSWorld, Computer Use Agent Arena is LIVE! . Test top AI agents (Operator, Claude 3.7….
0
19
0
@yizhongwyz
Yizhong Wang
3 months
RT @BowenWangNLP: 🎮 Computer Use Agent Arena is LIVE! 🚀.🔥 Easiest way to test computer-use agents in the wild without any setup.🌟 Compare t….
0
104
0
@yizhongwyz
Yizhong Wang
3 months
RT @alisawuffles: We created SuperBPE🚀, a *superword* tokenizer that includes tokens spanning multiple words. When pretraining at 8B scale….
0
320
0
@yizhongwyz
Yizhong Wang
4 months
As the evaluation of LMs becomes increasingly complex and broad, how can we draw insights beyond simple metrics? Check out our latest work led by @ZhiyuanZeng_ !.
@ZhiyuanZeng_
Zhiyuan Zeng
4 months
Is a single accuracy number all we can get from model evals?🤔.🚨Does NOT tell where the model fails.🚨Does NOT tell how to improve it. Introducing EvalTree🌳.🔍identifying LM weaknesses in natural language.🚀weaknesses serve as actionable guidance. (paper&demo 🔗in🧵). [1/n]
Tweet media one
Tweet media two
Tweet media three
0
1
11
@yizhongwyz
Yizhong Wang
4 months
RT @allen_ai: Announcing OLMo 2 32B: the first fully open model to beat GPT 3.5 & GPT-4o mini on a suite of popular, multi-skill benchmarks….
0
161
0
@yizhongwyz
Yizhong Wang
4 months
RT @HannaHajishirzi: Excited to drive innovation and push the boundaries of open, scientific AI research & development! 🚀 Join us at @allen….
0
15
0
@yizhongwyz
Yizhong Wang
4 months
RT @_awettig: 🤔 Ever wondered how prevalent some type of web content is during LM pre-training?. In our new paper, we propose WebOrganizer….
0
49
0
@yizhongwyz
Yizhong Wang
5 months
RT @thinkymachines: Today, we are excited to announce Thinking Machines Lab (, an artificial intelligence research….
0
513
0