ZhiruoW Profile Banner
Zora Wang Profile
Zora Wang

@ZhiruoW

Followers
2K
Following
1K
Media
41
Statuses
323

PhD student @LTIatCMU | fun 👩🏻‍💻 🐈 💃 🪴 🎶

Pittsburgh, PA
Joined August 2021
Don't wanna be here? Send us removal request.
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
7
52
243
@JiayiiGeng
Jiayi Geng
9 days
We use LLMs for everyday tasks—research, writing, coding, decision-making. They remember our conversations, adapt to our needs and preferences. Naturally, we trust them more with repeated use. But this growing trust might be masking a hidden risk: what if their beliefs are
16
73
359
@jyangballin
John Yang
9 days
New eval! Code duels for LMs ⚔️ Current evals test LMs on *tasks*: "fix this bug," "write a test" But we code to achieve *goals*: maximize revenue, cut costs, win users Meet CodeClash: LMs compete via their codebases across multi-round tournaments to achieve high-level goals
29
91
367
@ZhiruoW
Zora Wang
14 days
Ever used a top-scoring coding agent that still couldn't help with your actual task? How can we transform agents from "interns who never learn" to "workers who improve over time"? Read about our new approach: ToM-SWE 🪄
@nlpxuhui
Xuhui Zhou
14 days
Hoping your coding agents could understand you and adapt to your preferences? Meet TOM-SWE, our new framework for coding agents that don’t just write code, but model the user's mind persistently (ranging from general preferences to small details) arxiv: https://t.co/uznLAjgWKr
0
4
18
@bravo_abad
Jorge Bravo Abad
16 days
This paper makes an important point. AI agents don’t naturally follow human workflows — they often default to coding everything, even when humans would reason, sketch, or explore first. This can make them fast, but also awkward in tasks that rely on interpretation or judgment.
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
0
2
13
@ChengZhoujun
Zhoujun (Jorge) Cheng
17 days
Agent trajectories are often left underexplored while showcasing rich behaviors. Really appreciate this offering a thoughtful snapshot of human–agent comparisons, reading like a compact and detailed GDPEval.
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
0
2
9
@EchoShao8899
Yijia Shao
17 days
Analyzing agent trajectories has become a new canonical way for evaluation. Yet most trajectories operate at the action level - dense but semantically shallow. We define 𝙬𝙤𝙧𝙠𝙛𝙡𝙤𝙬 as a higher-level abstraction and release a toolkit to induce it from raw actions for both
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
2
12
122
@oshaikh13
Omar Shaikh
17 days
We recorded a bunch of people actually working on their computers (!) and then compared agent performance to actual human workflows. Awesome paper led by @ZhiruoW :)
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
3
6
45
@gneubig
Graham Neubig
17 days
Check out our new work on examining how human and AI workers perform tasks! We get human and AI workers to perform the same tasks, extract workflows, and get insights about where agents do well and where they still have a ways to go.
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
1
5
59
@Diyi_Yang
Diyi Yang
17 days
@ZhiruoW 's research compares AI agents vs humans across real work tasks (data analysis, engineering, design, writing). Key findings: 👉Agents are 88% faster & 90-96% cheaper 👉BUT produce lower quality work, often fabricate data to mask limitations 👉Agents code everything,
@ZhiruoW
Zora Wang
17 days
Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine
1
15
107
@ZhiruoW
Zora Wang
17 days
As agents keep climbing up that "autonomy slider" We'll need: - stronger visual understanding - better action calibration, and - workflow-inspired agent designs
1
0
11
@ZhiruoW
Zora Wang
17 days
Quality? Still shaky. Agents fabricate data, misuse tools -- but finish tasks 88% faster and at a fraction of the cost. Maybe the question isn’t agents vs humans.. But how do we team humans & agents?
1
0
9
@ZhiruoW
Zora Wang
17 days
AI doesn’t always speed us up. 🧠 Augmentation → +24.3% faster, minimal disruption. ⚙️ Automation → -17.7% slower, workflows reshaped by verification & debugging.
1
0
9
@ZhiruoW
Zora Wang
17 days
Our findings reveal a striking divide: Agents approach all work programmatically -- even open-ended, visual tasks like design. Humans rely on UI-centric, perceptual interaction. Their workflows may look similar at a high level, but low-level processes are worlds apart.
1
0
12
@ZhiruoW
Zora Wang
17 days
We induce workflows to: - uniformly represent diverse work activities - enable systematic comparison across workers
1
0
7
@ZhiruoW
Zora Wang
17 days
We directly compared humans and AI agents. Across computer-use jobs. Via 5 essential work skills: data analysis, engineering, computation, writing, and design.
1
0
11
@dan_fried
Daniel Fried
20 days
Very excited to see Zora and her work recognized! Check out her research on skill learning for LLM agents, memory, and programmatic approaches to real-world tasks (even beyond code).
@ZhiruoW
Zora Wang
21 days
🎉Thrilled to be named a Google PhD Fellow! Feeling incredibly grateful for my advisors, collaborators, and friends who’ve supported and inspired me along the way 🫶 Looking forward to the journey ahead!
0
2
14
@gneubig
Graham Neubig
21 days
Congrats to @ZhiruoW on getting chosen as a Google PhD fellow! If you haven't checked out Zora's work on agents that learn from experience this is a good excuse to do so 😄
@ZhiruoW
Zora Wang
21 days
🎉Thrilled to be named a Google PhD Fellow! Feeling incredibly grateful for my advisors, collaborators, and friends who’ve supported and inspired me along the way 🫶 Looking forward to the journey ahead!
1
8
81
@ZhiruoW
Zora Wang
21 days
🎉Thrilled to be named a Google PhD Fellow! Feeling incredibly grateful for my advisors, collaborators, and friends who’ve supported and inspired me along the way 🫶 Looking forward to the journey ahead!
@Googleorg
Google.org
22 days
🎉 We're excited to announce the 2025 Google PhD Fellows! @GoogleOrg is providing over $10 million to support 255 PhD students across 35 countries, fostering the next generation of research talent to strengthen the global scientific landscape. Read more: https://t.co/0Pvuv6hsgP
20
33
426