Xin Eric Wang @xwang_lk X Profile

Xin Eric Wang

@xwang_lk

Followers

18K

Following

7K

Media

417

Statuses

3K

Head of Research @SimularAI. Professor @ UCSB @ucsantabarbara. #Multimodal #Embodied #Agents. AI for Humanity in the long run. 🦋 https://t.co/v7sGFsIKxf

Santa Barbara, CA

Joined February 2012

Don't wanna be here? Send us removal request.

Xin Eric Wang

@xwang_lk

11 days

🚀 Meet 𝐀𝐠𝐞𝐧𝐭 𝐒2.5, our latest open-source Computer Use Agent, setting 𝐧𝐞𝐰 𝐒𝐎𝐓𝐀 𝐨𝐧 𝐎𝐒𝐖𝐨𝐫𝐥𝐝 𝐕𝐞𝐫𝐢𝐟𝐢𝐞𝐝!. ✨ Upgrades: Flexible context engineering + advanced high-level reasoning. 📈 OSWorld Verified Results:.- 100 steps: S2.5 56.0% (Prev. SOTA: 53.1%)

13

45

316

Xin Eric Wang

@xwang_lk

6 hours

Generative World Model Theorem: Video generation is to the physical world what LLMs are to the internet.

1

3

15

Xin Eric Wang

@xwang_lk

7 hours

RT @ShijieZhoucla: 🚀 Our #ICCV2025 work — VLM4D is here!. The first benchmark to truly test if Vision-Language Models can think in 4D (3D s….

0

3

0

Xin Eric Wang

@xwang_lk

2 days

They should have named the o-series GPT-5. Period.

0

7

Xin Eric Wang

@xwang_lk

4 days

Really? Still?

17

4

90

Xin Eric Wang

@xwang_lk

5 days

My ranking: . GPT-3.5 (ChatGPT) > o1 > GPT-4V > o3 >= GPT-4o > GPT-5 > GPT-4.1 > GPT-4.5.

1

0

15

Xin Eric Wang

@xwang_lk

5 days

o1 is a bigger deal than GPT-5. If the reasoning series weren't released, GPT-5 would've stunned everyone. IMO, the reason why o1 was not named as GPT-5 is that they wanted GPT-5 to be a single model for both. However, now GPT-5 is a hybrid system of multiple underlying models.

1

0

7

Xin Eric Wang

@xwang_lk

5 days

If it wasn't clear to you, now it is super clear.

Xin Eric Wang

@xwang_lk

2 years

GPT-4 and Gemini are systems and products, not models. There must be lots of engineering tricks and hard-coded rules, and we would never know how many models are inside the systems before open-sourcing. Requesting comparison with them is unfair in scientific research.

0

9

Xin Eric Wang

@xwang_lk

5 days

TBH, functionality-wise, GPT-5 meets my expectations, but I am a bit disappointed that it is not a single model yet, which I thought was the goal for GPT-5. This feels like an early release due to unknown pressure.

0

6

Xin Eric Wang

@xwang_lk

6 days

RT @bremen79: Dear Reviewers,. It is completely fine to admit you were wrong in your initial evaluations. You will not lose anything, and t….

0

15

0

Xin Eric Wang

@xwang_lk

6 days

RT @FrancescoLocat8: @xwang_lk PC of the DB track here! Please kindly remind the reviewer that contributing new methods is not necessary fo….

0

3

0

Xin Eric Wang

@xwang_lk

6 days

A YC startup told us that they are building everything on top of Agent S2. VCs back startups with capital. We back them with infra and agent tech. More importantly, Agent S keeps evolving. So you can, too.

Xin Eric Wang

@xwang_lk

11 days

🚀 Meet 𝐀𝐠𝐞𝐧𝐭 𝐒2.5, our latest open-source Computer Use Agent, setting 𝐧𝐞𝐰 𝐒𝐎𝐓𝐀 𝐨𝐧 𝐎𝐒𝐖𝐨𝐫𝐥𝐝 𝐕𝐞𝐫𝐢𝐟𝐢𝐞𝐝!. ✨ Upgrades: Flexible context engineering + advanced high-level reasoning. 📈 OSWorld Verified Results:.- 100 steps: S2.5 56.0% (Prev. SOTA: 53.1%)

0

4

Xin Eric Wang

@xwang_lk

6 days

A benchmark paper submitted to NeurIPS Dataset & Benchmark Track. Reviewer: Valuable benchmark! But no new modeling method proposed. Reject. 😂😂😂.

15

10

571

Xin Eric Wang

@xwang_lk

6 days

One step at a time, we will get there.

0

2

Xin Eric Wang

@xwang_lk

6 days

AGI will never be achieved without research.

0

10

Xin Eric Wang

@xwang_lk

7 days

Tried gpt-oss locally. It is good, quite slow though. It works exactly like R1. Maybe better, but they work the same way. (R1 deserves a citation.). No multimodal yet. But I think the community will build multimodal-oss very soon. Re: why open source — by gpt-oss. “From the

1

2

36

Xin Eric Wang

@xwang_lk

7 days

👏👏👏.

OpenAI

@OpenAI

7 days

We released two open-weight reasoning models—gpt-oss-120b and gpt-oss-20b—under an Apache 2.0 license. Developed with open-source community feedback, these models deliver meaningful advancements in both reasoning capabilities & safety.

0

1

Xin Eric Wang

@xwang_lk

9 days

The more you know, the more you don’t know.

Yuchen Jin

@Yuchenj_UW

9 days

Real intelligence starts when you admit how much you still don’t know.

1

2

17

Xin Eric Wang

@xwang_lk

11 days

@SimularAI 👏 Huge shout-out to the amazing @SimularAI Research team for the dedication and hard work. More exciting updates coming soon!. 🙏 Special thanks to @TianbaoX and the OSWorld team for making the OSWorld AWS evaluation reliable and accessible!.

0

5

Xin Eric Wang

@xwang_lk

11 days

Agent S2.5 is open sourced here: Try it out!. It is also available on @SimularAI Cloud (no deployment needed).

github.com

Agent S: an open agentic framework that uses computers like a human - simular-ai/Agent-S

1

0

16