xwang_lk Profile Banner
Xin Eric Wang Profile
Xin Eric Wang

@xwang_lk

Followers
18K
Following
7K
Media
417
Statuses
3K

Head of Research @SimularAI. Professor @ UCSB @ucsantabarbara. #Multimodal #Embodied #Agents. AI for Humanity in the long run. πŸ¦‹ https://t.co/v7sGFsIKxf

Santa Barbara, CA
Joined February 2012
Don't wanna be here? Send us removal request.
@xwang_lk
Xin Eric Wang
11 days
πŸš€ Meet π€π πžπ§π­ 𝐒2.5, our latest open-source Computer Use Agent, setting 𝐧𝐞𝐰 π’πŽπ“π€ 𝐨𝐧 πŽπ’π–π¨π«π₯𝐝 π•πžπ«π’πŸπ’πžπ!. ✨ Upgrades: Flexible context engineering + advanced high-level reasoning. πŸ“ˆ OSWorld Verified Results:.- 100 steps: S2.5 56.0% (Prev. SOTA: 53.1%)
Tweet media one
13
45
316
@xwang_lk
Xin Eric Wang
6 hours
Generative World Model Theorem: Video generation is to the physical world what LLMs are to the internet.
1
3
15
@xwang_lk
Xin Eric Wang
7 hours
RT @ShijieZhoucla: πŸš€ Our #ICCV2025 work β€” VLM4D is here!. The first benchmark to truly test if Vision-Language Models can think in 4D (3D s….
0
3
0
@xwang_lk
Xin Eric Wang
2 days
They should have named the o-series GPT-5. Period.
0
0
7
@xwang_lk
Xin Eric Wang
4 days
Really? Still?
Tweet media one
17
4
90
@xwang_lk
Xin Eric Wang
5 days
My ranking: . GPT-3.5 (ChatGPT) > o1 > GPT-4V > o3 >= GPT-4o > GPT-5 > GPT-4.1 > GPT-4.5.
1
0
15
@xwang_lk
Xin Eric Wang
5 days
o1 is a bigger deal than GPT-5. If the reasoning series weren't released, GPT-5 would've stunned everyone. IMO, the reason why o1 was not named as GPT-5 is that they wanted GPT-5 to be a single model for both. However, now GPT-5 is a hybrid system of multiple underlying models.
1
0
7
@xwang_lk
Xin Eric Wang
5 days
If it wasn't clear to you, now it is super clear.
@xwang_lk
Xin Eric Wang
2 years
GPT-4 and Gemini are systems and products, not models. There must be lots of engineering tricks and hard-coded rules, and we would never know how many models are inside the systems before open-sourcing. Requesting comparison with them is unfair in scientific research.
0
0
9
@xwang_lk
Xin Eric Wang
5 days
TBH, functionality-wise, GPT-5 meets my expectations, but I am a bit disappointed that it is not a single model yet, which I thought was the goal for GPT-5. This feels like an early release due to unknown pressure.
Tweet media one
0
0
6
@xwang_lk
Xin Eric Wang
6 days
RT @bremen79: Dear Reviewers,. It is completely fine to admit you were wrong in your initial evaluations. You will not lose anything, and t….
0
15
0
@xwang_lk
Xin Eric Wang
6 days
RT @FrancescoLocat8: @xwang_lk PC of the DB track here! Please kindly remind the reviewer that contributing new methods is not necessary fo….
0
3
0
@xwang_lk
Xin Eric Wang
6 days
A YC startup told us that they are building everything on top of Agent S2. VCs back startups with capital. We back them with infra and agent tech. More importantly, Agent S keeps evolving. So you can, too.
@xwang_lk
Xin Eric Wang
11 days
πŸš€ Meet π€π πžπ§π­ 𝐒2.5, our latest open-source Computer Use Agent, setting 𝐧𝐞𝐰 π’πŽπ“π€ 𝐨𝐧 πŽπ’π–π¨π«π₯𝐝 π•πžπ«π’πŸπ’πžπ!. ✨ Upgrades: Flexible context engineering + advanced high-level reasoning. πŸ“ˆ OSWorld Verified Results:.- 100 steps: S2.5 56.0% (Prev. SOTA: 53.1%)
Tweet media one
0
0
4
@xwang_lk
Xin Eric Wang
6 days
A benchmark paper submitted to NeurIPS Dataset & Benchmark Track. Reviewer: Valuable benchmark! But no new modeling method proposed. Reject. πŸ˜‚πŸ˜‚πŸ˜‚.
15
10
571
@xwang_lk
Xin Eric Wang
6 days
One step at a time, we will get there.
0
0
2
@xwang_lk
Xin Eric Wang
6 days
AGI will never be achieved without research.
0
0
10
@xwang_lk
Xin Eric Wang
7 days
Tried gpt-oss locally. It is good, quite slow though. It works exactly like R1. Maybe better, but they work the same way. (R1 deserves a citation.). No multimodal yet. But I think the community will build multimodal-oss very soon. Re: why open source β€” by gpt-oss. β€œFrom the
Tweet media one
1
2
36
@xwang_lk
Xin Eric Wang
7 days
πŸ‘πŸ‘πŸ‘.
@OpenAI
OpenAI
7 days
We released two open-weight reasoning modelsβ€”gpt-oss-120b and gpt-oss-20bβ€”under an Apache 2.0 license. Developed with open-source community feedback, these models deliver meaningful advancements in both reasoning capabilities & safety.
0
0
1
@xwang_lk
Xin Eric Wang
9 days
The more you know, the more you don’t know.
@Yuchenj_UW
Yuchen Jin
9 days
Real intelligence starts when you admit how much you still don’t know.
1
2
17
@xwang_lk
Xin Eric Wang
11 days
@SimularAI πŸ‘ Huge shout-out to the amazing @SimularAI Research team for the dedication and hard work. More exciting updates coming soon!. πŸ™ Special thanks to @TianbaoX and the OSWorld team for making the OSWorld AWS evaluation reliable and accessible!.
0
0
5
@xwang_lk
Xin Eric Wang
11 days
Agent S2.5 is open sourced here: Try it out!. It is also available on @SimularAI Cloud (no deployment needed).
Tweet card summary image
github.com
Agent S: an open agentic framework that uses computers like a human - simular-ai/Agent-S
1
0
16