webagentlab Profile Banner
WebAgentlab Profile
WebAgentlab

@webagentlab

Followers
571
Following
2K
Media
386
Statuses
1K

WebAgentLab is building an open-source community focused on Web Agent and the broader GUI Agent field.

join to contribute 👉
Joined November 2024
Don't wanna be here? Send us removal request.
@webagentlab
WebAgentlab
1 year
🌟 Introducing WebAgent Wiki: Your All-in-One Resource! 🌟 Key features: 📚RESEARCH PAPER: 49+ papers categorized by keywords & authors 💻PROJECT: 16+ GitHub projects related to WebAgents 🏢COMPANY:14+ companies contributing to WebAgent development 🔬EXPERT: 300+ experts,
0
6
22
@MSFTResearch
Microsoft Research
20 hours
The Microsoft Research Asia StarTrack Scholars Program is a three-month experience designed to foster global collaboration and accelerate frontier research. Applications are due today, December 15. Details on research fields, program description, and application procedures are
Tweet card summary image
microsoft.com
Microsoft Research Asia (MSRA) StarTrack Scholars Program is a visiting researcher program dedicated to empowering young scholars from across the globe. The program extends an exclusive invitation to...
0
2
2
@Tiny_Fish
TinyFish
21 hours
The web was built for humans. And honestly, that's fine for you guys. But the next trillion internet users are AI agents, robots, and devices who act on your behalf - booking appointments, filling forms, placing orders, and getting things done - using sites that will never,
34
39
200
@xueqing_w
Xueqing Wu
4 days
Want to use LLMs to build your websites? They may not be as good as shown by prior benchmarks! Introducing🔥FronTalk🔥that evaluates multi-turn multi-modal front-end coding - Users propose new evolving requests in multiple turns, including visual instructions🖼️in the form of
10
16
48
@natolambert
Nathan Lambert
2 days
Open models year in review What a year! We're back with an updated open model builder tier list, our top models of the year, and our predictions for 2026. First, the winning models: 1. DeepSeek R1 (@deepseek_ai): Transformed the AI world 2. Qwen 3 Family (@AlibabaGroup): The new
59
245
1K
@webagentlab
WebAgentlab
3 days
📊 Who’s leading GUI Agent research? I compiled a ranking of top institutions by academic publications in GUI Agents. Key takeaways: 🇨🇳 China shows strong concentration around top universities and big tech labs (Alibaba, Tsinghua, SJTU). 🇺🇸 The U.S. ecosystem is more
1
0
5
@webagentlab
WebAgentlab
4 days
AgentBay: A Hybrid Interaction Sandbox for Seamless Human-AI Intervention in Agentic Systems AgentBay is a hybrid interaction sandbox that enhances Human-in-the-Loop (HITL) collaboration in autonomous AI systems by providing secure, isolated environments and an Adaptive
0
0
1
@webagentlab
WebAgentlab
4 days
LegalWebAgent: Empowering Access to Justice via LLM-Based Web Agents The LegalWebAgent framework utilizes multimodal large language models to autonomously navigate legal processes and enhance access to justice for ordinary citizens by effectively understanding user queries,
1
0
0
@webagentlab
WebAgentlab
4 days
Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding This paper introduces ZoomClick, a training-free method that enhances GUI grounding by leveraging zoom properties to improve model accuracy and efficiency, alongside the GUIZoom-Bench
1
0
1
@webagentlab
WebAgentlab
4 days
An Index-based Approach for Efficient and Effective Web Content Extraction The paper introduces an Index-based Web Content Extraction method that enhances the efficiency and effectiveness of extracting relevant information from web pages by predicting positional indices instead
1
0
0
@webagentlab
WebAgentlab
4 days
MVP: Multiple View Prediction Improves GUI Grounding The paper introduces the Multiple View Prediction (MVP) framework, a training-free approach that enhances GUI grounding stability by aggregating predictions from multiple attention-guided views to mitigate prediction
1
0
1
@webagentlab
WebAgentlab
4 days
EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce EcomBench is a comprehensive benchmark designed to evaluate the performance of foundation agents in real-world e-commerce environments by incorporating genuine user demands and dynamic market conditions
1
1
3
@webagentlab
WebAgentlab
4 days
GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection GAIR is a novel framework that enhances GUI automation by integrating heterogeneous capabilities from multiple Multimodal Large Language Models (MLLMs) through information-joint reasoning and group
1
0
0
@webagentlab
WebAgentlab
4 days
🚨#GUIAgent Papers of the Week(12/06~12/12): ◾️GAIR ◾️EcomBench ◾️MVP ◾️IndexLM ◾️ZoomClick ◾️AgentBay ◾️iRAG ◾️LegalWebAgent check it out 👉 https://t.co/UmA4u42yoi
1
0
0
@trycua
Cua
6 days
1/3 Very last-minute - but it’s happening. We're hosting a hack day with our friends at CodeRabbit this Saturday. If you’re in SF, come hack with us!
3
6
24
@TaylorOgan
Taylor Ogan
12 days
Another DeepSeek moment. This is the world’s first actual smart phone. It’s an engineering prototype of ZTE’s Nubia M153 running ByteDance’s Doubao AI agent fused into Android at the OS level. It has complete control over the phone. It can see the UI, choose/download apps,
150
665
5K
@abhshkdz
Abhishek Das
6 days
Today, we're making Scouts available to everyone! Earlier this year, Scouts was born out of a simple observation — that so many of life's background (or even foreground!) tasks have a recurring flavor, e.g. house hunting, early stages of travel planning, sourcing leads,
71
66
418
@karen_ullrich
Dr. Karen Ullrich
6 days
Release Day 🎉 Meet OpenApps — a pure-Python, open-source ecosystem for stress-testing UI agents at scale. Runs on a single CPU. Generates thousands of unique UI variations. And it reveals just how fragile today’s SOTA agents are. (Yes, even GPT-4 and Claude struggle.)
3
16
31
@shi_weiyan
Weiyan Shi
8 days
fun panel with @jaseweston @ysu_nlp @willccbb @xwang_lk @natashajaques - What agents can/can't solve in 1 yr - 1K+ step tasks - Academia & long-horizon tasks - Continual learning: in-context vs weights - Human-AI co-evolution Claude joined as our first AI panelist! Recording🧵
@shi_weiyan
Weiyan Shi
8 days
Finally with a closing keynote by @ysu_nlp on “Computer Use: Modern Moravec’s Paradox”, we connect the history and the future 🙌 — “symbolic reasoning” vs “Perception & Mobility” in agents — future of AI — dragon-slaying on agent plasticity and reliability
8
12
85
@syz0x1
Shuyan Zhou
7 days
It was really fun to meet new people and discuss agent environments. Thanks to the workshop organizers for putting together such a great event! Here is the slide deck from the talk:
@SEAWorkshop
SEA Workshop
9 days
Invited Talk 6 "Towards Future-proof Benchmarks for Digital Agents" from Shuyan Zhou @syz0x1 (Duke University)
3
9
61
@shi_weiyan
Weiyan Shi
8 days
Thanks everyone for attending our “multi-turn interaction workshop” @mti_neurips ❤️ Hope you had great fun at the workshop and the after-party w/ @PrimeIntellect w/ 🎳🎤🏓 AI people sure know how to party! Let’s keep the “multi-turn interaction” going🤘see you next time 🙌
@shi_weiyan
Weiyan Shi
8 days
fun panel with @jaseweston @ysu_nlp @willccbb @xwang_lk @natashajaques - What agents can/can't solve in 1 yr - 1K+ step tasks - Academia & long-horizon tasks - Continual learning: in-context vs weights - Human-AI co-evolution Claude joined as our first AI panelist! Recording🧵
4
5
47