XiangruTang Profile Banner
Rob Tang Profile
Rob Tang

@XiangruTang

Followers
1K
Following
1K
Media
85
Statuses
825

Final-year CS PhD student @Yale. Research intern @google. This account is for academic purposes.

New York City
Joined March 2019
Don't wanna be here? Send us removal request.
@XiangruTang
Rob Tang
9 days
🔥Excited to share our latest work: Agent KB .It achieves new open-source SOTA on the GAIA benchmark!. We enable agents to learn from each other's experiences across tasks through hierarchical experience sharing. Paper📜: Code🧑‍💻:
Tweet media one
1
20
56
@XiangruTang
Rob Tang
1 day
volunteer at Google booth #ICML25 as a research intern and then grab a drink!
Tweet media one
Tweet media two
0
1
10
@XiangruTang
Rob Tang
6 days
Great work created by @YilunZhao_NLP @armancohan.
0
0
1
@XiangruTang
Rob Tang
6 days
SciArena is such a cool platform: 🔬. Vote on base model outputs for real scientific literature tasks—long-form, citation-grounded answers based on actual papers. I’ve found it super useful. Go explore, vote, and give feedback! 🧠📚 .#AI4Science #AI2
Tweet media one
Tweet media two
1
2
5
@XiangruTang
Rob Tang
7 days
🚀 Just discovered an amazing app for vibe coding! @_akhaliq .✨ Describe your app in plain English → Get complete HTML/CSS/JS code.🖼️ Upload UI designs → Auto-generate code.🌐 One-click deploy and Free to use with multiple LLMs! 👉 .
3
11
41
@XiangruTang
Rob Tang
7 days
RT @ApollonVisual: @rohanpaul_ai those improvements are nothing to scoff at. really impressive "Claude-3.7 with Agent KB increased performa….
0
1
0
@XiangruTang
Rob Tang
8 days
Thanks for sharing and promoting! @_akhaliq.
@_akhaliq
AK
8 days
AGENT KB. Leveraging Cross-Domain Experience for Agentic Problem Solving
Tweet media one
0
0
7
@XiangruTang
Rob Tang
9 days
@wangchunshu @xinzhongderiyu3 @Chi_Wang_ @xingyaow_ @liujiaheng2 @espaiade @DanielStupid @richardxp888 @WUFang40615703 @GeZhang86038849 @MetaGPT_ @allhands_ai If you find this work interesting, would love your ⭐ on GitHub or 👍 on the HF paper!.
0
0
1
@XiangruTang
Rob Tang
9 days
Thread 6/7.🛠️ Framework-Agnostic: Agent KB works with different agent architectures (smolagents, OpenHands) and LLMs (GPT-4, Claude, o3-mini, etc.). It's designed as a modular infrastructure that any agent system can benefit from!
Tweet media one
1
0
1
@XiangruTang
Rob Tang
9 days
Thread 5/7.🔍 Key Innovation: Unlike existing memory systems that store raw execution logs, Agent KB maintains abstracted reasoning patterns that capture generalizable problem-solving strategies. This enables effective knowledge transfer even between dissimilar domains! 🎯
Tweet media one
1
0
2
@XiangruTang
Rob Tang
9 days
Thread 4/7.💻 SWE-bench Results (Code repair tasks):. Claude-3.7: 41.33% → 53.33% (+12.0pp).Significant improvements across all tested models. Our cross-domain knowledge transfer works beyond just Q&A - it enhances code understanding and debugging too!
Tweet media one
1
0
1
@XiangruTang
Rob Tang
9 days
Thread 3/7.📊 GAIA Results (Game-changing improvements!):. GPT-4.1: 55.15% → 73.94% (+18.79pp).Claude-3.7: 58.79% → 75.15% (+16.36pp).Level 3 (hardest): Claude-3.7 jumps from 38.46% to 57.69% (+19.23pp). This shows our approach excels, especially on complex reasoning tasks! 🚀
Tweet media one
1
0
2
@XiangruTang
Rob Tang
9 days
Thread 2/7.💡Our Solution: Agent KB introduces a Teacher-Student dual-phase retrieval with a novel Reason-Retrieve-Refine pipeline: .Student Agent: retrieves workflow-level patterns for overall strategy .Teacher Agent: retrieves step-level experiences for execution refinement
Tweet media one
Tweet media two
1
0
2
@XiangruTang
Rob Tang
9 days
Thread 1/7.🤔 The Problem: Current agents can't learn from each other's experiences. They repeatedly rediscover similar problem-solving strategies when encountering new tasks, even when successful approaches from related domains could be adapted.
Tweet media one
1
0
2
@XiangruTang
Rob Tang
12 days
What we need is a healthier and more inclusive community infrastructure, not fear-based accountability through punishment.
0
0
1
@XiangruTang
Rob Tang
12 days
Fear-driven reviewing can lead to anxiety and rushed, irresponsible decisions. In practice, it takes few minutes for an Area Chair to reassign a reviewer. If someone is unavailable, they should be encouraged to notify the AC early—not be coerced through punitive policies.
@XiangruTang
Rob Tang
12 days
It's profoundly unethical for #NeurIPS to enforce reviewing obligations by threatening to desk-reject if a coauthor fails to submit reviews. Coauthors may feel pressured to urge their colleagues to rush reviews, resulting in low-quality reports.
0
0
4
@XiangruTang
Rob Tang
12 days
It's profoundly unethical for #NeurIPS to enforce reviewing obligations by threatening to desk-reject if a coauthor fails to submit reviews. Coauthors may feel pressured to urge their colleagues to rush reviews, resulting in low-quality reports.
1
0
5
@XiangruTang
Rob Tang
16 days
**Biomedical Superintelligence** is the paradigm I have been actively advancing. It's a continuously evolving multi-agent system that autonomously integrates reasoning, hypothesis gen, experiment design, and feedback-driven learning across molecular, cellular, and clinical scales.
@_jasonwei
Jason Wei
17 days
We don’t have AI self-improves yet, and when we do it will be a game-changer. With more wisdom now compared to the GPT-4 days, it's obvious that it will not be a “fast takeoff”, but rather extremely gradual across many years, probably a decade. The first thing to know is that.
0
0
4
@XiangruTang
Rob Tang
16 days
Truly embracing San Francisco culture.
@djcows
djcows
16 days
visiting the xAI office really humbled me
Tweet media one
0
0
4