
Kai Zhang
@DrogoKhal4
Followers
2K
Following
4K
Media
35
Statuses
494
PhD-ing @osunlp with @ysu_nlp
Columbus, OH
Joined February 2019
Big WebDreamer update! We train Dreamer-7B, a small but strong world model for real-world web planning. Beats Qwen2-72B. Matches #GPT-4o. Trained on 3M synthetic examples, and yes, all data + models are open-sourced.
Wondering how to scale inference-time compute with advanced planning for language agents? Short answer: use your LLM as a world model. More detailed answer: using GPT-4o to predict the outcome of actions on a website can deliver strong performance with improved safety and…
1
24
80
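The tweet above pitches "your LLM as a world model": before the agent commits to a web action, the model is asked to predict the page state that action would produce, and the predicted states are scored against the task goal so the agent can pick the most promising action. Below is a minimal sketch of that planning loop, assuming a generic text-in/text-out `llm` callable; the helper names and prompts are illustrative assumptions, not the released WebDreamer / Dreamer-7B code.

```python
# Minimal sketch of "LLM as a world model" planning for web agents.
# Assumptions: `llm` is any text-in/text-out completion function; the prompt
# wording and helper names are illustrative, not the released WebDreamer API.
from typing import Callable, List

LLM = Callable[[str], str]

def simulate_outcome(llm: LLM, page: str, action: str) -> str:
    """Ask the LLM, acting as a world model, to predict the page after `action`."""
    prompt = (
        "You are simulating a web browser.\n"
        f"Current page (abridged):\n{page}\n\n"
        f"Proposed action: {action}\n"
        "Describe the page state that would most likely result."
    )
    return llm(prompt)

def score_outcome(llm: LLM, goal: str, predicted_page: str) -> float:
    """Ask the LLM to rate (0-1) how much the predicted state advances the goal."""
    prompt = (
        f"Task goal: {goal}\n"
        f"Predicted page state: {predicted_page}\n"
        "On a scale from 0 to 1, how much progress toward the goal does this "
        "state represent? Answer with a single number."
    )
    try:
        return float(llm(prompt).strip())
    except ValueError:
        return 0.0

def choose_action(llm: LLM, goal: str, page: str, candidates: List[str]) -> str:
    """Model-based action selection: simulate each candidate, keep the best one."""
    scored = [
        (score_outcome(llm, goal, simulate_outcome(llm, page, act)), act)
        for act in candidates
    ]
    return max(scored, key=lambda pair: pair[0])[1]
```

Because unpromising actions get filtered in simulation rather than executed on the live site, this style of planning is also where the improved safety mentioned in the tweet comes from.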
RT @hhsun1: Postdoc Hiring: I am looking for a postdoc to work on rigorously evaluating and advancing the capabilities and safety of comp…
0
27
0
RT @Benjamin_eecs: We've always been excited about self-play unlocking continuously improving agents. Our insight: RL selects generalizable…
0
52
0
RT @tzmhuang: We're already using AI search systems every day for more and more complex tasks, but how good are they really? Challenge: eva…
0
3
0
RT @yuting_ning: Agentic search is revolutionizing how we gather information, but how reliable is it? Can it really deliver accurate answe…
0
4
0
RT @ysu_nlp: Agentic search like Deep Research is fundamentally changing web search, but it also brings an evaluation crisis. Introducin…
0
47
0
RT @MuCai7: Impressed by V-JEPA 2's improvement on TemporalBench. Indeed, we need a better video encoder for the t…
0
3
0
RT @xiangyue96: Attending #CVPR2025 in #Nashville! We will have our multimodal LLM evaluation tutorial tmr afternoon! Feel free to ping me…
0
10
0
RT @YifeiLiPKU: Introducing AutoSDT, a fully automatic pipeline that collects data-driven scientific coding tasks at scale! We use AutoSD…
0
25
0
RT @HuggingPapers: Are we heading down the right path towards omni-modality? This new paper explores the effects of extending modality i…
0
22
0
RT @YuanshengNi: Introducing VisCoder: fine-tuned language models for Python-based visualization code generation and feedback-driven sel…
0
16
0
Had a blast working with @DarthZhu_! We analyze and use modality-specific models extended from the same #LLM backbone, e.g., Qwen2-VL, -Video, -Audio on #Qwen2, to create omni ones. Though most results are negative, we have some interesting findings here :)
Extending modality based on an LLM has been common practice for multimodal LLMs. But can it generalize to omni-modality? We study the effects of extending modality and ask three questions: #LLM #MLLM #OmniModality
1
4
15
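Since Qwen2-VL, Qwen2-Video, and Qwen2-Audio all extend the same Qwen2 backbone, one common baseline for building an "omni" model from such siblings is to merge the parameters they share while keeping each model's modality-specific modules. The sketch below shows that generic merging baseline only to illustrate the setup; it is an assumption for exposition, not necessarily the method studied in the thread above.

```python
# Generic parameter-merging baseline for sibling models sharing an LLM backbone
# (illustrative assumption, not necessarily the method in the linked paper).
from typing import Dict
import torch

def merge_shared_backbone(
    sd_a: Dict[str, torch.Tensor],
    sd_b: Dict[str, torch.Tensor],
    alpha: float = 0.5,
) -> Dict[str, torch.Tensor]:
    """Interpolate parameters with matching names/shapes; keep the rest as-is."""
    merged: Dict[str, torch.Tensor] = {}
    for name, tensor_a in sd_a.items():
        tensor_b = sd_b.get(name)
        if tensor_b is not None and tensor_b.shape == tensor_a.shape:
            # Shared backbone weight: simple linear interpolation.
            merged[name] = alpha * tensor_a + (1.0 - alpha) * tensor_b
        else:
            # Modality-specific weight (e.g., a vision tower): keep model A's copy.
            merged[name] = tensor_a
    # Carry over parameters that exist only in model B (e.g., an audio encoder).
    for name, tensor_b in sd_b.items():
        merged.setdefault(name, tensor_b)
    return merged
```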
RT @davidbau: Dear MAGA friends, I have been worrying about STEM in the US a lot, because right now the Senate is writing new laws that cu…
0
72
0
RT @yizhongwyz: Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! I will continue…
0
54
0
RT @hhsun1: Realistic adversarial testing of Computer-Use Agents (CUAs) to identify their vulnerabilities and make them safer and more secu…
0
24
0
RT @lateinteraction: Sigh, it's a bit of a mess. Let me just give you guys the full nuance in one stream of consciousness since I think we…
0
85
0
RT @LiaoZeyi: Can you really trust Computer-Use Agents (CUAs) to control your computer? Not yet: @AnthropicAI Opus 4 shows an alarming…
0
32
0
RT @vardaanpahuja: Thrilled to unveil the most exciting project of my PhD: Explorer, Scaling Exploration-driven Web Trajectory Synthesis…
0
24
0
RT @irenelizihui: Today, we release #MMLUProX, which upgrades MMLU-Pro to 29 languages across 14 disciplines: 11,829 reasoning-heavy Qs pe…
0
18
0