DrogoKhal4 Profile Banner
Kai Zhang Profile
Kai Zhang

@DrogoKhal4

Followers
2K
Following
4K
Media
35
Statuses
494

PhD-ing @osunlp with @ysu_nlp

Columbus, OH
Joined February 2019
Don't wanna be here? Send us removal request.
@DrogoKhal4
Kai Zhang
3 months
šŸš€Big WebDreamer update!.We train šŸ’­Dreamer-7B, a small but strong world model for real-world web planning. šŸ’„Beats Qwen2-72B.āš–ļøMatches #GPT-4o.Trained on 3M synthetic examples — and yes, all data + models are open-sourced.
Tweet media one
@yugu_nlp
Yu Gu
8 months
ā“Wondering how to scale inference-time compute with advanced planning for language agents?. šŸ™‹ā€ā™‚ļøShort answer: Using your LLM as a world model.šŸ’”More detailed answer: Using GPT-4o to predict the outcome of actions on a website can deliver strong performance with improved safety and
Tweet media one
1
24
80
@DrogoKhal4
Kai Zhang
2 days
RT @hhsun1: 🚨 Postdoc Hiring:.I am looking for a postdoc to work on rigorously evaluating and advancing the capabilities and safety of comp….
0
27
0
@DrogoKhal4
Kai Zhang
11 days
RT @scaling01: this is scientific seppuku
Tweet media one
0
181
0
@DrogoKhal4
Kai Zhang
16 days
RT @Benjamin_eecs: We've always been excited about self-play unlocking continuously improving agents. Our insight: RL selects generalizable….
0
52
0
@DrogoKhal4
Kai Zhang
19 days
RT @tzmhuang: We're already using AI search systems every day for more and more complex tasks, but how good are they really? Challenge: eva….
0
3
0
@DrogoKhal4
Kai Zhang
19 days
RT @yuting_ning: 🧐Agentic search is revolutionizing how we gather information, but how reliable is it? Can it really deliver accurate answe….
0
4
0
@DrogoKhal4
Kai Zhang
20 days
RT @ysu_nlp: šŸ”ŽAgentic search like Deep Research is fundamentally changing web search, but it also brings an evaluation crisisāš ļø. Introducin….
0
47
0
@DrogoKhal4
Kai Zhang
1 month
RT @MuCai7: Impressed by V-JEPA 2's improvement on TemporalBench . Indeed, we need a better video encoder for the t….
0
3
0
@DrogoKhal4
Kai Zhang
1 month
RT @xiangyue96: Attending #CVPR2025 in #Nashville! We will have our multimodal LLM evaluation tutorial tmr afternoon! Feel free to ping me….
0
10
0
@DrogoKhal4
Kai Zhang
1 month
RT @YifeiLiPKU: šŸ“¢ Introducing AutoSDT, a fully automatic pipeline that collects data-driven scientific coding tasks at scale!.We use AutoSD….
0
25
0
@DrogoKhal4
Kai Zhang
1 month
RT @HuggingPapers: Are we heading down the right path towards omni-modality? šŸ¤”. This new paper explores the effects of extending modality i….
0
22
0
@DrogoKhal4
Kai Zhang
1 month
Thanks for sharing our work :).
@HuggingPapers
DailyPapers
1 month
Are we heading down the right path towards omni-modality? šŸ¤”. This new paper explores the effects of extending modality in language models.
Tweet media one
0
2
9
@DrogoKhal4
Kai Zhang
1 month
RT @YuanshengNi: šŸ“¢ Introducing VisCoder – fine-tuned language models for Python-based visualization code generation and feedback-driven sel….
0
16
0
@DrogoKhal4
Kai Zhang
1 month
Had a blast working with @DarthZhu_ !.We try to analyze and use the modality-specific models extended from the same #LLM backbones to create omni ones. e.g., Qwen2-VL, -Video, -Audio, on #Qwen2.Tho most results are negative, we have some interesting findings here :).
@DarthZhu_
Tinghui Zhu
1 month
😓 Extending modality based on an LLM has been a common practice when we are talking about multimodal LLMs. ā“ Can it generalize to omni-modality?. We study the effects of extending modality and ask three questions:. #LLM #MLLM #OmniModality.
1
4
15
@DrogoKhal4
Kai Zhang
2 months
RT @davidbau: Dear MAGA friends,. I have been worrying about STEM in the US a lot, because right now the Senate is writing new laws that cu….
0
72
0
@DrogoKhal4
Kai Zhang
2 months
RT @yizhongwyz: Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! . I will continue….
0
54
0
@DrogoKhal4
Kai Zhang
2 months
RT @hhsun1: Realistic adversarial testing of Computer-Use Agents (CUAs) to identify their vulnerabilities and make them safer and more secu….
0
24
0
@DrogoKhal4
Kai Zhang
2 months
RT @lateinteraction: Sigh, it's a bit of a mess. Let me just give you guys the full nuance in one stream of consciousness since I think we….
0
85
0
@DrogoKhal4
Kai Zhang
2 months
RT @LiaoZeyi: ā‰ļøCan you really trust Computer-Use Agents (CUAs) to control your computerā‰ļø. Not yet, @AnthropicAI Opus 4 shows an alarming….
0
32
0
@DrogoKhal4
Kai Zhang
2 months
RT @vardaanpahuja: šŸš€ Thrilled to unveil the most exciting project of my PhD:.Explorer — Scaling Exploration-driven Web Trajectory Synthesis….
0
24
0
@DrogoKhal4
Kai Zhang
2 months
RT @irenelizihui: šŸ“¢ Today, we release #MMLUProX, which upgrades MMLU-Pro to 29 languages across 14 disciplines—11,829 reasoning-heavy Qs pe….
0
18
0