Jiahui Gao @jiahuigao3 X Profile

Jiahui Gao

@jiahuigao3

Followers

283

Following

212

Media

1

Statuses

47

Hong Kong

Joined July 2018

Don't wanna be here? Send us removal request.

Jiahui Gao

@jiahuigao3

16 days

RT @gm8xx8: Follow-up to Dream 7B, now focused on code:. Dream-Coder 7B is a diffusion-based code LLM from HKU + Huawei Noah’s Ark, built o….

0

6

0

Jiahui Gao

@jiahuigao3

17 days

RT @JiachengYe15: 📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering s….

0

22

0

Jiahui Gao

@jiahuigao3

17 days

RT @ikekong: What happend after Dream 7B?. First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, traine….

0

33

0

Jiahui Gao

@jiahuigao3

17 days

To address variable‑length generation, DreamOn dynamically adjusts masked spans during infilling, expanding or contracting them to precisely match the target length.✅.

Zirui Wu @ACL2025 🇦🇹

@WilliamZR7

17 days

We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.

0

1

9

Jiahui Gao

@jiahuigao3

17 days

Dream-Coder, trained entirely on public data, achieves state-of-the-art coding performance among open diffusion code LLMs.

Zhihui Xie

@_zhihuixie

17 days

🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code  LLM to date.

0

11

Jiahui Gao

@jiahuigao3

2 months

Very cool! We also welcome everyone to check out our large diffusion language model called Dream-7B announced last month. We've open-sourced the checkpoint. Try our demo at:. For more details, please refer to our blog:

huggingface.co

Google DeepMind

@GoogleDeepMind

2 months

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO

0

12

Jiahui Gao

@jiahuigao3

4 months

Dream 7B: A general diffusion language model that happens to excel at planning. Without task-specific training, it outperforms Qwen2.5 7B and LLaMA3 8B on countdown and sudoku problems.

Jiacheng Ye

@JiachengYe15

4 months

🚀Excited to announce Dream 7B (Diffusion reasoning model): the most powerful open diffusion large language model to date.

0

2

Jiahui Gao

@jiahuigao3

4 months

RT @JiachengYe15: 🚀Excited to announce Dream 7B (Diffusion reasoning model): the most powerful open diffusion large language model to date.….

0

206

0

Jiahui Gao

@jiahuigao3

4 months

RT @ai4mathworkshop: 📣🔊 Excited to announce the 2nd AI for Math Workshop at #ICML2025 @icmlconf! . 🔍 Workshop details: .

0

10

0

Jiahui Gao

@jiahuigao3

4 months

RT @hahahawu2: 💡Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging. We comprehensively study existing model merging methods….

0

11

0

Jiahui Gao

@jiahuigao3

4 months

RT @mircale2003: 🤔How can we obtain Long-CoT data for theorem proving?. 🚀DeepSeek-R1 utilizes large-scale collected Long-CoT data interleav….

0

1

0

Jiahui Gao

@jiahuigao3

5 months

RT @ikekong: Come to play chess with our diffusion reasoning model here: by @JiachengYe15 ! Check out our research….

lichess.org

BOT diffusearchv0 played 1008 games since Nov 21, 2024. Current Bullet rating: 1680.

0

13

0

Jiahui Gao

@jiahuigao3

5 months

RT @JiachengYe15: 🤔 Always wondering if a next-token prediction model is the end of planning and reasoning. 🎯 Now excited to announce our….

0

5

0

Jiahui Gao

@jiahuigao3

5 months

RT @ZhijiangG: 🚀Exciting to see how recent advancements like OpenAI’s O1/O3 & DeepSeek’s R1 are pushing the boundaries! .Check out our late….

0

62

0

Jiahui Gao

@jiahuigao3

5 months

RT @chuanyang_jin: How to achieve human-level open-ended machine Theory of Mind?. Introducing #AutoToM: a fully automated and open-ended To….

0

22

0

Jiahui Gao

@jiahuigao3

5 months

RT @SterZhang: 🚀 Introducing VLM²-Bench!. A simple yet essential ability that we use in daily life. But when tackling vision-centric tasks….

0

47

0

Jiahui Gao

@jiahuigao3

6 months

RT @_zhihuixie: Introducing CTRL, a new framework that trains LLMs to critique via RL without human supervision or distillation, enabling t….

0

59

0

Jiahui Gao

@jiahuigao3

6 months

A very interesting direction! We also had an early exploration in this area, where we enabled VLMs to localize a target object by reasoning over user instructions and then utilized a tool to further localize the object in the image.

github.com

Contribute to OptimalScale/DetGPT development by creating an account on GitHub.

Andrew Ng

@AndrewYNg

6 months

Introducing Agentic Object Detection!. Given a text prompt like “unripe strawberries” or “Kellogg’s branded cereal” and an image, we use an agentic workflow to reason at length and detect the specified objects. No need to label any training data. Watch the video for details.

0

4

Jiahui Gao

@jiahuigao3

7 months

RT @Renee42581826: 🚀Excited to co-organize the #ICLR2025 Workshop on Modularity for Collaborative, Decentralized, and Continual Deep Learni….

0

13

0

Jiahui Gao

@jiahuigao3

7 months

RT @li_chengzu: Forget just thinking in words. 🚀 New Era of Multimodal Reasoning🚨.🔍 Imagine While Reasoning in Space with MVoT. Multimodal….

0

168

0