jiahuigao3 Profile Banner
Jiahui Gao Profile
Jiahui Gao

@jiahuigao3

Followers
283
Following
212
Media
1
Statuses
47

Hong Kong
Joined July 2018
Don't wanna be here? Send us removal request.
@jiahuigao3
Jiahui Gao
16 days
RT @gm8xx8: Follow-up to Dream 7B, now focused on code:. Dream-Coder 7B is a diffusion-based code LLM from HKU + Huawei Noah’s Ark, built o….
0
6
0
@jiahuigao3
Jiahui Gao
17 days
RT @JiachengYe15: 📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering s….
0
22
0
@jiahuigao3
Jiahui Gao
17 days
RT @ikekong: What happend after Dream 7B?. First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, traine….
0
33
0
@jiahuigao3
Jiahui Gao
17 days
To address variable‑length generation, DreamOn dynamically adjusts masked spans during infilling, expanding or contracting them to precisely match the target length.✅.
@WilliamZR7
Zirui Wu @ACL2025 🇦🇹
17 days
We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.
0
1
9
@jiahuigao3
Jiahui Gao
17 days
Dream-Coder, trained entirely on public data, achieves state-of-the-art coding performance among open diffusion code LLMs.
@_zhihuixie
Zhihui Xie
17 days
🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code  LLM to date.
Tweet media one
0
0
11
@jiahuigao3
Jiahui Gao
2 months
Very cool! We also welcome everyone to check out our large diffusion language model called Dream-7B announced last month. We've open-sourced the checkpoint. Try our demo at:. For more details, please refer to our blog:
Tweet card summary image
huggingface.co
@GoogleDeepMind
Google DeepMind
2 months
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
0
0
12
@jiahuigao3
Jiahui Gao
4 months
Dream 7B: A general diffusion language model that happens to excel at planning. Without task-specific training, it outperforms Qwen2.5 7B and LLaMA3 8B on countdown and sudoku problems.
@JiachengYe15
Jiacheng Ye
4 months
🚀Excited to announce Dream 7B (Diffusion reasoning model): the most powerful open diffusion large language model to date.
Tweet media one
0
0
2
@jiahuigao3
Jiahui Gao
4 months
RT @JiachengYe15: 🚀Excited to announce Dream 7B (Diffusion reasoning model): the most powerful open diffusion large language model to date.….
0
206
0
@jiahuigao3
Jiahui Gao
4 months
RT @ai4mathworkshop: 📣🔊 Excited to announce the 2nd AI for Math Workshop at #ICML2025 @icmlconf! . 🔍 Workshop details: .
0
10
0
@jiahuigao3
Jiahui Gao
4 months
RT @hahahawu2: 💡Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging. We comprehensively study existing model merging methods….
0
11
0
@jiahuigao3
Jiahui Gao
4 months
RT @mircale2003: 🤔How can we obtain Long-CoT data for theorem proving?. 🚀DeepSeek-R1 utilizes large-scale collected Long-CoT data interleav….
0
1
0
@jiahuigao3
Jiahui Gao
5 months
RT @ikekong: Come to play chess with our diffusion reasoning model here: by @JiachengYe15 ! Check out our research….
Tweet card summary image
lichess.org
BOT diffusearchv0 played 1008 games since Nov 21, 2024. Current Bullet rating: 1680.
0
13
0
@jiahuigao3
Jiahui Gao
5 months
RT @JiachengYe15: 🤔 Always wondering if a next-token prediction model is the end of planning and reasoning. 🎯 Now excited to announce our….
0
5
0
@jiahuigao3
Jiahui Gao
5 months
RT @ZhijiangG: 🚀Exciting to see how recent advancements like OpenAI’s O1/O3 & DeepSeek’s R1 are pushing the boundaries! .Check out our late….
0
62
0
@jiahuigao3
Jiahui Gao
5 months
RT @chuanyang_jin: How to achieve human-level open-ended machine Theory of Mind?. Introducing #AutoToM: a fully automated and open-ended To….
0
22
0
@jiahuigao3
Jiahui Gao
5 months
RT @SterZhang: 🚀 Introducing VLM²-Bench!. A simple yet essential ability that we use in daily life. But when tackling vision-centric tasks….
0
47
0
@jiahuigao3
Jiahui Gao
6 months
RT @_zhihuixie: Introducing CTRL, a new framework that trains LLMs to critique via RL without human supervision or distillation, enabling t….
0
59
0
@jiahuigao3
Jiahui Gao
6 months
A very interesting direction! We also had an early exploration in this area, where we enabled VLMs to localize a target object by reasoning over user instructions and then utilized a tool to further localize the object in the image.
Tweet card summary image
github.com
Contribute to OptimalScale/DetGPT development by creating an account on GitHub.
@AndrewYNg
Andrew Ng
6 months
Introducing Agentic Object Detection!. Given a text prompt like “unripe strawberries” or “Kellogg’s branded cereal” and an image, we use an agentic workflow to reason at length and detect the specified objects. No need to label any training data. Watch the video for details.
0
0
4
@jiahuigao3
Jiahui Gao
7 months
RT @Renee42581826: 🚀Excited to co-organize the #ICLR2025 Workshop on Modularity for Collaborative, Decentralized, and Continual Deep Learni….
0
13
0
@jiahuigao3
Jiahui Gao
7 months
RT @li_chengzu: Forget just thinking in words. 🚀 New Era of Multimodal Reasoning🚨.🔍 Imagine While Reasoning in Space with MVoT. Multimodal….
0
168
0