
Jiahui Gao
@jiahuigao3
Followers: 283 · Following: 212 · Media: 1 · Statuses: 47
RT @gm8xx8: Follow-up to Dream 7B, now focused on code: Dream-Coder 7B is a diffusion-based code LLM from HKU + Huawei Noah’s Ark, built o…
RT @JiachengYe15: 📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering s…
RT @ikekong: What happened after Dream 7B? First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, traine…
To address variable-length generation, DreamOn dynamically adjusts masked spans during infilling, expanding or contracting them to precisely match the target length. ✅
We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.
Very cool! We also welcome everyone to check out our large diffusion language model called Dream-7B announced last month. We've open-sourced the checkpoint. Try our demo at:. For more details, please refer to our blog:
huggingface.co
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
Dream 7B: A general diffusion language model that happens to excel at planning. Without task-specific training, it outperforms Qwen2.5 7B and LLaMA3 8B on countdown and sudoku problems.
🚀Excited to announce Dream 7B (Diffusion reasoning model): the most powerful open diffusion large language model to date.
RT @JiachengYe15: 🚀Excited to announce Dream 7B (Diffusion reasoning model): the most powerful open diffusion large language model to date.…
RT @ai4mathworkshop: 📣🔊 Excited to announce the 2nd AI for Math Workshop at #ICML2025 @icmlconf! 🔍 Workshop details:
RT @hahahawu2: 💡Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging. We comprehensively study existing model merging methods…
RT @mircale2003: 🤔How can we obtain Long-CoT data for theorem proving? 🚀DeepSeek-R1 utilizes large-scale collected Long-CoT data interleav…
RT @ikekong: Come play chess with our diffusion reasoning model here: by @JiachengYe15! Check out our research…
lichess.org
BOT diffusearchv0 played 1008 games since Nov 21, 2024. Current Bullet rating: 1680.
RT @JiachengYe15: 🤔 Always wondering if a next-token prediction model is the end of planning and reasoning. 🎯 Now excited to announce our…
RT @ZhijiangG: 🚀Exciting to see how recent advancements like OpenAI’s O1/O3 & DeepSeek’s R1 are pushing the boundaries! Check out our late…
RT @chuanyang_jin: How to achieve human-level open-ended machine Theory of Mind? Introducing #AutoToM: a fully automated and open-ended To…
RT @SterZhang: 🚀 Introducing VLM²-Bench! A simple yet essential ability that we use in daily life. But when tackling vision-centric tasks…
RT @_zhihuixie: Introducing CTRL, a new framework that trains LLMs to critique via RL without human supervision or distillation, enabling t…
A very interesting direction! We also had an early exploration in this area, where we enabled VLMs to localize a target object by reasoning over user instructions and then utilized a tool to further localize the object in the image.
github.com/OptimalScale/DetGPT
Introducing Agentic Object Detection! Given a text prompt like “unripe strawberries” or “Kellogg’s branded cereal” and an image, we use an agentic workflow to reason at length and detect the specified objects. No need to label any training data. Watch the video for details.
RT @Renee42581826: 🚀Excited to co-organize the #ICLR2025 Workshop on Modularity for Collaborative, Decentralized, and Continual Deep Learni…
RT @li_chengzu: Forget just thinking in words. 🚀 New Era of Multimodal Reasoning 🚨 🔍 Imagine While Reasoning in Space with MVoT. Multimodal…