
Tianqi Chen
@tqchenml
Followers
18K
Following
3K
Media
59
Statuses
1K
AssistProf @CarnegieMellon. Chief Technologist @OctoML. Creator of @XGBoostProject, @ApacheTVM. Member https://t.co/QYyfjQNWTX, @TheASF. Views are on my own
CMU
Joined May 2015
RT @JeffDean: Mark your calendars for #MLSys2026 in May, 2026 in Seattle. Submission deadline for papers is Oct 30 this year.
0
15
0
#MLSys2026 will be led by the general chair @luisceze and PC chairs @JiaZhihao and @achowdhery. The conference will be held in Bellevue on Seattle's east side. Consider submitting and bringing your latest works in AI and systemsāmore details at
š¢Exciting updates from #MLSys2025! All session recordings are now available and free to watch at Weāre also thrilled to announce that #MLSys2026 will be held in Seattle next Mayāsubmissions open next month with a deadline of Oct 30. We look forward to
0
12
57
RT @JiaZhihao: š¢Exciting updates from #MLSys2025! All session recordings are now available and free to watch at Weā¦.
0
30
0
RT @chrisdonahuey: Excited to announce šµMagenta RealTime, the first open weights music generation model capable of real-time audio generatiā¦.
0
80
0
RT @JiaZhihao: One of the best ways to reduce LLM latency is by fusing all computation and communication into a single GPU megakernel. Butā¦.
0
120
0
RT @BeidiChen: Say hello to Multiverse ā the Everything Everywhere All At Once of generative modeling. š„ Lossless, adaptive, and gloriouslā¦.
0
21
0
Check out our work on parallel reasoning š§ ; We bring an AI-assisted curator that identifies parallel paths in sequential traces, then tune models into native parallel thinkers that runs efficiently with prefix sharing and batching. Really excited about this general direction.
š„ We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation. š Multiverse is the first open-source non-AR model to achieve AIME24 and AIME25 scores of 54% and 46%. š Website: š§µ 1/n
1
15
98
RT @InfiniAILab: š„ We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation. š Multiversā¦.
0
78
0
RT @NVIDIAAIDev: .@lmsysorg (SGLang) now achieves 7,583 tokens per second per GPU running @deepseek_ai R1 on the GB200 NVL72, a 2.7x leap oā¦.
0
36
0
Checkout the technical deep dive on FlashInfer.
š Our Deep Dive Blog Covering our Winning MLSys Paper on FlashInfer Is now live ā”ļø Accelerate LLM inference with FlashInferāNVIDIAās high-performance, JIT-compiled library built for ultra-efficient transformer inference on GPUs. Go under the hood with
0
4
28
RT @NVIDIAAIDev: š Our Deep Dive Blog Covering our Winning MLSys Paper on FlashInfer Is now live ā”ļø Accelerate LLMā¦.
0
27
0
RT @matei_zaharia: Excited to launch Agent Bricks, a new way to build auto-optimized agents on your tasks. Agent Bricks uniquely takes a *dā¦.
0
45
0
RT @yi_xin_dong: @databricks 's Agent Bricks is powered by XGrammar for structured generation, and achieves high quality and efficiency. Itā¦.
0
4
0