
Junchen Jiang
@JunchenJiang
Followers: 397 · Following: 88 · Media: 4 · Statuses: 85
CS Prof @ UChicago https://t.co/U01oOWGnip (Fast distributed LLM inference) https://t.co/hoetjwXKIt (Best KV cache layer)
Chicago, IL
Joined September 2012
RT @lmcache: 8 KV-Cache Systems You Can't Afford to Miss in 2025. By 2025, KV-cache has evolved from a "nice-to-have" optimization into a c…
0
16
0
RT @TerryTangYuan: Excited to share that I'll be speaking at Cloud Native + Kubernetes AI Day, in addition to @KubeCon_! Dan Sun and I will be del…
colocatedeventsna2025.sched.com
View more about this event at CNCF-hosted Co-located Events North America 2025
0
4
0
RT @lmcache: CacheGen lets you store KV caches on disk or AWS S3 and load them way faster than recomputing! Mode…
0
6
0
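
The CacheGen retweet above is about persisting a prefix's KV cache instead of recomputing it. Below is a minimal Python sketch of the store-and-reload idea only; CacheGen itself compresses the KV tensors with a custom codec before shipping them to disk or S3, and the names here (kv_path, save_kv, load_kv) and the use of torch.save are illustrative assumptions, not the LMCache/CacheGen API.

    # Sketch: key an on-disk KV blob by a hash of the token prefix it encodes,
    # so a later request with the same prefix can reload it instead of re-running prefill.
    import hashlib
    import os
    import torch

    def kv_path(prefix_text: str, cache_dir: str = "./kv_store") -> str:
        os.makedirs(cache_dir, exist_ok=True)
        digest = hashlib.sha256(prefix_text.encode()).hexdigest()[:16]
        return os.path.join(cache_dir, digest + ".pt")

    def save_kv(prefix_text: str, past_key_values) -> None:
        # past_key_values: per-layer (key, value) tensors, e.g. as returned by a
        # Hugging Face model called with use_cache=True.
        torch.save(past_key_values, kv_path(prefix_text))

    def load_kv(prefix_text: str):
        path = kv_path(prefix_text)
        return torch.load(path, map_location="cpu") if os.path.exists(path) else None

For long prefixes, reading the saved KV tensors from local disk (or S3) can beat redoing prefill; compressing the tensors, as CacheGen does, shrinks that transfer further.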
RT @bentomlai: What is KV cache offloading and why does it matter for LLM inference? #LLMs use the KV cache to accelerate inference spee…
0
2
0
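
The offloading question in the retweet above comes down to spilling cold KV blocks out of GPU memory and pulling them back on demand. The sketch below illustrates that GPU-to-pinned-CPU round trip in PyTorch under assumed names (KVOffloader, req_id); it is not BentoML's or LMCache's implementation.

    import torch

    class KVOffloader:
        """Toy offloader: hot KV blocks live on the GPU, cold ones in pinned CPU memory."""

        def __init__(self, device: str = "cuda"):
            self.device = device
            self.gpu_blocks: dict[str, torch.Tensor] = {}
            self.cpu_blocks: dict[str, torch.Tensor] = {}

        def offload(self, req_id: str) -> None:
            # Move one request's KV block GPU -> pinned host memory (async copy).
            block = self.gpu_blocks.pop(req_id)
            host = torch.empty(block.shape, dtype=block.dtype, device="cpu", pin_memory=True)
            host.copy_(block, non_blocking=True)
            self.cpu_blocks[req_id] = host

        def fetch(self, req_id: str) -> torch.Tensor:
            # Bring the block back to the GPU when its request becomes active again.
            if req_id not in self.gpu_blocks:
                host = self.cpu_blocks.pop(req_id)
                self.gpu_blocks[req_id] = host.to(self.device, non_blocking=True)
            return self.gpu_blocks[req_id]

Pinned host memory is what lets the copies overlap with compute; the payoff is that a GPU can hold KV state for far more sessions than fit in HBM alone.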
RT @lmcache: LMCache supports gpt-oss (20B/120B) on Day 1! TTFT 1.20s → 0.39s (-67.5%), finish time 15.70s → 7.73s (-50.7%) compared to va…
0
9
0
RT @lmcache: Big news from LMCache Lab! 3 papers accepted at SOSP '25 & NSDI '26, pushing the frontier of LLM-inference efficiency:…
0
6
0
RT @NadavTimor: KV cache go brrr with @JunchenJiang's @lmcache! Join us tomorrow to learn more about next-gen long-context LLM inference: h…
0
2
0
RT @zhzHNN: @hidecloud Hi, we are organizing a meetup in the Bay Area to discuss context engineering with @JunchenJiang and @lmcache. Are you…
0
1
0
RT @siddhantrayyy: With RAG and agents becoming ubiquitous in LLM systems, tuning quality and performance JOINTLY is essential to achieve t…
0
6
0
RT @this_will_echo: Believe it or not, even when an LLM generates just ONE SINGLE word, it can still be powerful! Say in recommendation:…
0
7
0
RT @lmcache: LMCache now turbocharges multimodal models in vLLM! By caching image-token KV pairs, repeated images now get ~100% cache hi…
0
12
0
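
The multimodal retweet above hinges on one idea: if cached image-token KV entries are keyed by the image content itself, a repeated image is a guaranteed hit. A toy sketch of that keying follows; the class and method names are hypothetical and this is not the vLLM/LMCache interface.

    import hashlib

    class ImageKVCache:
        """Toy cache: map a content hash of the image to its cached KV entry."""

        def __init__(self):
            self._store: dict[str, object] = {}

        @staticmethod
        def _key(image_bytes: bytes) -> str:
            return hashlib.sha256(image_bytes).hexdigest()

        def lookup(self, image_bytes: bytes):
            # Returns the cached KV entry for this exact image, or None on a miss.
            return self._store.get(self._key(image_bytes))

        def insert(self, image_bytes: bytes, kv_entry) -> None:
            self._store[self._key(image_bytes)] = kv_entry

Because the key depends only on the image bytes, two different prompts that reuse the same image still share the image-token portion of the cache.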
RT @lmcache: LMCache reaches 2,000+ stars on GitHub! A huge thank you to our open-source community – your support is fueling next-gen eff…
0
5
0
RT @GitHubGPT: LMCache. LMCache, an LLM engine, boosts performance by minimizing TTFT and enhancing throughput via effective KV cache ma…
github.com
Supercharge Your LLM with the Fastest KV Cache Layer - LMCache/LMCache
0
3
0
RT @lmcache: Our very own @JunchenJiang gave a talk about large-scale efficient inference at Open Source Summit 2025 yesterday with Yue Zh…
0
2
0