
Joshua Gu
@astrogu_
Followers
34
Following
24
Media
2
Statuses
30
CS Phd student @MIT, @MIT_CSAIL, @MITEECS๐จโ๐ป| @LMCache Lab | Previous: BS @UChicago. Research on AI Systems
Chicago, IL
Joined December 2023
๐ฅ Check it out! ๐ฅ.
Want to create your own LLM Inference Endpoint on Any Cloud in seconds? . We're announcing the alpha release of LMIgnite, the one-click high-performance inference stack built for speed and scale. Powered by LMCache, vLLM, and vLLM Production Stack. ๐ค Join the alpha and
0
0
2
Excited to share our latest work ๐ ๐๐ง๐๐ฆ at #SOSP2025. This oneโs special as itโs my first full CS project from start to finishโfrom early brainstorming and iterating on ideas to running experiments and writing the paper. Learned a ton, and perseverance finally paid off! ๐.
With RAG and agents becoming ubiquitous in LLM systems, tuning quality and performance JOINTLY is essential to achieve the best LLM quality-of-experience. Our paper at SOSP this year, addresses this exact tradeoff!๐ฅ
0
3
7
๐ฅ Tencent x @lmcache.
๐ ๐ง๐ฒ๐ป๐ฐ๐ฒ๐ป๐ x ๐๐ ๐๐ฎ๐ฐ๐ต๐ฒ Collaboration: Integrating ๐ ๐ผ๐ผ๐ป๐ฐ๐ฎ๐ธ๐ฒ Store for Enhanced LLM Inference Caching! ๐ฅฎ๐ฅฎ. Excited to share insights from a powerful collaboration between Tencent engineers and the LMCache Lab team! ๐. With the help from Tencent Engineers,
0
0
0
RT @lmcache: ๐ Exciting news from #EuroSys2025: Our work on CacheBlend won Best Paper! ๐. CacheBlend delivers the first-ever speedup for RAโฆ.
0
8
0
FAST!!! ๐ Thrilled to see all the hard work pay off!.
Our open-source LLM cluster deployment solution is 10x faster than SOTA OSS solution. Check out the vLLM Production-Stack!๐คฉ๐คฉ๐คฉ. Since Jan 2025, vLLM Production Stack has been the reference open-source vLLM inference cluster solution with advanced KV cache offloading and K8s
0
0
1
๐ vLLM Production Stack is here!.
๐ We're thrilled to announce vLLM Production Stackโan open-source, Enterprise-Grade LLM inference solution that is now an official first-party ecosystem project under vLLM!. Why does this matter?.A handful of companies focus on LLM training, but millions of apps and businesses
0
0
2