Vinod Grover @vinodg X Profile

Vinod Grover

@vinodg

Followers

3K

Following

9K

Media

79

Statuses

1K

Sr Distinguished Engineer @nvidia. Compilers, CUDA C++, PL, Machine Learning and Systems. tweets and opinions are personal.

Seattle, WA

Joined March 2008

Vinod Grover

@vinodg

15 days

RT @__tensorcore__: marks the start of a short series of blogposts about CUTLASS 3.x and CuTe that we've been meani…

developer.nvidia.com

In the era of generative AI, utilizing GPUs to their maximum potential is essential to training better models and serving users at scale. Often, these models have layers that cannot be expressed as…

0

50

0

Vinod Grover

@vinodg

1 month

RT @samim: It's time

0

3

0

Vinod Grover

@vinodg

1 month

RT @ShashiTharoor: Proud of Thiruvananthapuram girl Divi Bijesh, who won her second world title in under-10 chess this year! Long may she c…

onmanorama.com

Divi Bijesh bagged the U-10 girls title of FIDE World Cadets Cup at Batumi, Georgia.

0

239

0

Vinod Grover

@vinodg

1 month

RT @samim:

0

2

0

Vinod Grover

@vinodg

2 months

RT @awsTO: Here's a paper describing quantum computing using standard programming constructs, w/o linear algebra! Goal: demystify quantu…

0

55

0

Vinod Grover

@vinodg

3 months

RT @tqchenml: #MLSys2025 make sure to attend 10:30am keynote @istoica05 An AI stack: from scaling AI workloads to evaluating LLMs. Checkou…

0

15

0

Vinod Grover

@vinodg

3 months

RT @zhyncs42: MLSys 2025 is coming up! Want to meet the developers behind FlashInfer, XGrammar, and SGLang @lmsysorg in person? Join us for…

lu.ma

Join top engineers and researchers to explore the latest breakthroughs in AI infrastructure! Hosted by LMSYS Org, SGLang, FlashInfer and XGrammar, this event…

0

9

0

Vinod Grover

@vinodg

4 months

RT @_xjdr: This is an SM90 to SM100 porting guide deep research and i made that is mostly accurate. sharing in case others might find it us…

0

12

0

Vinod Grover

@vinodg

4 months

RT @RajeevAlur: Congratulations to Swarat Chaudhuri (PhD, @PennCIS 2007) for this wonderful honor from Guggenheim Foundation.

gf.org

Since 1925, the Guggenheim Foundation has given Fellowships to exceptional artists, writers, scholars, and scientists, empowering them to pursue meaningful work under the freest possible conditions.

0

6

0

Vinod Grover

@vinodg

4 months

RT @tqchenml: Happy to share our latest work at @ASPLOSConf 2025! LLMs are dynamic, both in sequence and batches. Relax brings an ML compil…

arxiv.org

Dynamic shape computations have become critical in modern machine learning workloads, especially in emerging large language models. The success of these models has driven the demand for their...

0

36

0

Vinod Grover

@vinodg

5 months

#NewProfilePic

0

15

Vinod Grover

@vinodg

5 months

RT @ye_combinator: Check out the intra-kernel profiler in flashinfer to visualize the timeline of each SM/warpgroup in the lifecycle of a C…

0

31

0

Vinod Grover

@vinodg

6 months

Accepted for publication in #MLSys25 conference!

Vinod Grover

@vinodg

6 months

Pipeline Parallelism in JAX!

1

2

27

Vinod Grover

@vinodg

6 months

RT @0xA95: If you would like to share your work on array programming please consider submitting your paper to ARRAY '25 (co-located with @P…

0

4

0

Vinod Grover

@vinodg

6 months

Pipeline Parallelism in JAX!

Vinod Grover

@vinodg

8 months

Scaling Deep Learning Training with MPMD Pipeline Parallelism. Joint work with @0xA95 @seanprime7 Hanfeng Chen.

0

1

32

Vinod Grover

@vinodg

6 months

RT @__tensorcore__: 🔥🚨 CUTLASS Blackwell is here 🚨🔥. 3.8 release is loaded with support for new features of Blackwell, even an attention ke…

0

32

0

Vinod Grover

@vinodg

7 months

Latest version of FlashInfer paper with some cool ideas!

Tianqi Chen

@tqchenml

7 months

Are you curious about how to build an efficient and customizable attention engine behind the scenes of major LLM serving frameworks? Check out the latest arXiv paper on FlashInfer about all the cool ideas from @ye_combinator.

0

2

19

Vinod Grover

@vinodg

8 months

RT @luisceze: Amazing to see FlashInfer’s traction in the short 8mo since it was first introduced. Try out the latest release.

0

2

0

Vinod Grover

@vinodg

8 months

RT @vartuattheghat: Check out @vinodg and team's work on scaling JAX based DL Training.

0

1

0

Vinod Grover

@vinodg

8 months

Scaling Deep Learning Training with MPMD Pipeline Parallelism. Joint work with @0xA95 @seanprime7 Hanfeng Chen.

arxiv.org

We present JaxPP, a system for efficiently scaling the training of large deep learning models with flexible pipeline parallelism. We introduce a seamless programming model that allows implementing...

0

5

27