Xavier Gonzalez

@xavierjgonzalez

Followers 395 · Following 1K · Media 5 · Statuses 133

PhD candidate studying AI at @Stanford. Advised by @scott_linderman. Parallelizing "inherently sequential" processes like RNNs and MCMC.

Stanford, CA
Joined January 2021
@xavierjgonzalez
Xavier Gonzalez
12 days
Parallelizing "inherently sequential" processes has become all the rage in the era of GPUs. But can we really parallelize anything? In work led by @Leokoz8 and me, we show that the "predictability" of a dynamical system determines whether it can be parallelized efficiently.
@scott_linderman
Scott Linderman
12 days
📣Announcing 2 @NeurIPSConf papers! "Parallelizing MCMC Across the Sequence Length": uses Newton iterations to parallelize MCMC! 🤯 But can we parallelize any nonlinear state space model? "Predictability Enables Parallelizability": proves what systems we can parallelize. 🧵
1
2
16
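The core idea above can be sketched in a few lines. This is a minimal toy, not the papers' actual method: a scalar contractive system (a "predictable" one), where instead of rolling the recurrence out one step at a time, we guess the whole trajectory and refine every time step simultaneously with fixed-point sweeps. The function `f` and all sizes here are illustrative assumptions.

```python
import numpy as np

def f(x):
    # toy contractive transition: a "predictable" system in the tweet's sense
    return 0.5 * np.tanh(x) + 0.1

def sequential_rollout(x0, T):
    # the "inherently sequential" baseline: one step at a time
    xs, x = [], x0
    for _ in range(T):
        x = f(x)
        xs.append(x)
    return np.array(xs)

def parallel_sweeps(x0, T, n_sweeps=25):
    # guess the whole trajectory, then refine every time step at once;
    # each sweep is one parallel pass over all T positions, and for a
    # contractive f the guesses converge in far fewer sweeps than T steps
    xs = np.zeros(T)
    for _ in range(n_sweeps):
        prev = np.concatenate(([x0], xs[:-1]))  # shifted trajectory guess
        xs = f(prev)                            # all T updates at once
    return xs
```

In the worst case a sweep only propagates information one step, but when the system is contractive the error shrinks geometrically per sweep, which is where predictability buys parallel speedup.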
@StefanoErmon
Stefano Ermon
4 days
When we began applying diffusion to language in my lab at Stanford, many doubted it could work. That research became Mercury diffusion LLM: 10X faster, more efficient, and now the foundation of @_inception_ai. Proud to raise $50M with support from top investors.
@_inception_ai
Inception
4 days
Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in
39
86
1K
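The sequential-vs-parallel contrast in the tweet can be made concrete with a toy sketch. This is purely illustrative and not Mercury's actual sampler; `predict_next` and `denoise_all` are hypothetical stand-in callables.

```python
# Hypothetical stand-ins: `predict_next` and `denoise_all` are placeholder
# callables, not a real language model.
def autoregressive_decode(predict_next, T):
    seq = []
    for _ in range(T):                 # T sequential steps: one token per step
        seq.append(predict_next(seq))
    return seq

def parallel_diffusion_decode(denoise_all, T, rounds=4):
    seq = [None] * T                   # start from a fully masked sequence
    for _ in range(rounds):            # each round refines every position at once
        seq = denoise_all(seq)         # rounds << T is where the speedup comes from
    return seq
```

The autoregressive loop has depth T no matter what; the diffusion-style loop has depth equal to the number of refinement rounds, which can be much smaller than the sequence length.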
@dan_biderman
Dan Biderman
10 days
Amazing work from a great team
0
1
17
@ekellbuch
Kelly Buchanan
11 days
Folks say RNNs can’t be parallelized, but they can! This cool new work shows which nonlinear systems can be parallelized, and that you can parallelize MCMC **within a chain**!
0
1
6
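The announcement thread mentions Newton iterations. Here is a rough sketch of that linearize-and-solve idea on a toy scalar system (an assumption for illustration, not the papers' implementation): linearize the nonlinear recurrence around the current trajectory guess, then solve the resulting *linear* recurrence for the next guess. The linear solve is written sequentially below for clarity, but it is an associative operation, so on parallel hardware it runs as a prefix scan in logarithmic depth.

```python
import numpy as np

def f(x):
    return 0.5 * np.tanh(x) + 0.1          # toy contractive transition

def fprime(x):
    return 0.5 * (1.0 - np.tanh(x) ** 2)   # its derivative

def solve_linear_recurrence(a, b, x0):
    # x_t = a_t * x_{t-1} + b_t; sequential here for clarity, but this
    # composition is associative, so it parallelizes as a scan
    xs, x = np.empty_like(b), x0
    for t in range(len(b)):
        x = a[t] * x + b[t]
        xs[t] = x
    return xs

def newton_trajectory(x0, T, iters=10):
    # Newton-style refinement: linearize f around the shifted trajectory
    # guess, then solve the linearized recurrence for the next guess
    xs = np.zeros(T)
    for _ in range(iters):
        prev = np.concatenate(([x0], xs[:-1]))
        a = fprime(prev)             # local Jacobians along the guess
        b = f(prev) - a * prev       # affine offsets of the linearization
        xs = solve_linear_recurrence(a, b, x0)
    return xs
```

Each Newton iteration touches the whole trajectory at once, and near the solution the iterates converge much faster than plain fixed-point sweeps.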
@jimmysmith1919
Jimmy Smith
12 days
Love this line of work from @scott_linderman @xavierjgonzalez @dmzoltowski and others on parallelizing nonlinear systems. The new papers explore parallelizing MCMC as well as characterizing the types of nonlinear systems that can or cannot be parallelized.
1
1
8
@dmzoltowski
David
12 days
Excited to share these two papers on parallelizing MCMC and theory for parallelizing nonlinear sequence models! This is a fun line of work and there is lots more to explore in these areas. Collabs with @SkylerWu9 @xavierjgonzalez @Leokoz8 Ken Clarkson & @scott_linderman
0
3
15
@krandiash
Karan Goel
13 days
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -
1K
1K
8K
@realJessyLin
Jessy Lin
20 days
As part of our recent work on memory layer architectures, I wrote up some of my thoughts on the continual learning problem broadly: Blog post: https://t.co/HNLqfNsQfN Some of the exposition goes beyond mem layers, so I thought it'd be useful to highlight separately:
26
171
1K
@EyubogluSabri
Sabri Eyuboglu
20 days
I think we're going to start seeing more continual learning techniques like this one that dynamically adapt which parameters are updated -- some really important ideas in here
@realJessyLin
Jessy Lin
20 days
🧠 How can we equip LLMs with memory that allows them to continually learn new things? In our new paper with @AIatMeta, we show how sparsely finetuning memory layers enables targeted updates for continual learning, w/ minimal interference with existing knowledge. While full
0
10
148
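The "targeted updates with minimal interference" idea above can be caricatured in a few lines. This is a simplified stand-in, not the paper's method: score every memory slot against a query, then finetune only the top-k matching slots, so untouched slots keep their stored knowledge. All names and shapes here are illustrative assumptions.

```python
import numpy as np

def sparse_memory_update(memory, query, target, k=2, lr=0.5):
    # memory: (num_slots, dim) array of learnable memory vectors.
    # Only the top-k slots most similar to the query are trainable;
    # the rest are frozen, limiting interference during continual learning.
    scores = memory @ query                     # similarity of each slot to the query
    touched = np.argsort(scores)[-k:]           # indices of the top-k slots
    memory[touched] += lr * (target - memory[touched])  # nudge only those slots
    return touched
```

The design point is that sparsity in *which parameters move* (not just in activations) is what protects previously stored knowledge.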
@jcrwhittington
James Whittington
18 days
Want the freedom of a fancy fellowship, but not the year-long wait or arduous application? Come join my lab! Work on neuroscience and AI, explore your creativity, be independent or work closely with me, collaborate widely, and have a lot of fun! https://t.co/zmROCA5Ib6
5
36
165
@hugo_larochelle
Hugo Larochelle
20 days
We at TMLR are proud to announce that selected papers will now be eligible for an opportunity to present at the joint NeurIPS/ICML/ICLR Journal-to-Conference (J2C) Track: https://t.co/CyjZtqbnBS
medium.com
Great news! We’re excited to announce that selected papers published in the Transactions on Machine Learning Research (TMLR) will now be…
14
78
460
@charles0neill
Charlie O'Neill
2 months
Today, we’re launching Parsed. We are incredibly lucky to live in a world where we stand on the shoulders of giants, first in science and now in AI. Our heroes have gotten us to this point, where we have brilliant general intelligence in our pocket. But this is a local minimum. We
57
59
488
@brianltrippe
Brian L Trippe
27 days
🚨New paper! Generative models are often “miscalibrated”. We calibrate diffusion models, LLMs, and more to meet desired distributional properties. E.g. we finetune protein models to better match the diversity of natural proteins. https://t.co/2c06vD0x2D https://t.co/9Tbhf6ml8K
3
45
199
@lambdaviking
William Merrill
1 month
My thesis, 𝘈 𝘵𝘩𝘦𝘰𝘳𝘺 𝘰𝘧 𝘵𝘩𝘦 𝘤𝘰𝘮𝘱𝘶𝘵𝘢𝘵𝘪𝘰𝘯𝘢𝘭 𝘱𝘰𝘸𝘦𝘳 𝘢𝘯𝘥 𝘭𝘪𝘮𝘪𝘵𝘢𝘵𝘪𝘰𝘯𝘴 𝘰𝘧 𝘭𝘢𝘯𝘨𝘶𝘢𝘨𝘦 𝘮𝘰𝘥𝘦𝘭𝘪𝘯𝘨 𝘢𝘳𝘤𝘩𝘪𝘵𝘦𝘤𝘵𝘶𝘳𝘦𝘴, is now online:
8
46
385
@KorbiPoeppel
Korbinian Poeppel
1 month
Bit late now, but happy to announce that "parallelizable Linear Source Transition Mark networks" #pLSTM was accepted at the main track of NeurIPS 2025! If you cannot make it to San Diego, watch the talk at the ASAP Seminar: https://t.co/RmUCYhos5E
1
8
16
@xavierjgonzalez
Xavier Gonzalez
1 month
Really enjoyed presenting our @NeurIPSConf papers at the ASAP seminar! With @LeoKoz8. Here is the talk on YouTube:
0
3
12
@xavierjgonzalez
Xavier Gonzalez
1 month
Super excited to be presenting my latest @NeurIPSConf paper "Predictability enables Parallelization of nonlinear SSMs" with @Leokoz8 at the ASAP seminar at 2 pm Eastern tomorrow, Tuesday 9/30. Tune in if you are interested! Zoom link: https://t.co/sYyzzMOvoL ASAP seminar:
0
1
4
@xavierjgonzalez
Xavier Gonzalez
1 month
@X I think @X requires that videos (like .mp4s) be vertical? The error messages could be a lot more helpful about explaining what aspect ratio is required, though.
0
0
0