Alex Tong @AlexanderTong7 X Profile

Alex Tong

@AlexanderTong7

Followers

3K

Following

350

Media

11

Statuses

189

Postdoc at Mila studying cell dynamics with Yoshua Bengio. I work on generative modeling and apply this to cells and proteins.

Montreal QC, Canada

Joined May 2017

Don't wanna be here? Send us removal request.

Alex Tong

@AlexanderTong7

21 days

RT @danyalrehman17: Wrapping up #ICML2025 on a high note — thrilled (and pleasantly surprised!) to win the Best Paper Award at @genbio_work….

0

11

0

Alex Tong

@AlexanderTong7

24 days

RT @martoskreto: we’re not kfc but come watch us cook with our feynman-kac correctors, 4:30 pm today (july 16) at @icmlconf poster session….

0

15

0

Alex Tong

@AlexanderTong7

24 days

Come check out SBG happening now! W-115 11-1:30 with.@charliebtan .@bose_joey .Chen Lin.@leonklein26 .@mmbronstein

0

18

91

Alex Tong

@AlexanderTong7

27 days

RT @jacobbamberger: 🚨 ICML 2025 Paper 🚨. "On Measuring Long-Range Interactions in Graph Neural Networks". We formalize the long-range probl….

0

16

0

Alex Tong

@AlexanderTong7

29 days

RT @bose_joey: 🚨 Our workshop on Frontiers of Probabilistic Inference: Learning meets Sampling got accepted to #NeurIPS2025!!. After the in….

0

3

0

Alex Tong

@AlexanderTong7

29 days

Thrilled to be co-organizing FPI at #NeurIPS2025! I'm particularly excited about our new 'Call for Open Problems'track. If you have a tough, cross-disciplinary challenge, we want you to share it and inspire new collaborations. A unique opportunity! Learn more below.

Kirill Neklyudov

@k_neklyudov

29 days

1/ Where do Probabilistic Models, Sampling, Deep Learning, and Natural Sciences meet? 🤔 The workshop we’re organizing at #NeurIPS2025!. 📢 FPI@NeurIPS 2025: Frontiers in Probabilistic Inference – Learning meets Sampling. Learn more and submit →

0

2

25

Alex Tong

@AlexanderTong7

1 month

RT @bose_joey: 🎉Personal update: I'm thrilled to announce that I'm joining Imperial College London @imperialcollege as an Assistant Profess….

0

34

0

Alex Tong

@AlexanderTong7

1 month

RT @fabian_theis: New OpenProblems paper out! 📝. Led by Malte Lücken with Smita Krishnaswamy, we present – a commun….

nature.com

Nature Biotechnology - Defining and benchmarking open problems in single-cell analysis

0

42

0

Alex Tong

@AlexanderTong7

2 months

Yes! We were heavily influenced by your work @JamesTThorn although I think still quite challenging to train energy based models.

James Thornton

@JamesTThorn

2 months

Nice to see energy based diffusion models + SMC/ Feynman Kac for temp annealing is useful for non-toy examples!.

0

7

Alex Tong

@AlexanderTong7

2 months

RT @bose_joey: 🚨 I heard people saying that Diffusion Samplers are actually not more efficient than MD? . Well, if that's you, check out ou….

0

5

0

Alex Tong

@AlexanderTong7

2 months

PITA fixes this by going back to a regular diffusion model training objective but with sequential Monte Carlo type correction.

0

2

Alex Tong

@AlexanderTong7

2 months

iDEM introduced a very effective but biased training scheme for diffusion-based samplers. This was great, but we were never able to scale it to real molecules.

arxiv.org

Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In...

0

2

8

Alex Tong

@AlexanderTong7

2 months

A bit of backstory on PITA: the project started with a key goal—to fix the inherent bias in prior diffusion samplers (like iDEM!). PITA leverages importance sampling to guarantee correctness. This commitment to unbiasedness is what gives PITA its power. See thread for details👇.

Kirill Neklyudov

@k_neklyudov

2 months

(1/n) Sampling from the Boltzmann density better than Molecular Dynamics (MD)? It is possible with PITA 🫓 Progressive Inference Time Annealing! A spotlight @genbio_workshop of @icmlconf 2025!. PITA learns from "hot," easy-to-explore molecular states 🔥 and then cleverly "cools"

2

4

34

Alex Tong

@AlexanderTong7

2 months

RT @brekelmaniac: Given q_t, r_t as diffusion model(s), an SDE w/drift β ∇ log q_t + α ∇ log r_t doesn’t sample the sequence of geometric a….

0

6

0

Alex Tong

@AlexanderTong7

2 months

RT @k_neklyudov: Why do we keep sampling from the same distribution the model was trained on?. We rethink this old paradigm by introducing….

0

26

0

Alex Tong

@AlexanderTong7

2 months

Check out FKCs! A principled flexible approach for diffusion sampling. I was surprised how well it scaled to high dimensions given its reliance on importance reweighting. Thanks to great collaborators @Mila_Quebec @VectorInst @imperialcollege and @GoogleDeepMind. Thread👇🧵.

Marta Skreta

@martoskreto

2 months

🧵(1/6) Delighted to share our @icmlconf 2025 spotlight paper: the Feynman-Kac Correctors (FKCs) in Diffusion. Picture this: it’s inference time and we want to generate new samples from our diffusion model. But we don’t want to just copy the training data – we may want to sample

2

5

34

Alex Tong

@AlexanderTong7

2 months

Using ESM2 through hugging face? Might as well speed it up with flash attention!.

Fred Zhangzhi Peng

@pengzhangzhi1

2 months

FlashAttention-accelerated Protein Language Models ESM2 now supports Huggingface. One line change, up to 70% faster and 60% less memory! 🧬⚡. Huggingface: Github:

0

2

16

Alex Tong

@AlexanderTong7

2 months

RT @k_neklyudov: The supervision signal in AI4Science is so crisp that we can solve very complicated problems almost without any data or RL….

0

22

0

Alex Tong

@AlexanderTong7

2 months

RT @bose_joey: Really excited about this new paper. As someone who spent a ton of time training regular flows with MLE and got burned FORT….

0

4

0

Alex Tong

@AlexanderTong7

2 months

I'm particularly excited about the new opportunities this opens up for new fast architectures that are trained with regression but also have fast and accurate likelihood computation as heavily used in e.g. our work on Boltzmann Generators

arxiv.org

Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann generators tackle this problem by pairing normalizing flows with...

0

2

15