Zuobai Zhang Profile
Zuobai Zhang

@Oxer22

Followers
696
Following
1K
Media
29
Statuses
123

Research intern @nvidia; Ph.D. student at @Mila_Quebec. Interested in deep generative model, drug discovery and protein science.

Montreal, Quebec
Joined September 2021
Don't wanna be here? Send us removal request.
@Oxer22
Zuobai Zhang
4 months
🚀 Excited to share Proteina, our protein structure generative model! Trained on 21M protein structures with scaled-up model size, with state-of-the-art designability and diversity. Huge thanks to the incredible NVIDIA GenAI team for an amazing internship experience! 🌟 #GenAI.
@karsten_kreis
Karsten Kreis
4 months
📢📢 "Proteina: Scaling Flow-based Protein Structure Generative Models". #ICLR2025 (Oral Presentation). 🔥 Project page: 📜 Paper: 🛠️ Code and weights: 🧵Details in thread. (1/n)
2
6
42
@Oxer22
Zuobai Zhang
2 months
Interesting analysis from Pascal! Combining different modalities for protein representation learning will be the future🚀🚀🚀.
@NotinPascal
Pascal Notin
2 months
Have we hit a "scaling wall" for protein language models? 🤔 Our latest ProteinGym v1.3 release suggests that for zero-shot fitness prediction, simply making pLMs bigger isn't better beyond 1-4B parameters. The winning strategy? Combining MSAs & structure in multimodal models!.
0
0
3
@Oxer22
Zuobai Zhang
2 months
RT @HannesStaerk: At #ICLR2025 in Singapore now. Looking for people to chat with :) . Also presenting ProtComposer:
0
7
0
@Oxer22
Zuobai Zhang
2 months
RT @DidiKieran: Excited to present my first paper officially as a PhD student now as an ICLR Oral this week! Super fun work with the GenAIR….
0
15
0
@Oxer22
Zuobai Zhang
2 months
RT @karsten_kreis: 🔥 I'm at ICLR'25 in Singapore this week - happy to chat!. 📜 With wonderful co-authors, I'm co-presenting 4 main conferen….
0
7
0
@Oxer22
Zuobai Zhang
2 months
RT @jiarlu: ICLR'25 | Structure Language Models for Protein Conformation Generation.Keywords: language modeling | discrete diffusion | PLM….
0
23
0
@Oxer22
Zuobai Zhang
3 months
RT @ArashVahdat: My GTC talk highlighting some of the Gen AI for science projects from my team at NVIDIA and the lessons we've learned alon….
0
18
0
@Oxer22
Zuobai Zhang
4 months
RT @karsten_kreis: 🔥Large-scale protein structure generation? Flow Matching? Fold class guidance? Big transformers? State-of-the-art benchm….
0
8
0
@Oxer22
Zuobai Zhang
4 months
RT @karsten_kreis: 🔥Excited to present *Proteina* next week (accepted as Oral to ICLR'25) in the ML Protein Engineering Seminar Series. Hea….
0
9
0
@Oxer22
Zuobai Zhang
4 months
We’re giving a tutorial on protein design at #AAAI2025! Come and join us if you’re interested.
@jiarlu
Jiarui Lu
4 months
🤔Excited about Protein + AI? Join our "AI for Protein Design" tutorial at #AAAI2025, with gorgeous @Oxer22 @divnori @WengongJin @tangjianpku and Jiwoong Park. 🧬✨. 📅 8:30–12:30 (EST), Feb 26 .📍Room 117, Philadelphia Convention Center, Philadelphia.👉
Tweet media one
1
1
19
@Oxer22
Zuobai Zhang
5 months
RT @XinyuYuan402: 🚀scCello (NeurIPS2024 Spotlight) is now open-source! .We’re excited to release the code and data for scCello, a cell-onto….
0
10
0
@Oxer22
Zuobai Zhang
7 months
RT @karsten_kreis: @ArashVahdat and I will chat about opportunities in NVIDIA's fundamental generative AI research team. 🔥 Protein and mol….
0
7
0
@Oxer22
Zuobai Zhang
7 months
I’ll present our work tomorrow afternoon at East Exhibition Hall #1211. Come and have a chat if you’re interested in protein design and representation learning! #NeurIPS2024.
@Oxer22
Zuobai Zhang
7 months
🤔 How does structural information help in protein fitness prediction? . 🚀 Excited to share our work S3F (Sequence-Structure-Surface Fitness) model at #NeurIPS2024, a multimodal model for zero-shot protein fitness prediction! 🧬 #AIforScience.Paper: 1/🧵
Tweet media one
0
1
13
@Oxer22
Zuobai Zhang
7 months
We hope this work can help justify the value of structure and surface information for augmenting pure sequence-based models!.
0
0
0
@Oxer22
Zuobai Zhang
7 months
🙏 A huge thank you to my incredible collaborators @NotinPascal, Yining, Aurelie, @vijiltw, @deboramarks, @payel791 and my advisor @tangjianpku for making this work possible!.
1
0
2
@Oxer22
Zuobai Zhang
7 months
Curious to learn more? Explore our code here:.👉 We can’t wait to discuss this at #NeurIPS2024 in Vancouver! 🌟 🇨🇦.6/🧵.
1
0
2
@Oxer22
Zuobai Zhang
7 months
🔬 Case Study: GB1. How do structure and surface help capture epistasis?.(a-c) Spearman’s correlation for all 361 mutations at site pairs:.▪️ ESM struggles (a).▪️ S2F & S3F perform better (b, c), especially for key residue pairs distant in sequence but close in 3D space. 5/🧵
Tweet media one
1
0
0
@Oxer22
Zuobai Zhang
7 months
🖼️ How do sequences, structures, surfaces, and MSA contribute to protein fitness prediction?. Our analysis compares these aspects across key factors:.(a-d) Function type, MSA depth, taxon, mutation depth.(e-f) Structure quality, pLDDT score.(g) Generalization to OOD assays.4/🧵
Tweet media one
1
0
0
@Oxer22
Zuobai Zhang
7 months
🎯🔬S3F achieves very strong results on the zero-shot ProteinGym benchmark (217 mutational assays) by integrating different aspects of proteins - sequence, structure, surface and MSA!.3/🧵
Tweet media one
1
0
2
@Oxer22
Zuobai Zhang
7 months
Protein function depends on sequence, structure, and surface topology. While recent methods use structures, they show only incremental gains over sequence-only models. Surface details? Overlooked. 🧩 S3F integrates:.🔹 Sequence embeddings.🔹 Structure.🔹 Surface topology.2/🧵
Tweet media one
1
0
1