NotinPascal Profile Banner
Pascal Notin Profile
Pascal Notin

@NotinPascal

Followers
1K
Following
2K
Media
43
Statuses
430

Research in AI for Protein Design @Harvard | Prev. CS PhD @UniofOxford, Maths & Physics @Polytechnique

Boston
Joined September 2020
Don't wanna be here? Send us removal request.
@NotinPascal
Pascal Notin
23 days
🧬 😴 TIRED: Scaling protein models to billions of parameters hoping they'll memorize all of evolution and generalize beyond.🔥 WIRED: Smart retrieval-augmented models that dynamically access what they need from sequence databases.
@ruben_weitzman
Ruben Weitzman
24 days
🚨ICML Paper Alert🚨.What if finding the right protein homologs wasn't a slow search, but a learned part of the model itself?.We introduce 𝐏𝐫𝐨𝐭𝐫𝐢𝐞𝐯𝐞𝐫, an end-to-end framework that learns to retrieve the most useful homologs for self-supervised reconstruction! (1/12)
Tweet media one
1
6
92
@NotinPascal
Pascal Notin
14 days
RT @schwabpa: Save the date! Machine Learning for Drug Discovery (MLDD) is happening soon on Monday 30 June, 2025. MLDD aims to bring toge….
0
29
0
@NotinPascal
Pascal Notin
17 days
RT @AvivSpinner: I wonder if the gym membership for ProteinGym covers RNAGym as well?? oh wait, it's all free and 100% open source!.
0
4
0
@NotinPascal
Pascal Notin
17 days
Links:.🔗 Paper: 💻 Code: 9/9.
0
1
5
@NotinPascal
Pascal Notin
17 days
1
0
6
@NotinPascal
Pascal Notin
17 days
The moderate performance across all tasks reveals exciting opportunities! Key directions: RNA-specific training data, integrating structure-function relationships, and improving non-canonical base pair prediction. RNAGym provides a standardized foundation for progress. 7/9.
1
0
1
@NotinPascal
Pascal Notin
17 days
🌀 Tertiary structure: 215 diverse 3D structures from recent PDB entries. NuFold leads monomers (0.393 TM-score), AlphaFold3 dominates complexes (0.381 TM-score). Non-Watson-Crick interactions remain a major challenge for all methods.6/9
Tweet media one
2
3
7
@NotinPascal
Pascal Notin
17 days
🔗 Secondary structure: 901k chemical mapping profiles using DMS & 2A3 reactivity. EternaFold achieves top performance (0.656 F1-score), closely followed by CONTRAfold & Vienna. Traditional thermodynamic methods are still competitive with newer deep learning approaches.5/9
Tweet media one
1
0
1
@NotinPascal
Pascal Notin
17 days
🔬 Fitness prediction: 70 assays across tRNA, ribozymes, aptamers & mRNAs (1M+ mutations). Evo 2 performs best overall, but performance varies dramatically by RNA type: RNA-FM excels at tRNA/aptamers while Evo 2 leads mRNA tasks. Lots of room for improvement across the board!.4/9
Tweet media one
1
0
2
@NotinPascal
Pascal Notin
17 days
RNAGym tackles three essential RNA prediction tasks: 🔬 Fitness prediction: How mutations affect RNA function 🔗 Secondary structure: Base-pairing patterns 🌀 Tertiary structure: 3D molecular architecture.All evaluated zero-shot to test true generalization!.3/9.
1
0
2
@NotinPascal
Pascal Notin
17 days
Why do we need this? RNA modeling faces major challenges: limited experimental data (<1% of PDB entries), inherently less stable structures than proteins, and evaluation has been scattered across different studies with varying approaches. 2/9.
1
0
4
@NotinPascal
Pascal Notin
17 days
🚨 New paper 🚨 RNA modeling just got its own Gym! 🏋️ Introducing RNAGym, large-scale benchmarks for RNA fitness and structure prediction. 🧵 1/9
Tweet media one
3
47
226
@NotinPascal
Pascal Notin
20 days
RT @AvivSpinner: What would our data landscapes look like if we could biochemically characterize 100s, 1000s, 10^n evolutionary sequences?….
0
23
0
@NotinPascal
Pascal Notin
23 days
RT @lood_ml: As sequence databases get bigger and more diverse, retrieval-based methods provide an interesting alternative to scaling succe….
0
2
0
@NotinPascal
Pascal Notin
23 days
RT @KevinKaichuang: End-to-end differentiable homology search for protein fitness prediction. @ruben_weitzman @lood_ml @yaringal @debora….
0
14
0
@NotinPascal
Pascal Notin
23 days
More details in @RubenWeitzman's excellent tweetorial above and in a new blog post: paper:
1
0
4
@NotinPascal
Pascal Notin
23 days
Introducing Protriever — SOTA fitness prediction for sequence-based models. Homolog retrieval orders of magnitude faster than prior approaches. These architectures will be essential building blocks for next-gen protein models!.
1
0
3
@NotinPascal
Pascal Notin
2 months
Future-proofed vaccine design with generative models that predict viral evolution 👇.
@nooryoussef03
Noor Youssef
2 months
🚨 New in @ImmunityCP !.EVE-Vax, an AI model that anticipates future viral evolution and designs antigens to proactively test vaccines + therapeutics—before variants even emerge. We envision this work will help make future-proofed vaccines and therapeutics. 👇 (1/7).
1
2
19
@NotinPascal
Pascal Notin
2 months
RT @nooryoussef03: 🚨 New in @ImmunityCP !.EVE-Vax, an AI model that anticipates future viral evolution and designs antigens to proactively….
0
22
0
@NotinPascal
Pascal Notin
2 months
Even simple methods leveraging these 2 modalities significantly outperform billion-parameter sequence-only models. So, what's next? Better retrieval, advanced multimodal approaches, & alignment. Read more: #BioTech #AI #pLMs.
1
5
24