Shiyu Jiang
@shiyu_jiang23
Followers
76
Following
972
Media
0
Statuses
86
PhD student @qcb_usc Prev. @UVACPHG @JohnsHopkins @Westlake_Uni @FCBarcelona Fan
Los Angeles, CA
Joined November 2023
17
473
5K
Excited to be part of this work! We used ColabSaprot to finetune and run inference on an experimental sequence-to-expression dataset for another work — the results aligned well with experimental validation and the process was quite smooth.
ColabSaprot & SaprotHub are now in @NatureBiotech! 🧬 A user-friendly, no-code platform for training, sharing, and collaborating on protein language models. We also provide ColabSeprot, integrating ESM1b, ESM2, ProTrek, and ProtBert for the community. https://t.co/OIh5t7uvPE
0
2
5
Westlake University deeply mourns the passing of Professor Chen-Ning Yang — Nobel laureate in Physics, Honorary Chair of the Board of Trustees of Westlake University, and a member of Westlake’s Advisory Board. Professor Yang’s century-long life was a bridge between East and West,
1
21
79
Small Molecule Approach to RNA Targeting Binder Discovery (SMARTBind) Using Deep Learning Without Structural Input 1. SMARTBind is a novel framework that identifies RNA-binding small molecules using only RNA sequences, eliminating the need for structural data. This approach
0
9
31
Congratulations to the team!
Our trimodal pLM, ProTrek, is now in @NatureBiotech! Search protein by function, not just seq/struc: BLAST ➡️ 🧬 Seq Foldseek ➡️ 🏗️ Struc ProTrek ➡️ 💡 Func (Text) 🔗 Try: https://t.co/2NBaH0QOeY 🐙 GitHub: https://t.co/T3zPFix8ta 📊 5B embeddings: https://t.co/FYLVV814UV
0
0
2
New paper on Biorxiv is a massive and long term collaboration with Yanjun Li and Chenglong Li on using AL/ML to design small molecules targeting RNA. @UFHealth and @UFHealthCancer the approach is broadly applicable and an advance on current approaches. We are exited for apps.
Small Molecule Approach to RNA Targeting Binder Discovery (SMARTBind) Using Deep Learning Without Structural Input https://t.co/RmE6Qn10j0
#biorxiv_bioinfo
3
7
35
Small Molecule Approach to RNA Targeting Binder Discovery (SMARTBind) Using Deep Learning Without Structural Input https://t.co/RmE6Qn10j0
#biorxiv_bioinfo
biorxiv.org
Accurate identification of small molecule binders to RNA is critical for chemical probes and therapeutics. Computational approaches offer a cost-effective strategy to identify small molecules...
0
3
18
De novo Design of All-atom Biomolecular Interactions with RFdiffusion3 🚀 New preprint from David Baker!🚀 1. A groundbreaking study introduces RFdiffusion3 (RFD3), a novel diffusion model that generates protein structures in the context of ligands, nucleic acids, and other
1
79
295
Many of the most complex and useful functions in biology emerge at the scale of whole genomes. Today, we share our preprint “Generative design of novel bacteriophages with genome language models”, where we validate the first, functional AI-generated genomes 🧵
41
330
1K
MIT Course announcement: Machine Learning for Computational Biology #MLCB25 Fall'24 Lecture Videos: https://t.co/Puzq59oni4 Fall'24 Lecture Notes: https://t.co/RRNRY1SZaZ (a) Genomes: Statistical genomics, gene regulation, genome language models, chromatin structure, 3D genome
Today was my last lecture for @MIT #ComputationalBiology: #Genomes, #Networks, #Evolution, #Health. I recorded each and immediately posted online here: https://t.co/z7eouWlosl Please do share, and let me know which topics need more explanations, clarifications, and corrections!
22
464
2K
BioEmu now published in @ScienceMagazine !! What is BioEmu? Check out this video: https://t.co/PAj96iKvR7
Today in the journal Science: BioEmu from Microsoft Research AI for Science. This generative deep learning method emulates protein equilibrium ensembles – key for understanding protein function at scale. https://t.co/WwKjj5B0eb
15
107
406
The past few years of "AI for life sciences" has been all about the models: AF3, NNPs, PLMs, binder generation, docking, co-folding, ADMET, &c. But Chai-2, and lots of related work, shows us that the vibes are shifting. Models themselves are becoming just a building block; the
What is Chai-2? It is a "series of models." This includes a "multimodal generative architecture, integrating all-atom structure prediction and generative modeling" (to me this sounds like AF3 and their earlier Chai-1). The release is a bit vague; this graphic is the best info:
6
31
202
🚨 New paper 🚨 RNA modeling just got its own Gym! 🏋️ Introducing RNAGym, large-scale benchmarks for RNA fitness and structure prediction. 🧵 1/9
3
49
223
Predicting function of evolutionarily implausible DNA sequences 1.This work introduces NULLSETTES, the first benchmark to evaluate genomic language models (gLMs) on their ability to predict loss-of-function (LOF) mutations in synthetic DNA sequences that lack evolutionary
0
6
14
It’s finally happening!!! Diffusion is so much more satisfying than autoregressive for protein & DNA sequences that don’t really have directionality 🥹 Waiting for this to empirically land & replace BERT/one-step discrete diffusion for protein foundation models 👀
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
13
31
585
RLXF is on @biorxivpreprint! We introduce a PPO-based workflow to align the logits of any protein language model away from evolutionary plausibility and towards biochemical function. Not only does this seem to be more robust than other preference optimization methods... (1/3)
2
25
122
Interested in single cell genomics but need help getting started? Check out the full agenda for our Single Cell Genomics Day on Friday 4/25. All talks will be live-streamed (no registration required) at https://t.co/JqaO6urVPF
4
205
585
#RNA structure prediction remains unsolved -- nice tech feature incl #deeplearning and #CASP16 in @Nature
nature.com
Nature - AlphaFold’s highly accurate structural models transformed protein biology,but RNA lags behind.
1
22
82
Announcing Neo-1: the world’s most advanced atomistic foundation model, unifying structure prediction and all-atom de novo generation for the first time - to decode and design the structure of life 🧵(1/10)
39
374
2K
🧬 Meet Lyra, a new paradigm for accessible, powerful modeling of biological sequences. Lyra is a lightweight SSM achieving SOTA performance across DNA, RNA, and protein tasks—yet up to 120,000x smaller than foundation models (ESM, Evo). Bonus: you can train it on your Mac. read
18
149
736