Jiecong Lin
@JasonLinjc
Followers
232
Following
2K
Media
6
Statuses
272
Postdoc at @harvardmed @MGHPathology @DFBC_PedCare @HKUniversity, passionate about developing deep learning models to decipher gene regulation
Joined July 2017
Excited to share our work introducing EPInformer🧬, a scalable and lightweight deep learning framework to predict gene expression by integrating promoter-enhancer sequences with epigenomic signals and chromatin contacts. 📜 https://t.co/YSWOsueEJZ (1/11)
biorxiv.org
Transcriptional regulation, critical for cellular differentiation and adaptation to environmental changes, involves coordinated interactions among DNA sequences, regulatory proteins, and chromatin...
2
16
100
Exciting new joint study out in @Nature today from Mineto Ota (Marson and @jkpritch labs) - Causal modelling of gene effects from regulators to programs to traits
nature.com
Nature - Approaches combining genetic association and Perturb-seq data that link genetic variants to functional programs to traits are described.
3
29
137
The model of gene expression taught in school is highly misleading! Transcription factors are proteins that bind to DNA and then help repress, or activate, the expression of genes. Cells have hundreds of different types of transcription factors, each tuned to regulate different
18
244
1K
Excited to share our new paper on predicting gene expression in yeast! We introduce "Shorkie," a supervised ML model that builds off a self-supervised foundation to interpret regulatory DNA. Preprint:
1
5
37
Excited to share that Ctrl-DNA, our constrained RL + Genomic Language Model system for cell-type–specific regulatory DNA design, co-led with @xingyuchen67, was accepted as NeurIPS 2025 Spotlight (top 3.2%) 🧬✨ Paper: https://t.co/kZHZ4YFcdA Code: https://t.co/wO42Qv2chY
1
1
4
Excited for a major milestone in a collab effort led by @jengreitz to map enhancers & interpret variants in the human genome: The E2G Portal https://t.co/O9CRC1gRsf collates predictions of enhancer-gene regulatory interactions across >1,600 cell types & tissues. Use cases 👇1/
6
30
124
OmniPath ( https://t.co/g2JRIODn2k): integrated knowledgebase for multi-omics analysis https://t.co/SYaCvU76su 🧬🖥️🧪 Python module https://t.co/n7As1pmtmv R package https://t.co/bwzjAmuab8
#Rstats
1
66
306
Latest genomic AI models report near-perfect prediction of pathogenic variants (e.g. AUROC>0.97 for Evo2). We ran extensive independent evals and found these figures are true, but very misleading. A breakdown of our new preprint: 🧵
9
118
482
Ever wish you could hit "undo" on disease? 🩺🔄 https://t.co/VW9BSsvJd7 Most drug discovery asks: what does this perturbation do to cells? But we can also ask the reverse: which perturbations undo a disease signature and move cells back toward health? That's the idea behind
8
84
408
This is the single most important resource for genomics research. Earth’s DNA is now at our fingertips. What a heroic effort and mega congrats to @RayanChikhi and @RNA_Life!!!
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL
0
5
48
In the genomics community, we have focused pretty heavily on achieving state-of-the-art predictive performance. While undoubtedly important, how we *use* these models after training is potentially even more important. tangermeme v1.0.0 is out now. Hope you find it useful!
3
22
97
In April '25, I shared the origin story of Evo on the TED stage. I talked about the motivation behind generating DNA with AI and how it could change what’s possible. It was an incredible experience. full video: https://t.co/tstT6cQI6u
1
8
55
🚀 OncoGAN is now published in @CellGenomics! We introduce an AI system that generates high-fidelity, privacy-preserving synthetic cancer genomes — now open-access. 🔍 Why OncoGAN? --No patient data leakage — critical for genomic privacy --Built-in ground truth — ideal for
4
26
141
Tonight’s reading material. A team at DeepMind wrote this piece on how you must think about GPUs. Essential for AI engineers and researchers.
17
343
2K
🚀 Introducing PantheonOS ( https://t.co/NZM3wcHXbG): A Fully Open-Source Agent OS for Science PantheonOS began as a research project in my Stanford lab and has since evolved into a vision to redefine data science in the era of AI—starting with computational biology, especially
2
38
119
I'm glad that I had a chance to contribute to this wide-ranging article discussing the myriad ways ML is being used in genomics:
nature.com
Nature - Scientists are seeking to decipher the role of non-coding DNA in the human genome, helped by a suite of artificial-intelligence tools.
0
9
49
Introducing STELLA — a Self-Evolving LLM agent that autonomously creates its own tools to navigate and accelerate biomedical research. 🤖 Why It Matters: ⛓️ The Limitation: Most AI agents are fundamentally limited by a fixed set of predefined tools. This is a major bottleneck
0
4
33
🤝Excited to partner with Tamarind @kavi_deniz to build towards an agentic AI protein designer. 🔁 Agentic protein optimization — Starting from a sequence, Biomni iteratively improves thermostability by orchestrating AlphaFold-2, ThermoMPNN, and reasoning over predictions and
4
16
135
Excited to be at #ISMBECCB2025! I'll be presenting at RegSys on "Learning the Regulatory Genome by Destruction and Creation" (Thursday 3:15PM in Room 11BC). I'm looking forward to connecting with both new and old friends throughout the conference! 🧬 #CRISPR #Genomics #Diffusion
1
3
27
Some aspects of AI discourse seem to come from a different planet, oblivious to basic realities on Earth. AI for science is one such area. In this new essay, @sayashk and I argue that visions of accelerating science through AI should be considered unserious if they don't confront
normaltech.ai
Confronting the production-progress paradox
17
71
249