Will Connell
@wilstc
Followers
689
Following
2K
Media
28
Statuses
303
predicting phenotypes 🖥🧬🔮 @transcriptabio
Joined March 2016
🧬🔮 Single cell foundation models have been a recent hot topic in bio-ML! A few of the recent methods and some thoughts 🧬🔮 1) Geneformer 2) scGPT 3) scFoundation 4) Exceiver
4
59
253
The recent breakthroughs from @nablabio & @chaidiscovery emphasize a split in early biotech strategy. For the specific range of problems that antibodies address, making the binder, is becoming trivial. This forces a choice between 'fast but competitive' and 'AI intractable'. 🧵
4
11
92
This is a major reason I joined @transcriptabio We've proved our platform in rare disease – an area that uniquely allows you to: 1) realize the mission of helping people, immediately 2) receive the gold-standard of clinical feedback, immediately 📄:
researchsquare.com
Discovering new and viable therapies for genetic diseases is a time consuming and cost intensive process. This is even more challenging for rare disorders that affect a small fraction of the popula...
I will not stop tweeting this until every drug discovery company gets human evidence in 3 years or less
0
0
2
Most current drug discovery efforts is structure-based eg. create small molecules or antibodies that best binds X. However, a drug may not drive its efficacy from its strongest binder. Taking a step away from structure-paradigm, we reason that if a CRISPR knockout of a gene
10
52
209
“this discussion on the challenges of evaluating a Foundation model is more interesting than the challenge itself.” Agreed!
My second post on the Arc Virtual Cell Challenge. The challenge’s Discord forums are in turmoil. Some participants have discovered a trick to get to the top of the leaderboard. https://t.co/aRb83MyCCn
#arc_virtual_cell_challenge #foundation_models
1
0
6
We're excited to present LeaVS, a method to scale up learning for protein function models. It is based on the co-design of wet lab experiments and in silico training.
4
11
52
Arc is hiring a unique role to lead the Virtual Cell Challenge. In its first year the Challenge has already attracted participation from thousands of top bio AI researchers and support from sponsors like NVIDIA. We need someone to help us make this annual competition historically
5
13
97
Massive, clean Pertub-seq dataset. 8M cells, 2 cell types, deeply sequenced. 🧬🪩 👏
Virtual Cell community - this one's for you! X-Atlas/Orion is now live on Hugging Face. Train your own models with streamlined workflows built into the Hugging Face API. 🔗 HuggingFace: https://t.co/U2BGIBGx4w 📜 License: cc-by-nc-sa-4.0
0
0
2
Awesome 👏🏼👏🏼
Welcome to the age of generative genome design! In 1977, Sanger et al. sequenced the first genome—of phage ΦX174. Today, led by @samuelhking, we report the first AI-generated genomes. Using ΦX174 as a template, we made novel, high-fitness phages with genome language models. 🧵
0
0
0
👏👏 last year, I used some metrics from Logan to help me analyze the volume and growth rate of genomics data you can find "Scaling biology: genomics" in the reply
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL
1
0
8
*using the ctrl pop as a ref biases perturbation effects to be poorly distinguishable* a new study by @mariabrbic's lab reaches the same conclusion as the @ShiftBioscience study I highlighted recently 👏 ...with an nice fwd comment on model "utility" 😎 https://t.co/jTqs8oM1Ti
What is the state of research on the emerging grand challenge of virtual cell modeling? 1/n https://t.co/lOm7d8nd0g
0
1
5
The biggest challenge for AI in biology isn't just models, it's the data used to train them. Standard biological data isn't built for AI. To unlock generative AI for drug discovery, we must rethink how we generate and capture data. 1/
2
27
230
I highlight a recent paper from @BoWang87 and colleagues at @ShiftBioscience that mark new progress towards the larger goal.
1
0
4
Second, the field has been rallying to describe metrics that provide high signal and prediction "utility". Good measurements, confidence metrics, and directly actionable outcomes are crucial for utility. 3/n
1
0
1
First of all, it's important to delineate between cis- and trans- gene regulatory modeling. 2/n
1
0
2
What is the state of research on the emerging grand challenge of virtual cell modeling? 1/n https://t.co/lOm7d8nd0g
2
11
103
I highlight a recent paper from @BoWang87 and colleagues at @ShiftBioscience that mark new progress towards the larger goal.
0
0
0
Second, the field has been rallying to describe metrics that provide high signal and prediction "utility". Good measurements, confidence metrics, and directly actionable outcomes are crucial for utility.
1
0
0
First of all, it's important to delineate between cis- and trans- gene regulatory modeling. 2/n
1
0
0
Further, some simple changes may greatly improve the many available methods, particularly when compared against these new well-calibrated baselines. There is much more progress to follow from this. Great job @BoWang87 and the @shiftbioscience team!
0
0
1