Will Connell Profile
Will Connell

@wilstc

Followers
655
Following
1K
Media
27
Statuses
289

predicting phenotypes 🖥🧬🔮 @transcriptabio

Joined March 2016
Don't wanna be here? Send us removal request.
@wilstc
Will Connell
2 years
🧬🔮 Single cell foundation models have been a recent hot topic in bio-ML! A few of the recent methods and some thoughts 🧬🔮. 1) Geneformer.2) scGPT.3) scFoundation.4) Exceiver.
4
58
250
@wilstc
Will Connell
11 days
RT @lizbwood: The biggest challenge for AI in biology isn't just models, it's the data used to train them. Standard biological data isn't b….
0
25
0
@wilstc
Will Connell
25 days
I highlight a recent paper from @BoWang87 and colleagues at @ShiftBioscience that mark new progress towards the larger goal.
1
0
4
@wilstc
Will Connell
25 days
Second, the field has been rallying to describe metrics that provide high signal and prediction "utility". Good measurements, confidence metrics, and directly actionable outcomes are crucial for utility. 3/n.
1
0
1
@wilstc
Will Connell
25 days
First of all, it's important to delineate between cis- and trans- gene regulatory modeling. 2/n.
1
0
2
@wilstc
Will Connell
25 days
What is the state of research on the emerging grand challenge of virtual cell modeling? 1/n.
Tweet media one
2
10
101
@wilstc
Will Connell
25 days
I highlight a recent paper from @BoWang87 and colleagues at @ShiftBioscience that mark new progress towards the larger goal.
0
0
0
@wilstc
Will Connell
25 days
Second, the field has been rallying to describe metrics that provide high signal and prediction "utility". Good measurements, confidence metrics, and directly actionable outcomes are crucial for utility.
1
0
0
@wilstc
Will Connell
25 days
First of all, it's important to delineate between cis- and trans- gene regulatory modeling. 2/n.
1
0
0
@wilstc
Will Connell
1 month
Further, some simple changes may greatly improve the many available methods, particularly when compared against these new well-calibrated baselines. There is much more progress to follow from this. Great job @BoWang87 and the @shiftbioscience team!.
0
0
1
@wilstc
Will Connell
1 month
The field has been circling the paradox of poor method performance wrt to simple baselines. My (optimistic) takeaway is that the methods aren’t necessarily bad, but our measuring sticks have been. We’ve understood this – but the study does a great job rigorously documenting why. .
1
0
0
@wilstc
Will Connell
1 month
Unfortunately, many studies have favored a narrative over actionable conclusions — for example, notably omitting R², a classical regression metric that captures variance and scale relative to baseline. There is much more nuance explored in the study. .
1
0
0
@wilstc
Will Connell
1 month
To my relief, they highlight a blatant failure of Pearson correlation for evaluating simulation of trans-gene expression. PC ignores magnitude of deviation from baseline, yet it’s often reported in place of more informative metrics. .
1
0
0
@wilstc
Will Connell
1 month
1) new metrics that appropriately consider biased baselines. 2) a loss function that calibrates the model to learn the most important, sparse signals. .
1
0
0
@wilstc
Will Connell
1 month
👏👏👏. High-impact results here, a thorough study on the pitfalls of current “trans-gene expression” virtual cell modeling metrics and proposals for. .
@BoWang87
Bo Wang
1 month
Do deep generative models in single-cell omics really work for perturbation prediction?. Some benchmark studies say yes:.🔗 🔗Others say no:.🔗 🔗 BMC Genomics: To move beyond this
Tweet media one
1
0
4
@wilstc
Will Connell
1 month
It’s early days, but this is the kind of effort that helps structure the space. Looking forward to seeing how the field responds—and how the benchmark evolves, which is a laudable goal in itself. n/n.
0
0
1
@wilstc
Will Connell
1 month
You could also blend these approaches. These aren’t fancy models—but they may be strong baselines. For the comp they may help tease apart what’s truly generalized vs cleverly matched. 8/n.
1
0
0
@wilstc
Will Connell
1 month
2️⃣ Submit perturbation profiles from a cell line whose perturbed states match the X% of H1 perturbations that are revealed (if this exists). Align in output space, then map across. 7/n.
1
0
0
@wilstc
Will Connell
1 month
1️⃣ Directly submit perturbation profiles from a public dataset where the basal (unperturbed) expression is most similar to H1 (if this exists). Basal transcriptional proximity might make the transfer surprisingly effective. 6/n.
1
0
1
@wilstc
Will Connell
1 month
Based on past comps, I’d expect strong submissions to blend ML techniques with smart data reuse. A couple baseline “hacks” come to mind: 5/n.
1
0
1
@wilstc
Will Connell
1 month
It’s not zero-shot: participants get expression profiles for X% of perturbations in H1 (in Arc's State paper it was 30%). So it’s really a few-shot adaptation task, not a full generalization eval. 4/n
Tweet media one
1
0
0