Eugene Katsevich Profile
Eugene Katsevich

@EugeneKatsevich

Followers
276
Following
66
Media
7
Statuses
35

Assistant professor of statistics @Wharton, PhD @Stanford. Seeking to unravel the mysteries of #genomics and human disease with #statistics.

Philadelphia
Joined August 2020
Don't wanna be here? Send us removal request.
@EugeneKatsevich
Eugene Katsevich
3 years
I recently added some resources for students to my webpage (. It includes a guide to navigating the statistics job market, which I hope will be useful to those applying this year!.
0
15
76
@EugeneKatsevich
Eugene Katsevich
8 months
RT @nevillesanjana: Delighted to share new work from our lab: .MultiPerturb-seq 🎛️❌📥📤. Over the last few years, we've been combining CRIS….
0
58
0
@EugeneKatsevich
Eugene Katsevich
2 years
RT @davidliwei: Our latest computational model to unlock the power of single-cell perturbations (e.g., Perturb-seq), by i. quantifying part….
0
9
0
@EugeneKatsevich
Eugene Katsevich
2 years
RT @macfound: Rina Foygel Barber (@UChicagoPSD) is a statistician and 2023 MacArthur Fellow investigating the theoretical foundations of fo….
0
37
0
@EugeneKatsevich
Eugene Katsevich
2 years
RT @naturemethods: Introducing guided sparse factor analysis (GSFA), a statistical framework to detect changes in gene expression resulting….
Tweet card summary image
nature.com
Nature Methods - Guided sparse factor analysis (GSFA) is a powerful statistical framework to detect changes in gene expression as a result of perturbations in single-cell CRISPR screening.
0
40
0
@EugeneKatsevich
Eugene Katsevich
2 years
Thank you @WhartonAnlytcs for supporting this work!.
@WhartonAIAI
Wharton AI & Analytics Initiative
2 years
.@Wharton professor @EugeneKatsevich's software package, SCEPTRE, is helping scientists discover new biological insights by linking genetics to disease risk. With enough data, SCEPTRE could help diagnose and treat diseases more effectively. Learn more:.
0
2
7
@EugeneKatsevich
Eugene Katsevich
2 years
RT @james_y_zou: We share our thoughts on how #datascience education can use + evolve w/ #LLMs We give examples/re….
0
30
0
@EugeneKatsevich
Eugene Katsevich
2 years
I really like Mendeley Desktop's automatic BibTeX syncing feature. Now that Mendeley Reference Manager has superseded Mendeley Desktop, I am wondering if/when @mendeley_com will add this feature to Reference Manager. Is anyone else still using Mendeley Desktop for this reason?.
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
RT @johnomix: 🚨Our work on multi-modal, single-cell pooled CRISPR screens to study GWAS variant-to-function, STING-seq, is now out in @Scie….
0
109
0
@EugeneKatsevich
Eugene Katsevich
2 years
Thank you very much to my co-authors Ziang Niu (@MaxwellAng1), Abhinav Chakraborty, and Oliver Dukes!.
0
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
In summary, a double robustness phenomenon provides hope for applying MX methods with the model for X learned in sample, and brings these methods in closer proximity to doubly robust and semiparametric inference.
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
We also looked at power, adjusting for Type-I error inflation using oracle calibration. We found GCM test to have somewhat higher power than dCRT, and lasso-based tests to have somewhat higher power than post-lasso-based tests.
Tweet media one
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
We did not try very small samples or very discrete data, but we suspect in these scenarios that dCRT may have better Type-I error control than the GCM test.
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
Comparing dCRT to GCM test in numerical simulations, we found fairly similar Type-I error. Notably, switching the lasso out for post-lasso dramatically improves Type-I error control for both methods. The Maxway CRT ( inspired us to try this.
Tweet media one
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
We also leveraged semiparametric theory to prove that the GCM test is asymptotically optimal against local partially linear (and generalized partially linear) alternatives.
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
In fact, we proved that the dCRT is asymptotically equivalent to another doubly robust conditional independence test: Shah and Peters' Generalized Covariance Measure (GCM) test (. The resampling distribution converges to N(0,1) in a conditional sense.
1
0
1
@EugeneKatsevich
Eugene Katsevich
2 years
We formalized this intuition by proving a double robustness property for the dCRT. If the product of the errors in the model for X and the model for Y|X decays quickly enough, then the dCRT will control Type-I error.
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
We found that the Type-I error can be heavily inflated, but it depends on how well the model for Y given X is fit. A better approximation to E[Y|X] gives better Type-I error control!
Tweet media one
1
0
0
@EugeneKatsevich
Eugene Katsevich
2 years
We begin to address this gap by considering what happens to the dCRT ( when X is learned in sample.
Tweet media one
1
0
1
@EugeneKatsevich
Eugene Katsevich
2 years
MX methods now deployed in genetics ( and genomics (, usually fitting the model for X in sample. However, existing theory assumes either that the model for X is perfectly known or trained on a large unlabeled sample.
genomebiology.biomedcentral.com
Single-cell CRISPR screens are a promising biotechnology for mapping regulatory elements to target genes at genome-wide scale. However, technical factors like sequencing depth impact not only...
1
0
1
@EugeneKatsevich
Eugene Katsevich
2 years
Do model-X methods (like knockoffs or CRT) still control Type-I error when the model for X is learned in sample?. See our new preprint (, read the thread below, and/or come to my ISSI talk ( next Wednesday to find out!
Tweet media one
3
3
20