@maurice_weiler
Maurice Weiler
3 months
Another DNA language model leveraging the reverse complement symmetry of the double helix 🧬 The two strands carry exactly the same information, and are related by 1) reversing the sequence and 2) swapping base pairs A⟷T and C⟷G. Hard-coding this prior into sequence models…
@SchiffYair
Yair Schiff
3 months
We are excited to present Caduceus: bi-directional DNA language model built on Mamba, with long range modeling that respects inherent symmetry of double helix DNA structure. Caduceus is SoTA on several benchmarks, including identifying causal SNPs for gene expression. 🧵1/9
Tweet media one
4
54
244
1
10
57

Replies

@maurice_weiler
Maurice Weiler
3 months
For a more representation theoretic formulation, check out this RC-equivariant steerable CNN by Vincent Mallet and @jeanphi_vert :
1
1
13