
Lood van Niekerk
@lood_ml
Followers
175
Following
2K
Media
5
Statuses
134
Ever wondered why masked diffusion outperforms other types of discrete diffusion? (e.g for EvoDiff) .Alan figured out and then fixed it!.
There are many domain-specific noise processes for discrete diffusion, but masking dominates! Why? We show masking exploits a key property of discrete diffusion, which we use to unlock the potential of those structured processes and beat masking! @gruver_nate @andrewgwils 1/7
0
0
4
RT @kulesatony: University budgets everywhere are getting slashed, and we hear many PhD students with accepted ICML papers can no longer af….
0
34
0
Definitely chat to Ruben and the team at their ICML poster session next month! And check out the attached thread for more info. (@PeterM_rchGroth, @yaringal, @deboramarks, @NotinPascal).(5/5). Paper:
0
0
1
Really looking forward to this line of research being explored further! Credits to @ruben_weitzman and others for getting a challenging codebase up-and-running (incl GPU embedding vector databases, joint training of reader-retriever) (4/5).
1
0
1
As sequence databases get bigger and more diverse, retrieval-based methods provide an interesting alternative to scaling successively bigger protein language models. (1/5).
🚨ICML Paper Alert🚨.What if finding the right protein homologs wasn't a slow search, but a learned part of the model itself?.We introduce 𝐏𝐫𝐨𝐭𝐫𝐢𝐞𝐯𝐞𝐫, an end-to-end framework that learns to retrieve the most useful homologs for self-supervised reconstruction! (1/12)
1
2
17
RT @KevinKaichuang: End-to-end differentiable homology search for protein fitness prediction. @ruben_weitzman @lood_ml @yaringal @debora….
0
14
0
RT @GabriCorso: Excited to unveil Boltz-2, our new model capable not only of predicting structures but also binding affinities! Boltz-2 is….
0
418
0
Excited to expand this further with bigger datasets!. They're presenting at the following times:.Thursday May 15.10:30-11:15AM.12:25-1:55PM.Poster session C .Poster hashtag #C044 . Feel free to reach out to datapoints@ginkgobioworks.com if you want to know more!. (4/4).
0
1
3
New antibody developability dataset by the Datapoints team at Ginkgo - I hope this becomes a standard benchmark in the future. Congrats to the team who onboarded these assays and are running them consistently at ~thousands of variants per week. Preprint link in 🧵.
Great new resource for anyone working on antibody CMC, developability, or discovery. New preprint out today from Ginkgo that benchmarks 246 therapeutic IgGs across 10 developability assays. The resulting dataset is ML-ready and publicly available. If you're thinking about early
0
0
5
Congrats to @AlyssaKMorrow and others on this latest work: a model for designing full-length mRNA sequences for a given protein sequence 🧬.
It's launch day for mDD-0, the latest model from Ginkgo AI! Read the white paper about generative AI for full-length mRNA sequences including custom therapeutic payloads. We're excited to see what you'll build!.
1
1
10
RT @etowah0: Can we learn protein biology from a language model?. In new work led by @liambai21 and me, we explore how sparse autoencoders….
0
101
0