Artem Shmatko
@artem_shmatko
Followers
145
Following
82
Media
12
Statuses
32
Computational histopathology & generative models at @MoritzGerstung group
Heidelberg, Germany
Joined November 2020
I want to thank our great collaborators from @DKFZ and @emblebi : @alex_w_jung, @kgaurav1208, @s_brunak, @udansk, @ewanbirney, @tomaswfitz and especially my supervisor @MoritzGerstung It was a pleasure working with you!
1
0
5
For more details on code and analysis, check out our paper: https://t.co/dZP1UuvvB7 And github repository: https://t.co/9ccmzChAon
nature.com
Nature - Delphi-2M forecasts a person’s future health, covering more than 1,000 diseases, provides insights into co-morbidity dynamics and generates synthetic data for the training of AI...
1
0
3
Conclusion. Repurposing transformers for modelling disease progression enables a range of new abilities and insights. We believe it will be very enabling for biomedical research in improving our understanding of disease risks. More work (training and testing) will be needed
1
0
2
There are also some important limitations, however. Delphi-2M learns a range of biases present in UK Biobank’s data. This includes immortality (probands had to be alive to be recruited), but also the fact that not all sources of data (GPs, hospitals, questionnaires) were
1
0
1
Lastly, Delphi-2M, trained on data from the UK Biobank, performs very well on data from Danish registries. This is not trivial, as the cohorts and healthcare systems are very different.
1
0
1
3. Explainability Without any prior knowledge about the disease relationship, Delphi-2M’s internal embeddings cluster related diagnoses of similar types. This includes female genital cancers, different forms of neurodegeneration, or also diabetes and known co-morbidities.
1
0
2
When sampling from birth, Delphi-2M produces purely synthetic health trajectories. It turns out that a model trained on synthetic data has many of Delphi-2 M's properties without ever having seen any real data. This may be very useful as health data are very sensitive, which
1
0
2
Conditional sampling from the age of 60 produces long-term predictions of future health events over a horizon of up to 20 years. There is still *a lot* of chance involved in each case, but the amount of correctly predicted diagnoses is around 30-40% higher than sampling using
1
1
1
2. A key new feature of Delphi-2M is its generative AI nature. When applied autoregressively, Delphi-2M can sample entire health trajectories. This can be from birth or conditioned using data up to a certain age.
1
0
1
Delphi-2M remains consistently accurate across different ICD-10 chapters, sexes and prediction time horizons, allowing for estimating the entire spectrum of disease risks up to decades into the future.
1
0
1
1. Multi-disease predictions. Traditional epidemiological risk models work with one single disease. Delphi-2M models 1256 different ICD-10 coded diseases and death at once. One model to rule them all!
1
0
1
Using transformers instantly gives 3 advantages over traditional disease risk models: 1. Multi-disease predictions 2. Generative 3. Explainability
1
0
1
Delphi-2M is trained on data from 400,000 participants of the UK Biobank and validated on 1.8 million participants from Danish health registries.
1
0
1
Delphi-2M extends the GPT-2 architecture to model health trajectories consisting of ICD-10 diagnoses or other information such as sex or smoking, each paired with the corresponding age. It includes an additional head so it can not only predict what is likely to happen but also
1
0
1
How can transformer models be used for this purpose? LLMs model text as a sequence of words [or tokens]. Delphi-2M models health trajectories as a sequence of diagnoses. The language here is that of ICD-10 diagnostic codes.
1
0
1
Over time, our medical history accumulates, and an important question is how much our past diagnoses influence the future. Often, diseases occur in clusters, so-called co-morbidities. Can this process be modelled?
1
0
1
How much does your medical history reveal about your future? As a team effort of @MoritzGerstung and @ewanbirney's labs, we present Delphi-2M, a transformer for modelling future health based on past medical history. https://t.co/dZP1UuvvB7
2
20
48
Artem Shmatko presenting our joint study with @MoritzGerstung group on AI-assisted prediction of methylation class from H&E sections at #eano2024
@EANOassociation
@uniklinik_hd
@DKFZ
0
5
17
Alpha-release of our weekend project. Stay tuned for better results and details on how this is generated!
0
0
10
That's what a “just for fun” project looks like when polished to perfection… Great job @LomakinAI!
The wonders of population genetics allow us to reveal the echoes of the tumultuous times our ancestors survived, forever imprinted in our genome. In this thread, I'll share my experience replicating one of my all-time favourite papers by @lh3lh3 and @richard_durbin
0
0
4