Martin Fahrenberger @FahrenbergerM X Profile

Martin Fahrenberger

@FahrenbergerM

Followers

76

Following

138

Media

10

Statuses

89

Bioinformatician/Data-Scientist, joined PhD University of Vienna and Medical University of Vienna. Open for freelance projects.

https://t.co/kKhOTsORhq

Vienna, Austria

Joined May 2017

Don't wanna be here? Send us removal request.

Martin Fahrenberger

@FahrenbergerM

20 hours

This reduction in background noise leads to clearer separation between distinct tissue areas. P.S. If you like what you read, I’m currently looking for a new position as a Bioinformatician / Data Scientist in academia or industry — feel free to reach out!

0

Martin Fahrenberger

@FahrenbergerM

20 hours

As an example, we applied GTestimate to the 10x Genomics sagittal mouse-brain dataset and observed cleaner normalized expression patterns: the choroid-plexus marker Ttr retained its true signal while unspecific background expression dropped by up to 50 % compared to NormalizeData

1

0

Martin Fahrenberger

@FahrenbergerM

20 hours

The choice of normalization method also has a major impact on spot-wise normalization in Spatial Transcriptomics.

1

0

Martin Fahrenberger

@FahrenbergerM

20 hours

🧠GTestimate for Spatial Transcriptomics In this final spotlight from our GTestimate paper https://t.co/p7YoUL6MAH I want to focus on an alternative application of GTestimate: Spatial Transcriptomics.

1

0

Martin Fahrenberger

@FahrenbergerM

2 days

This quantified uncertainty could be leveraged for imputation, confidence weighting, or data-quality metrics in future work.

0

Martin Fahrenberger

@FahrenbergerM

2 days

We can estimate for each cell how much of its transcriptome remained unobserved — the missing mass. For a high quality dataset, we found an average missing mass of 30 %, reaching up to 70 % for some cells (Supplementary Fig. S11).

1

0

Martin Fahrenberger

@FahrenbergerM

2 days

🧩GTestimate: The “Missing Mass” I’ve been highlighting different aspects of our paper https://t.co/p7YoUL6MAH over the last few days. Today I want to point out a unique feature of the Good–Turing estimator: it tells you not only what you see but also how much you miss.

1

0

Martin Fahrenberger

@FahrenbergerM

3 days

Result: GTestimate resulted in the highest clustering accuracy overall, outperforming NormalizeData at 14 of 15 tested resolutions and SCTransform at 10 of 15. This demonstrates the strong downstream impact of our new normalization method. Check it out: https://t.co/p7YoUL7kqf

academic.oup.com

AbstractBackground. Single-cell RNA-seq suffers from unwanted technical variation between cells, caused by its complex experiments and shallow sequencing d

0

Martin Fahrenberger

@FahrenbergerM

3 days

We used the Fu et al. (2024) PBMC dataset (Liu dataset) https://t.co/uqm1Xw9SK2 This dataset contains 9266 experimentally annotated cells, we used to compare the unsupervised clustering performance of different Seurat pipelines using NormalizeData, SCTransform, and GTestimate.

academic.oup.com

Abstract. Cell-type annotation is a critical step in single-cell data analysis. With the development of numerous cell annotation methods, it is necessary t

1

0

Martin Fahrenberger

@FahrenbergerM

3 days

📊GTestimate: Clustering Performance During the review process for our paper, Reviewer #1 asked us to benchmark our method on datasets with known cell-type labels.

1

0

Martin Fahrenberger

@FahrenbergerM

4 days

If this approach sounds interesting, I’d be happy to discuss potential applications or collaborations.

0

Martin Fahrenberger

@FahrenbergerM

4 days

Beyond benchmarking, cta-seq could also be used to characterize rare cell types or specific cell populations, by first identifying these cells in a typical scRNA-seq run and then resequencing the sample while focusing your efforts on the cells of interest for extra depth.

1

0

Martin Fahrenberger

@FahrenbergerM

4 days

Using these data, we showed that the Good–Turing estimator reduced relative gene-expression estimation error by 17 % on average compared to the traditional Maximum Likelihood approach (see Figure 1 c-d).

1

0

Martin Fahrenberger

@FahrenbergerM

4 days

Together with Christopher Esk we developed cta-seq, a cell-targeted PCR amplification approach inspired by TAP-seq. This allowed us to sequence 18 selected cells twice at two vastly different depths, with the ultra-deep versions serving as ground-truth gene expression profiles.

1

0

Martin Fahrenberger

@FahrenbergerM

4 days

While benchmarking our normalization method GTestimate, we quickly realized that simulations weren’t enough — we needed ground-truth gene expression profiles of the same single cells sequenced twice.

1

0

Martin Fahrenberger

@FahrenbergerM

4 days

🧪 cta-seq: Ultra-deep scRNA-seq for a small set of cells Our new paper was published yesterday in GigaScience! https://t.co/p7YoUL7kqf Today I want to highlight our new cta-seq method for ultra-deep scRNA-seq of a small set of cells.

1

0

2

Martin Fahrenberger

@FahrenbergerM

5 days

P.S. If you like what you read, I’m currently looking for a new position as a Bioinformatician / Data Scientist in academia or industry — feel free to message me.

0

Martin Fahrenberger

@FahrenbergerM

5 days

GTestimate is an easy-to-use R package that can serve as a drop-in replacement for Seurat’s NormalizeData() and integrates smoothly into common workflows. Check back tomorrow or follow me for more details over the coming days.

1

0

Martin Fahrenberger

@FahrenbergerM

5 days

In our paper, we show that replacing ML with the Good–Turing estimator, which accounts for unobserved genes, markedly improves relative gene-expression and cell–cell distance estimation.

1

0

Martin Fahrenberger

@FahrenbergerM

5 days

scRNA-seq data are notoriously shallow — only ~5 000 molecules per cell are sequenced — yet many normalization methods still rely on the Maximum Likelihood (ML) estimator to calculate relative gene expression per cell, which does not perform well at such shallow sampling depths.

1

0