I have big news!
I will be leaving
@roslininstitute
and
@EdinburghUni
at the end of September to take up a new position as Principal Data Scientist at
@DSM
!
Bioinformatics over the years:
1990s: doing a BLAST search
2000s: analysing 30 microarrays
2010s: nalysing 6Tb of NGS
2020s: creating a cloud the size of Netflix to reanalyse the whole of SRA for one figure
In today's
@Nature
Briefing: a protein that boosts ageing monkeys’ memory, slowed-down time in the early Universe and economic crashes boost globalization. 🧵
Hello this is your grants office. There is an unprecedented opportunity to bid for £30,000 of funding, but this has to be spent by next Tuesday. We're hoping to fund 1000 projects of £30 each. Attached is the 7 page form which must be submitted within 12 minutes of reading this
@nilshomer
Visualise *everything*
Look at sequence reads, sequence assemblies, data, alignments, SNPs, PCA all of your numerical data.
Get into the *details* of your data
PhD student: I hate writing
Post-doc: I hate writing, but I know I can do it
ECR: hey I'm pretty good at writing
Prof: the speed of bullshit from my hands, I am like superman
As research committee season seems to be upon us again:
1) computational research is real research
2) no it doesn't take less time or effort than lab work
3) no you don't always need a hypothesis or specific biological question
Amusing conversation over lunch.
Academia:
you ask for 24 muffins
the funder gives you 20 muffins, and congratulates you on your 24 new muffins
Your university takes 10 muffins
You can't eat any muffins until legal say so, at a cost of 2 muffins
1/2
Scientists for the last 50 years: there's going to be a global pandemic virus jump from animals
Public: ooooh did Dean Koontz predict COVID-19???!!!
Scientists: ... but ....
Public: my GOD how did Koontz know??!
Things people don't warn you about parenthood:
1) you will be permanently ill for at least 5 years
2) crying babies are not upset or sad, they are absolutely furious
3) you can take them to the most amazing places, but their favourite part of the day will be the ice cream
Scientists inbox:
50% fake conference invites
30% fake journal invites
15% admin
3% questions from post docs they could answer themselves using google
1% "can you retweet my job advert?"
0.9% unavoidable meeting invites
0.1% science I'm interested in
Idea: when you retire, don't become an emeritus professor to "finish all your projects".
Instead, coach an early career researcher in your field, hand over your expertise and ideas and go do the gardening.
Just gave my name in Starbucks as "BLAST is almost never the right tool" and when the barista called my name 100 angry biologists turned up and ripped my legs off
1. New technology
2. "Oh my god it works" papers
3. "We need special tools" papers
4. "These are the best tools" papers
5. "A Bayesian approach to..." papers
6. Fight on StackExchange
7. Anonymous post on pubpeer
8. "Oh my god it's crap!" papers
9. "Microbiome"
Repeat
New to bioinformatics? The main thing to remember is nothing has been written before, so remember to write everything from scratch in your favourite language 👍
I'll get pushback for this, but if you don't recognise that Excel is an incredible piece of software that has some incredible features that we all could use, then you're wrong and perhaps not as objective about things as you think you are.
A square wheel can roll smoothly only if the ground consists of evenly shaped inverted catenaries of the right size and curvature [The square-wheeled tricycle at the National Museum of Mathematics in New York City. Full video: ]
#newPI
announcements
Everywhere: excited to start my new lab, I'm hiring multiple post-docs and students, hit me up!
UK: excited to start my new lab! I've got £500, where do people buy cheap paperclips?
I see Excel is being hated on again. To be honest, its quick visual sorting, filtering and graphing functions are exceptional. Fantastic for initial exploration of the data. I use it daily.
Once I see the data, of course
#rstats
for serious graphs :)
Not sure why people find parenting and academia hard. Here's a tip: simply split your time:
50% work
40% childcare
20% housework
20% cooking and meal prep
20% leisure time
10% family engagement
And don't forget to get 8 hours sleep!
# of papers using single cell techniques: 10,000
# of papers describing novel bioinformatics technique on single cell data: 1,000,000,000,000
(approximate numbers)
I first joined academia when I was 28. I had no PhD, I had never written a paper nor a grant, and now it was a fundamental part of my job.
I was terrified. Utterly frozen by the enormity of having to do something I had no idea how to do. 1/n
16S RIGHT TOTALLY TRANSFORMATIVE TECHNOLOGY BUT THE UNIVERSAL PRIMERS AREN'T UNIVERSAL AND YOUR CHOICE OF V REGION REALLY MATTERS AND ALL THE DATABASES ARE WRONG AND THERE'S LOADS OF ERRORS IN THE DATA WHICH INFLATES YOUR OTU COUNT AND THERE'S A MASSIVE DEBATE ABOUT ASVs...
Your DNA is contaminated
Your genome is contaminated
Your microbiome is contaminated
Your reagents are contaminated
The databases are contaminated
PRINT THIS OUT and give it to your students and post-docs
Assembled a protozoan genome from cattle rumen.
Largest contig top BLAST hit is Sphaeramia orbicularis - common name Orbiculate cardinalfish.
I guess fish have protozoa in their microbiomes too :)
Unpopular opinion - this is why genome assembly should be left to experts
As more and more people sequence genomes, we will eventually sequence the common ancestor of all life on earth and it will be PhiX and freaking illumina adapters
Mindfulness for bioinformatics
- only use one core
- don't run jobs simultaneously
- let your job run "in the moment"
- clear your mind whilst the server does the work
Bioinformatics is having to re-run your whole pipeline because some (essential) tool at the end has an undocumented requirement not to have dots in the filename
SEE EVERYONE GOES ON ABOUT LONG READS MAKING BETTER MAGS BUT REALLY IT'S ALL ABOUT DNA EXTRACTION AND YOU CAN'T GET LONG DNA FROM BEAD BEATING AND IF YOU DON'T BEAD BEAT HARD ENOUGH YOU BIAS THE SAMPLE AND IF YOU SIZE SELECT YOU BIAS THE SAMPLE AND U WON'T HAVE ENOUGH DNA
How would I know if my own research area was this wrong?
Our usual safeguards won’t save us: peer review, meta-analysis, 100s of conceptual replications, listening to eminent researchers. All failed.
This should be keeping us up at night.
OK,
#protip
you might hate.
Microsoft Word can open PDFs.
Yes it can. File -> Open -> Choose PDF -> confirm conversion
PDFs can then be edited and saved as PDF or doc(x).
It can open PDF tables.
PDF tables can be copied and pasted into Excel.
Excel can save csv, tsv etc
Twitter laid me off today. If you know of any open positions for project managers or senior software engineers, let me know. I was in charge of the feature that made sure every bioinformatics debate descended into an argument about Python vs R.
It's finally here! Our new paper in Nature Methods:
A comparison of single-coverage and multi-coverage metagenomic binning reveals extensive hidden contamination | Nature Methods
Can confirm that you can work your way all the way up to professor of bioinformatics, head of department, a 20 year career, and your collaborators will still ask you to upload the reads to ENA
2020 science predictions:
- Illumina will give up their pursuit of PacBio. PacBio will eventually fold and Illumina will buy the technology anyway (probably 2021).
- Nanopore will overtake Illumina in terms of cost-per-gb and PacBio in terms of quality
So, big news. We've been doing a massive comparison of short read aligners in terms of accuracy and speed. The results are surprising! We now recommend you change all your pipelines to use NCBI BLAST 👍
1. SNP MP fondled two 17yr old girls behind his wife's back
2. SNP MSP de-frauded a pro-indy group
3. SNP MSP immorally flipped houses for a profit
4. Former SNP first minister accused of bullying
5. Current SNP first minister accused of cover up
But yeah sure, you're "better"
Universities: we want spin outs!
Scientists: sure we've got ideas....
Universities: great! You'll have to work 60-80 hour weeks, generate all your own investment, never see your family, and we will take a massive cut of all profits
Scientists: ummmm.... no
So, just to be clear, the University mandatory travel agent:
1) gets the same flight prices we do
2) uses the same websites we do
3) won't choose seats or meals
4) costs us money
Have I got that right?
I have big news!
I will be leaving
@roslininstitute
and
@EdinburghUni
at the end of September to take up a new position as Principal Data Scientist at
@DSM
!
Bioinformatician's log, stardate 2157.
We have colonized Mars, after Trump III's nuclear war with the EU-Korean alliance vaporized the Earth's atmosphere.
We have with us the nanopore device for detecting Martian microbes.... but... we still can't f*cking install QIIME
I took a needed break this PM to work on a swing bed for our backyard. Many mistakes to fix and much more to do,but I am looking forward to a nap this weekend.
Amazing. We have discovered a method to create complete microbial assemblies using ILLUMINA metagenomics data, no need for long reads, perfect circular genomes every time. We call it Super Assembly MAGS, or SUPERASSMAGS for short. Preprint out soon!