
Jim Shaw
@jim_elevator
Followers: 521
Following: 385
Media: 29
Statuses: 216
Postdoc with Heng Li at Dana-Farber/Harvard Med School. Math PhD from @UofT with @YunWilliamYu. Working on methods for analyzing (metagenomic) sequencing data.
Joined April 2018
See my bluesky thread on myloasm. I'm pretty excited by its ability to assist in understanding microbiome genomics at high resolution. https://t.co/zKFqM6cdUv GitHub:
github.com
A new high-resolution long-read metagenome assembler for even noisy reads - bluenote-1577/myloasm
0
0
5
Preprint out for myloasm, our new nanopore / HiFi metagenome assembler! Nanopore's getting accurate, but 1. Can this lead to better metagenome assemblies? 2. How, algorithmically, to leverage them? with co-author Max Marin and supervised by Heng Li @lh3lh3
High-resolution metagenome assembly for modern long reads with myloasm https://t.co/lGIxJcirLX
#biorxiv_bioinfo
1
19
55
Stunning feat of applied sequencing bioinformatics. Congrats to the authors!!
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL
0
0
16
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL
5
150
375
skani v0.3.0 is released. https://t.co/dEkIzxIbDr
* 30-40% potential reduction in memory
* Breaking changes to indexing and searching databases
Calculate ANI for contigs, genomes. Search vs > 140k genomes: pre-indexed GTDB-R226 available for download.
github.com
Fast, robust ANI and aligned fraction for (metagenomic) genomes and contigs. - bluenote-1577/skani
0
23
54
I'm excited to share that I will soon join UCLA's Bioengineering department as an Assistant Professor. I am incredibly fortunate to have landed my dream job in this current atmosphere, and I would like to thank my mentors, colleagues, and friends for their support.
23
25
660
Preprint on "Improving spliced alignment by modeling splice sites with deep learning". It describes minisplice for modeling splice signals. Minimap2 and miniprot now optionally use the predicted scores to improve spliced alignment. https://t.co/aLB9juf08k
2
64
240
New life update! 🎆 🎓 This Fall, I will be joining the Department of Computer Science at Johns Hopkins University (@JHUCompSci) as an Assistant Professor, with an affiliation at the new Data Science and AI Institute (@HopkinsDSAI).
37
28
651
@csuhuangneng developed longcallR for joint SNP calling and phasing from long RNA-seq reads, AND for identifying allele-specific splicing/junctions (ASJ). Although ASJs of statistical significance are rare, a large fraction involve unannotated junctions. In Rust!
SNP calling, haplotype phasing and allele-specific analysis with long RNA-seq reads https://t.co/3EN0zYkvMq
#biorxiv_bioinfo
0
2
15
I'm mostly active on bluesky nowadays, so see the bsky thread for more details: https://t.co/S6NUW8wpZy
0
0
2
Announcing myloasm, a new long-read (ONT R10/PacBio) metagenome assembler. With @lh3lh3. https://t.co/ingqEXblza
2
29
83
GASTON, our method to learn “topographic maps” of gene expression, is out now @naturemethods! IMO the coolest part is a new model of *spatial gradients in sparse data*. As is typical for bio papers, it’s buried in Methods, but see below for a quick outline on the math 👇
Gene expression topography analysis by GASTON portrays domain organization and spatial gradients of gene expression and cell type composition using spatially resolved transcriptomics data. @uthsavc @benjraphael @PrincetonCS
https://t.co/MbDWiI9F1s
6
15
51
Tech Alert! 🚀🧬 We can now determine the sequence of DNA with non-canonical bases in a direct and high-throughput manner with Nanopore sequencing. Check out our preprint for details:
biorxiv.org
The discovery of synthetic xeno-nucleic acids (XNAs) that can basepair as unnatural bases (UBs) to expand the genetic alphabet has spawned interest in many applications, from synthetic biology to DNA...
1
25
92
Hifiasm 0.21.0 has been released. It now has a beta module for direct assembly of ONT R10 simplex reads. Initial tests with regular simplex reads show very promising results!
github.com
Since Hifiasm-0.20.0 (r639): New Feature: Introduced a beta module for ONT assembly using ONT simplex R10 reads. To enable this feature, add the --ont option as shown below: hifiasm -t64 --ont -o...
3
45
109
Happy to see our method for T2T genome assembly published! It addresses an important limitation of string graphs, namely contained reads. Led by @skamath5e. "Telomere-to-telomere assembly by preserving contained reads" https://t.co/vhTF5JBeN4
SPECIAL ISSUE! This month @genomeresearch publishes a diverse collection of research and review articles in a special issue highlighting advances in long-read sequencing applications in biology and medicine. https://t.co/4ezPHyvmXH.
0
6
22
Just tried Sylph https://t.co/tXL3g9aOIg by @jim_elevator. Processed >2000 samples in a few hours on google cloud powered by @TerraBioApp. Roughly 15min per sample. Briefly checked the abundances and they match pretty well with what we got using an independent workflow.
github.com
ultrafast taxonomic profiling and genome querying for metagenomic samples by abundance-corrected minhash. - bluenote-1577/sylph
1
7
18
I gave a talk recently at Ben Langmead's group on my post on fast computation of random minimizers. Was super fun! Blogpost: https://t.co/9mcbyLT9cP Recording: https://t.co/znCQixg07t
2
10
46
A new paper from our amazing @sladky_on (jointly supervised with @VeselyPavel_mff) on super space-efficient indexing of arbitrary k-mer sets, introducing the Masked Burrows-Wheeler Transform (MBWT).
FroM Superstring to Indexing: a space-efficient index for unconstrained k-mer sets using the Masked Burrows-Wheeler Transform (MBWT) https://t.co/YagAm5YBz3
#biorxiv_bioinfo
2
13
20
Lastly, thanks to users of sylph and those who have talked/messaged me about it. The referees were fair and helpful as well. New users, let me know how you find the software! https://t.co/WGX3W1yOjT 5/5
github.com
ultrafast taxonomic profiling and genome querying for metagenomic samples by abundance-corrected minhash. - bluenote-1577/sylph
0
0
5
Secondly, apparently Oxford Nanopore (@oxfordnanopore) found that sylph was the most accurate metagenome profiler in their whitepaper. Very neat, and I didn't even find out about this until recently. 4/5 https://t.co/7tNmiL0xrU
2
1
10