Jim Shaw @jim_elevator X Profile

Jim Shaw

@jim_elevator

Followers

522

Following

385

Media

29

Statuses

216

Postdoc with Heng Li at Dana-Farber/Harvard Med School. Math PhD from @UofT with @YunWilliamYu. Working on methods for analyzing (metagenomic) sequencing data.

https://t.co/iEIu7Hd6L9

Joined April 2018

Don't wanna be here? Send us removal request.

Jim Shaw

@jim_elevator

3 months

See my bluesky thread on myloasm. I'm pretty excited by it's abililty to assist in understanding microbiome genomics at high resolution. https://t.co/zKFqM6cdUv Github:

github.com

A new high-resolution long-read metagenome assembler for even noisy reads - bluenote-1577/myloasm

0

5

Jim Shaw

@jim_elevator

3 months

Preprint out for myloasm, our new nanopore / HiFi metagenome assembler! Nanopore's getting accurate, but 1. Can this lead to better metagenome assemblies? 2. How, algorithmically, to leverage them? with co-author Max Marin and supervised by Heng Li @lh3lh3

bioRxiv Bioinfo

@biorxiv_bioinfo

3 months

High-resolution metagenome assembly for modern long reads with myloasm https://t.co/lGIxJcirLX #biorxiv_bioinfo

1

19

55

Jim Shaw

@jim_elevator

3 months

Stunning feat of applied sequencing bioinformatics. Congrats to the authors!!

Rayan Chikhi

@RayanChikhi

3 months

🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL

0

15

Rayan Chikhi

@RayanChikhi

3 months

🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL

5

151

380

Jim Shaw

@jim_elevator

4 months

skani v0.3.0 is released. https://t.co/dEkIzxIbDr * 30-40% potential reduction in memory * Breaking changes to indexing and searching databases Calculate ANI for contigs, genomes. Search vs > 140k genomes: pre-indexed GTDB-R226 available for download.

github.com

Fast, robust ANI and aligned fraction for (metagenomic) genomes and contigs. - bluenote-1577/skani

0

23

54

Yosuke Tanigawa

@yk_tani

6 months

I'm excited to share that I will soon join UCLA's Bioengineering department as an Assistant Professor. I am incredibly fortunate to have landed my dream job in this current atmosphere, and I would like to thank my mentors, colleagues, and friends for their support.

23

25

656

Heng Li

@lh3lh3

6 months

Preprint on "Improving spliced alignment by modeling splice sites with deep learning". It describes minisplice for modeling splice signals. Minimap2 and miniprot now optionally use the predicted scores to improve spliced alignment. https://t.co/aLB9juf08k

2

64

241

Uthsav Chitra

@uthsavc

6 months

New life update! 🎆 🎓 This Fall, I will be joining the Department of Computer Science at Johns Hopkins University (@JHUCompSci) as an Assistant Professor, with an affiliation at the new Data Science and AI Institute (@HopkinsDSAI).

37

28

649

Heng Li

@lh3lh3

7 months

@csuhuangneng developed longcallR for joint SNP calling and phasing from long RNA-seq reads, AND for identifying allele-specific splicing/junctions (ASJ). Although ASJs of statistical significance are rare, a large fraction involve unannotated junctions. In Rust!

bioRxiv Bioinfo

@biorxiv_bioinfo

7 months

SNP calling, haplotype phasing and allele-specific analysis with long RNA-seq reads https://t.co/3EN0zYkvMq #biorxiv_bioinfo

0

2

15

Jim Shaw

@jim_elevator

7 months

I'm mostly active on bluesky nowadays, so see the bsky thread for more details: https://t.co/S6NUW8wpZy

0

2

Jim Shaw

@jim_elevator

7 months

Announcing myloasm, a new long-read (ONT R10/PacBio) metagenome assembler. With @lh3lh3. https://t.co/ingqEXblza

2

29

83

Uthsav Chitra

@uthsavc

11 months

GASTON, our method to learn “topographic maps” of gene expression, is out now @naturemethods! IMO the coolest part is a new model of *spatial gradients in sparse data*. As is typical for bio papers, it’s buried in Methods, but see below for a quick outline on the math 👇

Nature Methods

@naturemethods

11 months

Gene expression topography analysis by GASTON portrays domain organization and spatial gradients of gene expression and cell type composition using spatially resolved transcriptomics data. @uthsavc @benjraphael @PrincetonCS https://t.co/MbDWiI9F1s

6

15

51

Niranjan Nagarajan

@NiranjanTW

1 year

Tech Alert! 🚀🧬 We can now determine the sequence of DNA with non-canonical bases in a direct and high-throughput manner with Nanopore sequencing. Check out our preprint for details:

biorxiv.org

The discovery of synthetic xeno-nucleic acids (XNAs) that can basepair as unnatural bases (UBs) to expand the genetic alphabet has spawned interest in many applications, from synthetic biology to DNA...

1

25

92

Haoyu Cheng

@ChengChhy

1 year

Hifiasm 0.21.0 has been released. It now has a beta module for direct assembly of ONT R10 simplex reads. Initial tests with regular simplex reads show very promising results!

github.com

Since Hifiasm-0.20.0 (r639): New Feature: Introduced a beta module for ONT assembly using ONT simplex R10 reads. To enable this feature, add the --ont option as shown below: hifiasm -t64 --ont -o...

3

45

109

Chirag Jain

@chirgjain

1 year

Happy to see our method for T2T genome assembly published! It addresses an important limitation of string graph, that is, the contained reads. Led by @skamath5e "Telomere-to-telomere assembly by preserving contained reads" https://t.co/vhTF5JBeN4

Genome Research

@genomeresearch

1 year

SPECIAL ISSUE! This month @genomeresearch publishes a diverse collection of research and review articles in a special issue highlighting advances in long-read sequencing applications in biology and medicine. https://t.co/4ezPHyvmXH.

0

6

22

Chenhao Li

@li_chenhao

1 year

Just tried Sylph https://t.co/tXL3g9aOIg by @jim_elevator. Processed >2000 samples in a few hours on google cloud powered by @TerraBioApp. Roughly 15min per sample. Briefly checked the abundances and they match pretty well with what we got using an independent workflow.

github.com

ultrafast taxonomic profiling and genome querying for metagenomic samples by abundance-corrected minhash. - bluenote-1577/sylph

1

7

18

Ragnar {Groot Koerkamp} 🦋

@curious_coding

1 year

I gave a talk recently at Ben Langmead's group on my post on fast computation of random minimizers. Was super fun! Blogpost: https://t.co/9mcbyLT9cP Recording: https://t.co/znCQixg07t

2

10

46

Karel Břinda

@KarelBrinda

1 year

A new paper from our amazing @sladky_on (jointly supervised with @VeselyPavel_mff) on super space-efficient indexing of arbitrary k-mer sets, introducing the Masked Burrows-Wheeler Transform (MBWT).

bioRxiv Bioinfo

@biorxiv_bioinfo

1 year

FroM Superstring to Indexing: a space-efficient index for unconstrained k-mer sets using the Masked Burrows-Wheeler Transform (MBWT) https://t.co/YagAm5YBz3 #biorxiv_bioinfo

2

13

20

Jim Shaw

@jim_elevator

1 year

Lastly, thanks to users of sylph and those who have talked/messaged me about it. The referees were fair and helpful as well. New users, let me know how you find the software! https://t.co/WGX3W1yOjT 5/5

github.com

ultrafast taxonomic profiling and genome querying for metagenomic samples by abundance-corrected minhash. - bluenote-1577/sylph

0

5

Jim Shaw

@jim_elevator

1 year

Secondly, apparently Oxford Nanopore (@oxfordnanopore) found that sylph was the most accurate metagenome profiler in their whitepaper. Very neat, and I didn't even find out about this until recently. 4/5 https://t.co/7tNmiL0xrU

2

1

10