Wei Shen 沈 伟

@shenwei356

Followers: 2K
Following: 4K
Media: 70
Statuses: 2K

Associate professor of Bioinformatics at Chongqing Medical University, China. Lab: https://t.co/67yy7iHTqZ Personal: https://t.co/PgK3FxXeWb https://t.co/mvJTdAyoHz

Chongqing, China
Joined November 2013
@shenwei356
Wei Shen 沈 伟
3 months
LexicMap paper is out!🎉 BTW, we've just released v0.8.0, with reduced indexing and searching memory usage, more features (e.g., limiting search by TaxId), and more utilities to improve the usability. https://t.co/bqbgtAoLFz
github.com
v0.8.0 - 2025-09-10 No changes to the index format (see Index format changelog). New commands: lexicmap utils merge-search-results: Merge a query's search results from multiple indexes. lexic...
@NatureBiotech
Nature Biotechnology
3 months
Efficient sequence alignment against millions of prokaryotic genomes with LexicMap https://t.co/QBuQ9vZ1iy
3
31
102
@zhuqiyun
Qiyun Zhu
4 days
The scikit-bio paper is online in Nature Methods! Many thanks to our collaborators, community contributors and reviewers! We couldn’t have done it without you. https://t.co/bvbMdwMtUY #Bioinformatics #OpenSource
nature.com
Nature Methods - Scikit-bio: a fundamental Python library for biological omic data analysis
4
53
290
@MicrobiomeVIF
Microbiome Virtual International Forum
28 days
It's Monday! ...and a new #MVIF program is out! 🤩 Free registration: https://t.co/h8GhACapmd Highlights: 🇺🇸 Vanessa Hale 🇰🇷 Jun Hyung Cha Keynote: 🇺🇸 Katherine Lemon Talks: 🇺🇸 Meenakshi Chakraborty 🇨🇳 Wei Shen @shenwei356 🇺🇸 Johanna Gutleben @jo_goodlife
0
4
4
@nomad421
𝕐
1 month
This paper highlights some recent security vulnerabilities in sequencers & directly related software (here, with a focus on ONT devices). The intersection of sequencing tech & security seems under-developed! https://t.co/7RHUAfrwWb ONT’s CVE guidelines: https://t.co/nMdGurXCHw
nature.com
Nature Communications - Portable genome sequencers are revolutionizing genomic research. However, their reliance on external systems introduces new vulnerabilities that threaten the security of...
1
2
5
@strnr
Stephen Turner 🦋 @stephenturner.us
2 months
Efficient and accurate search in petabase-scale sequence repositories https://t.co/XDORBMJa8P 🧬🖥️🧪 MetaGraph: https://t.co/5VGSGCB30R Code: https://t.co/R6H4vXE4ti
2
23
77
@shenwei356
Wei Shen 沈 伟
2 months
I sincerely appreciate the opportunity to visit EMBL-EBI. The guidance and support I received from Zamin Iqbal, John Lees and other colleagues have been immensely valuable, leading to a positive transformation in my career path. 😀
@emblebi
EMBL-EBI
3 months
There are millions of openly available microbial genomes, but searching them can be slow. Until now 🥁 Introducing LexicMap, a new alignment tool that lets you search these data in minutes, helping track antibiotic resistance, trace outbreaks, and more. https://t.co/UnQCBDst65
0
0
8
@emblebi
EMBL-EBI
3 months
There are millions of openly available microbial genomes, but searching them can be slow. Until now 🥁 Introducing LexicMap, a new alignment tool that lets you search these data in minutes, helping track antibiotic resistance, trace outbreaks, and more. https://t.co/UnQCBDst65
0
10
20
@shenwei356
Wei Shen 沈 伟
3 months
Learn more:
0
0
2
@RayanChikhi
Rayan Chikhi
3 months
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL
5
151
380
@AJamesMcCarthy
Andrew McCarthy
4 months
199
349
4K
@jim_elevator
Jim Shaw
4 months
skani v0.3.0 is released. https://t.co/dEkIzxIbDr * 30-40% potential reduction in memory * Breaking changes to indexing and searching databases Calculate ANI for contigs, genomes. Search vs > 140k genomes: pre-indexed GTDB-R226 available for download.
github.com
Fast, robust ANI and aligned fraction for (metagenomic) genomes and contigs. - bluenote-1577/skani
0
23
54
@shenwei356
Wei Shen 沈 伟
5 months
Yes, I just recommended it to my students.
@smllmp
Samuel Lampa
5 months
https://t.co/We4bDz0cRb is shaping up to become one of the absolute top resources for learning hands-on #bioinformatics and #genomics today!
0
2
11
@lemire
Daniel Lemire
5 months
7
15
81
@RayanChikhi
Rayan Chikhi
7 months
Slides from my talk (with Kamil Jaron) on a history of k-mers in bioinformatics:
1
31
85
@nomad421
𝕐
6 months
This seems like an awesome course! https://t.co/byOMSDIfCE! If there were more hours in the day, I'd want to put something like this together at UMD.
0
4
12
@shenwei356
Wei Shen 沈 伟
7 months
Also updated - taxid-changelog to May, 2025 https://t.co/9AabacigLF - gtdb-taxdump to GTDB r226 https://t.co/bKrIReUSOG - ictv-taxdump to VMR_MSL40
0
1
0
@shenwei356
Wei Shen 沈 伟
7 months
TaxonKit v0.20.0 is adapted to recent rank changes in NCBI Taxonomy.
github.com
Changes TaxonKit v0.20.0 This version is mainly for maintaining compatibility with NCBI's recent changes(1, 2). Please remove the ranks.txt file in ~/.taxonkit/ or other directories containi...
@NCBI
NCBI
10 months
Updates coming to #NCBITaxonomy! We are introducing two new ranks, domain and realm, and discontinuing the rank superkingdom. Learn more: https://t.co/IqvlgNqjG5
1
14
34
@shenwei356
Wei Shen 沈 伟
8 months
⚡️LexicMap v0.7.0 fixed a minor bug in index building and improved the alignment accuracy! Please rebuild the existing index. Sorry for the inconvenience. 🥹 https://t.co/05zJ4N1UdA
github.com
v0.7.0 - 2025-04-11 Please rebuild the index, as some seeds in the genome end regions were missed during computation. lexicmap index: Fix a little bug in seed desert filling -- forgot to fill the...
1
5
26
@KarelBrinda
Karel Břinda
8 months
A decade ago, we had thousands of bacterial genomes. Now, we have millions. How to scale computational methods? Our paper in @naturemethods answers this: use evolutionary history to guide compression and search. …From terabytes to tens of GBs… w/@Baym @ZaminIqbal et al. 🧵1/
3
53
169
@baym
Michael Baym
8 months
Thrilled that our work on this problem with @KarelBrinda, @ZaminIqbal, and others is out in @naturemethods today! We used phylogenetic compression (described in the thread) to compress every microbe ever sequenced onto a flash drive so that it can be searched with a laptop!
@baym
Michael Baym
3 years
So we asked: what sets the fundamental limit on computation on large genomic databases? Evolution! The irreducible entropy in genome collections is bounded by the most parsimonious path to introduce that variability. In other words, optimal compression should echo phylogeny. 4/
3
29
134