Wei Shen 沈 伟

@shenwei356

Followers
2K
Following
4K
Media
70
Statuses
2K

Associate professor of Bioinformatics at Chongqing Medical University, China. Lab: https://t.co/67yy7iIrgx Personal: https://t.co/E5GOnSMIRW https://t.co/mXnnqslpi1

Chongqing, China
Joined November 2013
@shenwei356
Wei Shen 沈 伟
1 month
LexicMap paper is out!🎉 BTW, we've just released v0.8.0, with reduced indexing and searching memory usage, more features (e.g., limiting search by TaxId), and more utilities to improve the usability. https://t.co/bqbgtAoLFz
github.com
v0.8.0 - 2025-09-10 No changes to the index format (see Index format changelog). New commands: lexicmap utils merge-search-results: Merge a query's search results from multiple indexes. lexic...
@NatureBiotech
Nature Biotechnology
1 month
Efficient sequence alignment against millions of prokaryotic genomes with LexicMap https://t.co/QBuQ9vZ1iy
3
30
99
@strnr
Stephen Turner 🦋 @stephenturner.us
7 days
Efficient and accurate search in petabase-scale sequence repositories https://t.co/XDORBMJa8P 🧬🖥️🧪 MetaGraph: https://t.co/5VGSGCB30R Code: https://t.co/R6H4vXE4ti
2
23
76
@shenwei356
Wei Shen 沈 伟
15 days
I sincerely appreciate the opportunity to visit EMBL-EBI. The guidance and support I received from Zamin Iqbal, John Lees and other colleagues have been immensely valuable, leading to a positive transformation in my career path. 😀
@emblebi
EMBL-EBI
17 days
There are millions of openly available microbial genomes, but searching them can be slow. Until now 🥁 Introducing LexicMap, a new alignment tool that lets you search these data in minutes, helping track antibiotic resistance, trace outbreaks, and more. https://t.co/UnQCBDst65
0
0
8
@shenwei356
Wei Shen 沈 伟
1 month
Learn more:
0
0
2
@RayanChikhi
Rayan Chikhi
1 month
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open. https://t.co/dDBtAjfdYL
5
151
376
@AJamesMcCarthy
Andrew McCarthy
2 months
199
352
4K
@jim_elevator
Jim Shaw
2 months
skani v0.3.0 is released. https://t.co/dEkIzxIbDr
* 30-40% potential reduction in memory
* Breaking changes to indexing and searching databases
Calculate ANI for contigs, genomes. Search vs > 140k genomes: pre-indexed GTDB-R226 available for download.
github.com
Fast, robust ANI and aligned fraction for (metagenomic) genomes and contigs. - bluenote-1577/skani
0
23
54
@shenwei356
Wei Shen 沈 伟
3 months
Yes, I just recommended it to my students.
@smllmp
Samuel Lampa
3 months
https://t.co/We4bDz0cRb is shaping up to become one of the absolute top resources for learning hands on #bioinformatics and #genomics today!
0
2
11
@lemire
Daniel Lemire
3 months
7
15
81
@RayanChikhi
Rayan Chikhi
5 months
Slides from my talk (with Kamil Jaron) on a history of k-mers in bioinformatics:
1
31
87
@nomad421
𝕐
4 months
This seems like an awesome course! https://t.co/byOMSDIfCE! If there were more hours in the day, I'd want to put something like this together at UMD.
0
4
12
@shenwei356
Wei Shen 沈 伟
5 months
Also updated
- taxid-changelog to May, 2025 https://t.co/9AabacigLF
- gtdb-taxdump to GTDB r226 https://t.co/bKrIReUSOG
- ictv-taxdump to VMR_MSL40
0
0
0
@shenwei356
Wei Shen 沈 伟
5 months
TaxonKit v0.20.0 is adapted to recent rank changes in NCBI Taxonomy.
github.com
Changes TaxonKit v0.20.0 This version is mainly for maintaining compatibility with NCBI's recent changes(1, 2). Please remove the ranks.txt file in ~/.taxonkit/ or other directories containi...
@NCBI
NCBI
8 months
Updates coming to #NCBITaxonomy! We are introducing two new ranks, domain and realm, and discontinuing the rank superkingdom. Learn more: https://t.co/IqvlgNqjG5
1
13
34
@shenwei356
Wei Shen 沈 伟
6 months
⚡️LexicMap v0.7.0 fixed a minor bug in index building and improved the alignment accuracy! Please rebuild the existing index. Sorry for the inconvenience. 🥹 https://t.co/05zJ4N1UdA
github.com
v0.7.0 - 2025-04-11 Please rebuild the index, as some seeds in the genome end regions were missed during computation. lexicmap index: Fix a little bug in seed desert filling -- forgot to fill the...
1
5
26
@KarelBrinda
Karel Břinda
6 months
A decade ago, we had thousands of bacterial genomes. Now, we have millions. How to scale computational methods? Our paper in @naturemethods answers this: use evolutionary history to guide compression and search. …From terabytes to tens of GBs… w/@Baym @ZaminIqbal et al. 🧵1/
3
52
168
@baym
Michael Baym
6 months
Thrilled that our work on this problem with @KarelBrinda, @ZaminIqbal, and others is out in @naturemethods today! We used phylogenetic compression (described in the thread) to compress every microbe ever sequenced onto a flash drive so that it can be searched with a laptop!
@baym
Michael Baym
3 years
So we asked: what sets the fundamental limit on computation on large genomic databases? Evolution! The irreducible entropy in genome collections is bounded by the most parsimonious path to introduce that variability. In other words, optimal compression should echo phylogeny. 4/
3
29
134
@krsahlin
Kristoffer Sahlin
7 months
As for my project (Project 1 in the list), please help spread the word to students interested in doing a PhD in Computational Biology/Bioinformatics. Note that PhD students here are employed with a salary, that the position comes with benefits, and that there are no tuition fees
1
2
4
@shenwei356
Wei Shen 沈 伟
7 months
🚀 LexicMap v0.6.0 is released! ✅ More accurate alignments! 🎯 Higher sensitivity for short queries (>100bp)! 💡 Denser seeds, same index size! 🔬 Function: Efficient seq alignment in millions of prokaryotic genomes! 📖 Docs: https://t.co/pyW1q73c7p https://t.co/oHgwT5c4k9
github.com
v0.6.0 - 2025-03-25 This version is compatible with indexes created by previous versions (requires a one-time, automatic preprocessing), but rebuilding the index is recommended for more accurate re...
0
7
25
@shenwei356
Wei Shen 沈 伟
7 months
thank you all!
0
0
1