Academic Torrents
@academictorrent
Followers
256
Following
41
Media
7
Statuses
205
A U.S. 501c3 nonprofit that provides a platform for sharing research data, BitTorrent education, and we coordinate volunteer hosting. Led by @josephpaulcohen
the internet
Joined December 2015
Grok-1, 3 days, 300GB, 5638 downloads, 1.6PB downloaded, 8.12MB/s average download speed. Here is a map of the 2000 hosting locations! #academictorrents
https://t.co/XkcK5WRyn9
1
2
6
academictorrents.com
Mistral 7B is a 7.3B parameter model that: - Outperforms Llama 2 13B on all benchmarks - Outperforms Llama 1 34B on many benchmarks - Approaches CodeLlama 7B performance on code, while remaining good...
magnet:?xt=urn:btih:208b101a0f51514ecf285885a8b0f6fb1a1e4d7d&dn=mistral-7B-v0.1&tr=udp%3A%2F% https://t.co/OdtBUsbMKD%3A1337%2Fannounce&tr=https%3A%2F%https://t.co/HAadNvH1t0%3A443%2Fannounce RELEASE ab979f50d7d406ab8d0b07d09806c72c
0
0
6
The Transmission BT team just released the long anticipated version 4! Congratulations @transmissionbt ! https://t.co/b3SmyZXPkS
github.com
Transmission 4.0.0 Highlights This is a major release, both in numbering and in effort! It's been in active development for over a year and has a huge list of changes -- over a thousand commits...
1
1
5
New Torrent! CAMUS Cardiac Acquisitions for Multi-structure Ultrasound Segmentation (Dataset)
1
0
0
New Torrent! HMC-QU echocardiography ultrasound recordings (Dataset)
academictorrents.com
The HMC-QU benchmark dataset is created by the collaboration between Hamad Medical Corporation (HMC), Tampere University, and Qatar University. The usage of data has been approved by the local ethics...
0
0
0
New Torrent! STructured Analysis of the Retina (Dataset)
academictorrents.com
The STARE (STructured Analysis of the Retina) Project was conceived and initiated in 1975 by Michael Goldbaum, M.D., at the University of California, San Diego. It was funded by the U.S. National...
1
0
3
New Torrent! Data of the White Matter Hyperintensity (WMH) Segmentation Challenge (Dataset)
0
0
0
New Torrent! Totalsegmentator CT Dataset (Dataset)
academictorrents.com
In 1204 CT images we segmented 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) covering a majority of relevant classes for most use cases. The CT images were randomly sampled...
0
0
1
New Torrent! Penn Treebank III 3 LDC99T42 (Dataset)
0
0
0
New Torrent! INbreast: toward a full-field digital mammographic database (Dataset)
0
1
1
New Torrent! The Oxford-IIIT Pet Dataset (Dataset)
academictorrents.com
We have created a 37 category pet dataset with roughly 200 images for each class. The images have a large variations in scale, pose and lighting. All images have an associated ground truth annotation...
0
0
0
New Torrent! The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions (Dataset)
0
0
0
New Torrent! Reddit comments/submissions 2005-06 to 2022-06 (Dataset)
academictorrents.com
Reddit comments and submissions from 2005-06 to 2022-06 collected by pushshift which can be found here These are zstandard compressed ndjson files. Example python scripts for parsing the data can be...
0
0
2
New Torrent! TAC KBP Comprehensive English Source Corpora LDC2018T03 (Dataset)
academictorrents.com
# TAC KBP Comprehensive English Source Corpora 2009-2014 See also the [training and evaluation data](). # Introduction TAC KBP Comprehensive English Source Corpora 2009-2014 was developed by the...
0
0
0
New Torrent! Spanish Gigaword 3rd edition LDC2011T12 (Dataset)
0
0
0
New Torrent! French Gigaword 3rd edition LDC2011T10 (Dataset)
academictorrents.com
# French Gigaword Third Edition - Linguistic Data Consortium ### Introduction French Gigaword Third Edition is a comprehensive archive of newswire text data that has been acquired over several years...
0
0
0
New Torrent! Chinese Gigaword 5th edition LDC2011T13 (Dataset)
academictorrents.com
# Chinese Gigaword Fifth Edition - Linguistic Data Consortium ### Introduction Chinese Gigaword Fifth Edition was produced by the Linguistic Data Consortium (LDC). It is a comprehensive archive of...
0
0
0
New Torrent! English Gigaword 5th edition LDC2011T07 (Dataset)
academictorrents.com
# English Gigaword Fifth Edition - Linguistic Data Consortium ### Introduction English Gigaword Fifth Edition is a comprehensive archive of newswire text data that has been acquired over several...
0
0
0
New Torrent! Abstract Meaning Representation AMR Annotation Release 3.0 LDC2017T10 (Dataset)
0
0
0
New Torrent! DARPA BOLT Egyptian Arabic Treebank Conversational Telephone Speech NLP LDC2021T12 (Dataset)
academictorrents.com
# BOLT Egyptian Arabic Treebank - Conversational Telephone Speech - Linguistic Data Consortium ### Introduction BOLT Egyptian Arabic Treebank - Conversational Telephone Speech was developed by the...
0
0
0