Explore tweets tagged as #DataSets
@BaselightDB
Baselight
19 hours
Finding it hard to analyze data with SQL? That changes today with Baselight AI, your data analyst copilot! Baselight AI lets you extract insights from Baselight datasets (and your own uploads!), audit reasoning, and verify the data used to reach conclusions. We're launching
4
12
20
@PhysInHistory
Physics In History
2 years
This is the most detailed model of a human cell to date, obtained using X-ray, NMR, and cryo-electron microscopy datasets. ‘Cellular landscape cross-section through a eukaryotic cell.’ - by Evan Ingersoll and Gael McGill.
334
3K
11K
@XDWang101
XuDong Wang
8 hours
Objectness should be user-defined — not human-label-defined! Unsupervised SAM 2 (UnSAMv2) makes it real✨ 1 point + a continuous granularity slider = the mask you want! UnSAMv2 beats SAM2: +16% NoC-90, +26% 1-IoU, +37% AR on 11+ datasets (w/ just 6k unlabeled images)!💪 1/n
1
10
13
@omoalhajaabiola
Omoalhaja
2 years
I will be posting some datasets soon. We will have some questions related to the datasets as a way of building our portfolio in public. Interested?
56
18
196
@ratafar13
Rukayat Rauf | #DataFestAfrica2025 🇳🇬 🇫🇷
2 years
Using ChatGPT to generate datasets. We have been here for the past two hours. Anyway, we are still together; so far it's giving me the results I want 😁. Slow and steady wins the race.
4
0
23
@tidalyst
Idarabong
11 months
If I made a folder for datasets, would you use it? I've been thinking about how to pay it forward and do my own bit; after all, I have consumed a lot of "free" content myself.
1
1
3
@this_investor
That Investor
11 hours
Back in July 2024, I made a long-term bet on quantum computing. Here is Why I am Now Shorting It All $QBTS $IONQ $RGTI $QUBT Long Term Thesis: Quantum will transform AI, simulation, and any workload that demands compute with massive datasets. Timing: In 2024 Market didn’t
5
1
9
@alex_prompter
Alex Prompter
7 days
This one blew my mind 🤯 Alibaba just released a paper called AgentEvolver and it basically turns agent training into a self-improving loop that doesn’t need human-made datasets or brute-force RL. Instead of relying on expensive task construction, random exploration, and giant
45
144
661
@tom_doerr
Tom Dörr
9 days
List of public real-time datasets and sources
2
49
609
@adithya_s_k
Adithya S K
12 hours
Nano Banana Pro just changed the entire game for ancient manuscript OCR. Until now you had to rely on messy scans, huge datasets, and expensive manual annotations. You can now generate high-quality synthetic ground truth in ancient styles with almost no effort. This is easily my
9
20
308
@JOSEonsol
JOSEon.sol
3 days
OPENLEDGER STUDIO IS LIVE 🚀 After the mainnet moment, it finally feels real: @OpenledgerHQ Studio is up and running! And everything we've been imagining for months is now working. Today I was able to log in, explore, test, review datasets, see the leaderboard in action, and
16
0
20
@andimarafioti
Andi Marafioti
13 hours
Just dropped: 🎉 NVIDIA Nemotron-Parse v1.1 Next-gen OCR for parsing PDFs & PPTs into structured, machine-ready output (text + bounding boxes + semantic classes). Ready for commercial use and to generate datasets🚀 Check the examples on Hugging Face! https://t.co/pfOz13AQCz
6
37
211
@DiscoverWujiang
Discover Wujiang
18 hours
Kuavo humanoid robots🤖 have gained recognition! Developed by LEJU ROBOT in #Wujiang district, #Suzhou, they've recently snagged the first two digital intelligent property certificates in embodied AI robotics in Jiangsu province, for their cutting-edge industrial datasets: "Parts
0
0
3
@OpenledgerFdn
Openledger Foundation
2 days
The first phase of OpenLedger Mainnet is now live with the launch of OPEN Datanet Contribution. This marks the beginning of a fully on-chain pipeline where your datasets become the foundation for the next generation of specialized AI. The video walks through how anyone can log
51
71
287
@eddybuild
Eddy Xu
10 days
you can download Egocentric-10K on an apache 2.0 license here: https://t.co/0GZGRl36AG
6
33
473
@yohaniddawela
Yohan
17 hours
Climate datasets are the most commonly used geospatial data out there. But how accurate are they? A new study examines this (and the results are surprising):
1
7
18
@franzipol
franzipol
8 days
I finally found some time to compile and run LichtFeld Studio by @janusch_patas. Just finished my first test using a previously trained PLY, and everything is working smoothly. Next step will be training a #3dgs on one of my own datasets.
1
2
21
@ducha_aiki
Dmytro Mishkin 🇺🇦
19 hours
RoMa v2: Harder Better Faster Denser Feature Matching @Parskatt et 11 al. tl;dr: in title. Predict covariance per-pixel, more datasets, use DINOv3, adjust architecture. https://t.co/jLia5dKmFv
4
15
89
@moondreamai
moondream
3 days
Announcing RefCOCO-M, a refreshed RefCOCO with pixel-accurate masks and the problematic prompts removed. Better data for better evaluation. https://t.co/BqayflYv2v
17
53
1K