Explore tweets tagged as #DataSets
Finding it hard to analyze data with SQL? That changes today with Baselight AI, your data analyst copilot! Baselight AI lets you extract insights from Baselight datasets (and your own uploads!), audit reasoning, and verify the data used to reach conclusions. We're launching
This is the most detailed model of a human cell to date, obtained using x-ray, NMR and cryoelectron microscopy datasets. ‘Cellular landscape cross-section through a eukaryotic cell.’ - by Evan Ingersoll and Gael McGill.
Objectness should be user-defined — not human-label-defined! Unsupervised SAM 2 (UnSAMv2) makes it real✨ 1 point + a continuous granularity slider = the mask you want! UnSAMv2 beats SAM2: +16% NoC-90, +26% 1-IoU, +37% AR on 11+ datasets (w/ just 6k unlabeled images)!💪 1/n
I will be posting some datasets soon. We will have some questions related to the datasets as a way of building our portfolio in public. Interested?
Using ChatGPT to generate datasets. We have been here for the past two hours. Anyway, we are still together; so far it's giving me the results I want 😁. Slow and steady wins the race.
If I made a folder for datasets, would you use it? I've been thinking about how to pay it forward and do my bit; after all, I have consumed a lot of "free" content myself.
Back in July 2024, I made a long-term bet on quantum computing. Here is Why I am Now Shorting It All $QBTS $IONQ $RGTI $QUBT Long Term Thesis: Quantum will transform AI, simulation, and any workload that demands compute with massive datasets. Timing: In 2024 Market didn’t
This one blew my mind 🤯 Alibaba just released a paper called AgentEvolver and it basically turns agent training into a self-improving loop that doesn’t need human-made datasets or brute-force RL. Instead of relying on expensive task construction, random exploration, and giant
Nano Bana Pro just changed the entire game for ancient manuscript OCR. Until now you had to rely on messy scans, huge datasets, and expensive manual annotations. You can now generate high-quality synthetic ground truth in ancient styles with almost no effort. This is easily my
OPENLEDGER STUDIO IS LIVE 🚀 After the mainnet moment, it finally feels real: @OpenledgerHQ Studio is up and running! And everything we've been imagining for months is now working. Today I was able to log in, explore, test, review datasets, see the leaderboard in action, and
Just dropped: 🎉 NVIDIA Nemotron-Parse v1.1 Next-gen OCR for parsing PDFs & PPTs into structured, machine-ready output (text + bounding boxes + semantic classes). Ready for commercial use and to generate datasets🚀 Check the examples on Hugging Face! https://t.co/pfOz13AQCz
The first phase of OpenLedger Mainnet is now live with the launch of OPEN Datanet Contribution. This marks the beginning of a fully on-chain pipeline where your datasets become the foundation for the next generation of specialized AI. The video walks through how anyone can log
Climate datasets are the most commonly used geospatial data out there. But how accurate are they? A new study examines this (and the results are surprising):
I finally found some time to compile and run LichtFeld Studio by @janusch_patas. Just finished my first test using a previously trained PLY, and everything is working smoothly. Next step will be training a #3dgs on one of my own datasets.
RoMa v2: Harder Better Faster Denser Feature Matching @Parskatt et 11 al. tl;dr: in title. Predict covariance per-pixel, more datasets, use DINOv3, adjust architecture. https://t.co/jLia5dKmFv
Announcing RefCOCO-M, a refreshed RefCOCO with pixel-accurate masks and the problematic prompts removed. Better data for better evaluation. https://t.co/BqayflYv2v