#DataSets X Hashtag | Muskviewer

Explore tweets tagged as #DataSets

Baselight

@BaselightDB

19 hours

Finding it hard to analyze data with SQL? That changes today with Baselight AI, your data analyst copilot! Baselight AI lets you extract insights from Baselight datasets (and your own uploads!), audit reasoning, and verify the data used to reach conclusions. We're launching

4

12

20

Physics In History

@PhysInHistory

2 years

This is the most detailed model of a human cell to date, obtained using x-ray, NMR and cryoelectron microscopy datasets. ‘Cellular landscape cross-section through a eukaryotic cell.’ - by Evan Ingersoll and Gael McGill.

334

3K

11K

XuDong Wang

@XDWang101

8 hours

Objectness should be user-defined — not human-label-defined! Unsupervised SAM 2 (UnSAMv2) makes it real✨ 1 point + a continuous granularity slider = the mask you want! UnSAMv2 beats SAM2: +16% NoC-90, +26% 1-IoU, +37% AR on 11+ datasets (w/ just 6k unlabeled images)!💪 1/n

1

10

13

Omoalhaja

@omoalhajaabiola

2 years

I will be posting some datasets soon. We will have some questions related to the datasets as a way of building our portfolio in public. Interested?

56

18

196

Rukayat Rauf | #DataFestAfrica2025 🇳🇬 🇫🇷

@ratafar13

2 years

Using ChatGPT to generate datasets. We have been here for the past two hours. Anyway, we are still together, and so far it's giving me the results I want 😁. Slow and steady wins the race.

4

0

23

Idarabong

@tidalyst

11 months

If I made a folder for datasets, would you use it? I've been thinking about how to pay it forward and do my own bit; after all, I have consumed a lot of "free" content myself.

1

3

That Investor

@this_investor

11 hours

Back in July 2024, I made a long-term bet on quantum computing. Here is why I am now shorting it all: $QBTS $IONQ $RGTI $QUBT. Long-term thesis: Quantum will transform AI, simulation, and any workload that demands compute with massive datasets. Timing: In 2024 Market didn’t

5

1

9

Alex Prompter

@alex_prompter

7 days

This one blew my mind 🤯 Alibaba just released a paper called AgentEvolver and it basically turns agent training into a self-improving loop that doesn’t need human-made datasets or brute-force RL. Instead of relying on expensive task construction, random exploration, and giant

45

144

661

Tom Dörr

@tom_doerr

9 days

List of public real-time datasets and sources

2

49

609

Adithya S K

@adithya_s_k

12 hours

Nano Bana Pro just changed the entire game for ancient manuscript OCR. Until now you had to rely on messy scans, huge datasets, and expensive manual annotations. You can now generate high-quality synthetic ground truth in ancient styles with almost no effort. This is easily my

9

20

308

JOSEon.sol

@JOSEonsol

3 days

OPENLEDGER STUDIO IS LIVE 🚀 After the mainnet moment, it finally feels real: @OpenledgerHQ Studio is up and running! And everything we've been imagining for months is now working. Today I was able to log in, explore, test, review datasets, see the leaderboard in action, and

16

0

20

Andi Marafioti

@andimarafioti

13 hours

Just dropped: 🎉 NVIDIA Nemotron-Parse v1.1 Next-gen OCR for parsing PDFs & PPTs into structured, machine-ready output (text + bounding boxes + semantic classes). Ready for commercial use and to generate datasets🚀 Check the examples on Hugging Face! https://t.co/pfOz13AQCz

6

37

211

Discover Wujiang

@DiscoverWujiang

18 hours

Kuavo humanoid robots🤖 have got recognition! Developed by LEJU ROBOT in #Wujiang district, #Suzhou, they've recently snagged the first two digital intelligent property certificates in embodied AI robotics in Jiangsu province, for their cutting-edge industrial datasets: "Parts

0

3

Openledger Foundation

@OpenledgerFdn

2 days

The first phase of OpenLedger Mainnet is now live with the launch of OPEN Datanet Contribution. This marks the beginning of a fully on-chain pipeline where your datasets become the foundation for the next generation of specialized AI. The video walks through how anyone can log

51

71

287

Eddy Xu

@eddybuild

10 days

You can download Egocentric-10K under an Apache 2.0 license here: https://t.co/0GZGRl36AG

6

33

473

Yohan

@yohaniddawela

17 hours

Climate datasets are the most commonly used geospatial data out there. But how accurate are they? A new study examines this (and the results are surprising):

1

7

18

franzipol

@franzipol

8 days

I finally found some time to compile and run LichtFeld Studio by @janusch_patas Just finished my first test using a previously trained PLY, and everything is working smoothly. Next step will be training a #3dgs on one of my own datasets.

1

2

21

Dmytro Mishkin 🇺🇦

@ducha_aiki

19 hours

RoMa v2: Harder Better Faster Denser Feature Matching @Parskatt et 11 al. tl;dr: in title. Predict covariance per-pixel, more datasets, use DINOv3, adjust architecture. https://t.co/jLia5dKmFv

4

15

89

moondream

@moondreamai

3 days

Announcing RefCOCO-M, a refreshed RefCOCO with pixel-accurate masks and the problematic prompts removed. Better data for better evaluation. https://t.co/BqayflYv2v

17

53

1K