DatologyAI

@datologyai

Followers
2K
Following
173
Media
29
Statuses
130

DatologyAI builds tools to automatically select and optimize the best data on which to train AI models, leading to better, smaller models which train faster.

Redwood City, CA
Joined September 2023
@datologyai
DatologyAI
6 days
RT @leavittron: The era of "The Era of Pretraining is Over" is over.
0
7
0
@datologyai
DatologyAI
6 days
RT @leavittron: Very excited to announce BeyondWeb, @datologyAI’s synthetic pretraining data generation paradigm. BeyondWeb is a rephrasing…
0
41
0
@datologyai
DatologyAI
6 days
RT @pratyushmaini: 1/ Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns. Today @datologyai shares…
0
124
0
@datologyai
DatologyAI
1 month
RT @code_star: Training efficiency is hard, but getting easier to manage all the time. You can rent high speed interconnected h100s on dema…
0
4
0
@datologyai
DatologyAI
1 month
RT @code_star: We are looking for a post-training lead at @datologyai. we have gpus, you can make them go brrrr
0
18
0
@datologyai
DatologyAI
2 months
The joy of research is in sharing it. And asking the hard questions together. Here’s to a summer of curiosity, great conversations, and rabbit holes we didn't expect to fall into 🚀. Stay data-obsessed!
blog.datologyai.com
We're hosting a weekly data seminar series at Datology AI featuring fun and thoughtful researchers pushing the boundaries of pretraining and data curation. Are you data-obsessed yet?
0
0
5
@datologyai
DatologyAI
2 months
Working on something fun in the data space? We'd love to have you join us! Just drop us a DM! We'll take it from there ✨
1
0
2
@datologyai
DatologyAI
2 months
The vibe? Casual, thoughtful, nerdy. Think "office hours meets research seminar." We keep live sessions intimate (just our team + speaker) to encourage candid discussions about early-stage ideas, then share recordings on YouTube for everyone 📹
1
0
4
@datologyai
DatologyAI
2 months
Each week we bring in researchers doing cutting-edge research on:
🔬 Dataset design & scaling laws
🤖 Synthetic data & alignment
🧹 Data contamination & unlearning
🎯 Anything weird & interesting about data
1 hr of talk + open discussion. Posted on YouTube. Here’s the lineup
1
1
10
@datologyai
DatologyAI
2 months
🌞 We're excited to share our "Summer of Data Seminar" series at @datologyai! We're hosting weekly sessions with brilliant researchers diving deep into pretraining, data curation, and everything that makes datasets tick. Are you data-obsessed yet? 🤓 Thread 👇
1
8
41
@datologyai
DatologyAI
2 months
RT @LucasAtkins7: We teamed up with @datologyai to build what we believe is the strongest pretraining corpus in the world—and I truly think…
0
5
0
@datologyai
DatologyAI
2 months
RT @arimorcos: Congratulations to our friends and partners @arcee_ai on the release of AFM-4.5B! With data powered by @datologyai, this mo…
0
11
0
@datologyai
DatologyAI
2 months
Congrats to @LucasAtkins7 and @arcee_ai on a fantastic model release! DatologyAI powers the data behind AFM-4.5B, and we're just getting started.
@LucasAtkins7
Lucas Atkins
2 months
Our customers needed a better base model <10B parameters. We spent the last 5 months building one. I'm delighted to share a preview of our first Arcee Foundation Model: AFM-4.5B-Preview.
0
3
32
@datologyai
DatologyAI
2 months
RT @gm8xx8: Datology CLIP Models. DatologyAI releases two SOTA CLIP ViT-B/32 variants: classification-optimized and retrieval-optimized, ac…
0
13
0
@datologyai
DatologyAI
2 months
RT @LucasAtkins7: .@datologyai is pushing the frontier, with data curation as its standout advantage. After working closely with the team…
0
5
0
@datologyai
DatologyAI
3 months
RT @RicardoMonti9: .@datologyai is back: state of the art CLIP model performance using data curation alone 🚀 ✅ state-of-the-art ViT-B/32…
0
23
0
@datologyai
DatologyAI
3 months
RT @leavittron: That's why you need @datologyai.
0
3
0
@datologyai
DatologyAI
3 months
RT @arimorcos: We couldn't agree more. If you also believe this, come work with us @datologyai to help drive frontier research and engineer…
0
4
0
@datologyai
DatologyAI
4 months
RT @thao_nguyen26: 📢 Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains! 📅 Deadline: Ma…
0
26
0
@datologyai
DatologyAI
5 months
RT @LucasAtkins7: What an insane get for an insane team. We’ve been working with @datologyai closely and I assure you if anything they sell…
0
5
0