
Jay Chia - daft.ai
@JayChia5
Followers
358
Following
151
Media
56
Statuses
424
Cofounder @ Eventual. Works on Daft (https://t.co/f2BxW6m2uo) the data engine for AI. LESS OOM MORE ZOOM
San Francisco, CA
Joined August 2022
Ohhhhh yeah data is so back baby.
i think this is comparatively slept on. the Current Thing if you depart from OAI (eg TML?) you are doing "we are extremely cracked and we will do custom RL for you!" as a service. 3 dudes in a basement immediately worth $500m. so it's notable when a former OAI RL person says
0
0
2
âWhatâs your Roman Empire?â. âDaft. URL downloads, dynamic streaming executionâŠ.â. đđđ.
A glimpse into life at Eventual, Series A announcement edition! Thank you all again for the love and support we received last week. Fun fact: Our meeting rooms are named after Daft Punk songs. Can you guess who is under the Daft Punk helmet?
0
0
4
Actually crazy. Went through our emails and yes we interviewed him back in 2022 for our first hire at Eventual. Did not make it past our bar, but I remember him being decent. Seems there's a pattern also of him approaching open-source companies.
PSA: thereâs a guy named Soham Parekh (in India) who works at 3-4 startups at the same time. Heâs been preying on YC companies and more. Beware. I fired this guy in his first week and told him to stop lying / scamming people. He hasnât stopped a year later. No more excuses.
1
4
15
Incredibly excited to work with the team at @felicis especially @AstasiaMyers who has been an absolute POWERHOUSE for us.
Multimodal is the new default for AI, and legacy infrastructure canât handle it. Eventual built @DaftEngine to redefine how we process video, audio, and images at scale. Weâre proud to lead their Series A and work with @sammy_sidhu and @JayChia5. đ
0
2
12
I couldn't be more proud of our team for this AMAZING milestone today!. To everyone who's followed this journey from the very beginning: a heartfelt thank you, and a promise - there's a revolution coming for multimodal data and AI. We're leading the charge :).
Today we're announcing that Eventual has raised $30M in Seed and Series A funding from @CRV and @felicis as well as @ycombinator, @M12vc and @Citi and others. The AI era needs data infrastructure built for AI, not retrofitted. đ§”
2
0
17
In fact, @desmondcheongzx vibe-coded a custom data sink that would stitch images in the dataframe together into a video and save that as the output of the pipeline. Wacky, but tbh skyâs the limit here :).
0
0
1
Multimodal/unstructured data often means user-defined data. Thatâs why youâre going to need User-Defined Data Sources and Sinks. This is how you get the best-in-class performance from the daft engine + integration with whatever crazy format you can cook up.
Introducing User-Defined Data Sources & Sinks. Now you can write to any format â propriety, vectorDB, whatever â with full distributed power in Daft. We even wrote a @trychroma sink in ~100 lines, LIVE demo + PR open đ„
2
0
5
Daft is now PySpark API-compatible :). Switching your Spark code to Daft is literally 2 lines of code. ```.from daft.pyspark import SparkSession.spark = SparkSession.builder.local().getOrCreate().```. #AntiSparkSocialClub.
â
No JVMs â
No JARs â
Local or Distributed â
One engine for all. Daft #LaunchWeek Day 3: SPARK CONNECT FOR DAFT đ. Switch from PySpark to Daft with just TWO lines of code and run the SAME Spark queries, but faster and simpler. And easily scale from local to distributed with Ray.
2
6
26
Cannot be understated how much of a paradigm shift this was for @daftengine . In analytics, your data usually gets SMALLER (aggregations, groupby etc). In multimodal/AI⊠it tends to EXPAND with HUGE heap memory usage. Think: downloading data from urls, running models etc.
đ„ Fixed batch sizes are old news. DAY 2 OF DAFT #LAUNCHWEEK: Introducing Dynamic Execution for Multimodal Data Processing. Daft is built to adapt in real time to multimodal workloads. Resize images, upload to S3, write to parquet â All optimized. All in one pipeline. âš Ditch
1
0
3