Scott Haines
@newfront
Followers
833
Following
5K
Media
112
Statuses
3K
Developer Relations Engineer @ Buf • Speaker • Trainer | #DatabricksMVP | Author @OReillyMed | ❤️ #ApacheSpark. ❤️ #Dogs. #DatabricksMVP. Views are my own
California, USA
Joined November 2008
Congratulations to @dawnsongtweets, who has been selected as a 2025 #AI2050 Senior Fellow by @SchmidtSciences! @Berkeley_EECS The prestigious fellowship honors researchers advancing responsible innovation and the development of AI that benefits humanity.
We're excited to welcome 28 new AI2050 Fellows! This 4th cohort of researchers are pursuing projects that include building AI scientists, designing trustworthy models, and improving biological and medical research, among other areas. https://t.co/8oY7xdhxvF
0
2
5
Thu, Nov 11 @ 9 AM PT ✈️ Flight SQL + Delta: Building the Future of High-Speed Data Exchange Learn how @ApacheArrow Flight, DataFusion, and Delta Lake enable fast SQL over Arrow Flight, plus how Delta VACUUM/OPTIMIZE integrate into a Flight SQL server backed by DataFusion for
0
3
5
Enjoy over 250 free #Scala and #ZIO videos! As always, many thanks to @zivergetech for making these videos free for everyone! 🙏
We've finally uploaded all ZIO courses of the brilliant @alvinalexander on our YouTube Channel 💪 Check out the courses available: ➡️ Introduction to Scala 3 ➡️ Advanced Scala 3 ➡️ Introduction to Functional Programming ➡️ Functional Programming in Depth ➡️ Functional
0
21
69
This is a big deal -- ingest data into any lakehouse table with no infra setup, just calling a REST API. Simple, efficient and powerful.
Announcing Public Preview of Zerobus Ingest, part of Lakeflow Connect: Push event data directly into your lakehouse—no message bus required! Simplified ingestion. Reduced latency. Built for scale. - Up to 100 MB/sec per connection - Latency as low as 5 seconds - Native Delta +
1
3
36
Finished. Absolute, undeniable banger. Full review to come, but outstanding work @IslingtonJames @SagaPressBooks
23
15
434
Discover how deletion vectors can take Delta Lake performance to the next level. 🚀 In this 5-minute video, @YoussefMrini breaks down how deletion vectors speed up delete, update, and merge operations by marking rows logically—no need to rewrite entire files. 🎥 Watch:
0
3
6
We’re diving deep into Flight SQL + Delta to show how open technologies like Apache Arrow Flight, #DataFusion, and Delta Lake are reshaping high-performance analytics! 🚀 🗓️ November 11 🕝 9:00 AM PT 🔗 Register: https://t.co/sIZCvxPdde See how: 🔹 Flight SQL provides a
0
1
4
0
1
3
At MotherDuck, we want to make it easy for our customers to build Agentic Applications on top of our Database. With that in mind, we have written a guide on proven patterns for building these, as well as built functions with the AI engineer in mind. One of those functions is Use
0
3
15
For streaming data the gap between "hello world" tutorials and bulletproof production systems is filled with hard-learned lessons. Join Bartosz Konieczny and our own Scott Haines Oct 22 at 9am PT for Linux Foundation webinar “Streaming Data Design Patterns”
linuxfoundation.org
Get insights from the best open source projects and people. View one of our upcoming or on-demand webinars on topics from Kubernetes to security.
0
2
4
All set for @smalldatasf 2025 ✈️ Flights, Airbnb, and access confirmed. Can’t wait to learn what’s next in the data world, and how simplicity keeps winning. If you’re going, let’s connect. Say hi if you see me, we can talk #Data, #Arc, @duckdb , @motherduck, and more. I’ll be
0
2
3
Our SIGMOD paper with @XinyuZeng218 + @huanchenzhang + @wesmckinn + @pateljm on creating a next generation open-source data file format is out. F3 is a future-proof file format avoids the mistakes of Parquet. 📄 Paper: https://t.co/fnFwN9gxbZ 📁 Code: https://t.co/R5nhlVvsea
6
53
319
I know not everyone will have time to do a reread of book #1 before The Strength of the Few releases (3.5 weeks to go!!), so for those interested, here's the link to an ‘interlude’ I wrote - a bonus chapter that serves as a recap for The Will of the Many: https://t.co/zZsK60dPhr
40
96
991
30 minutes to go! 🚨 Join us at 9AM PT for Diving into Streaming Data Design Patterns for Delta Lake with @newfront and @waitingforcode. 🗓️ Oct 14 🎥 Streaming LIVE to X, YouTube & LinkedIn #opensource #oss #deltalake #linuxfoundation #streaming
⏰ Final Reminder – Delta Lake Webinar Tomorrow! Wondering if data engineering design patterns can unlock new insights into Delta Lake? Or how Delta Lake can become a key part of your streaming data architecture? Join @newfront (@bufbuild) and @waitingforcode as they tackle
0
1
4
⏰ Final Reminder – Delta Lake Webinar Tomorrow! Wondering if data engineering design patterns can unlock new insights into Delta Lake? Or how Delta Lake can become a key part of your streaming data architecture? Join @newfront (@bufbuild) and @waitingforcode as they tackle
1
2
5
📅 TODAY: Scott Haines (Buf) & Youssef Mrini explore #DeltaLake 4.0 at 12pm ET/9am PT ⏰ Learn about #DeltaConnect - read/write Delta Lake tables remotely from any #gRPC client + spark-connect on Buf! Sign up:
linkedin.com
Login to LinkedIn to keep in touch with people you know, share ideas, and build your career.
0
1
3
📣 Join us for our next 𝗢𝗽𝗲𝗻 𝗟𝗮𝗸𝗲𝗵𝗼𝘂𝘀𝗲 + 𝗔𝗜 webinar on October 30 at 9AM PT! Register ➡️ https://t.co/KFup8M8Q8B Most lineage graphs are overly complex and lack meaningful context, and years of chasing “total visibility” proved that more metadata doesn’t create
0
1
1
One of the most powerful new capabilities in Delta Lake 4.0 is support for collations, giving you much finer control over how text is compared and sorted. ✅ In this clip, @YoussefMrini explains how to enable collations: 🔹 Define a default collation at the table level 🔹
0
3
5
I’m excited to share that I’ll be speaking at Open Lakehouse + AI Paris on November 13! I’m looking forward to connecting with fellow builders, data enthusiasts, and open source advocates at this incredible event 🙌 👉 Register here: https://t.co/g2QjACVGvt
#OpenLakehouse #Paris
0
1
1