David Anderson
@alpinegizmo
Followers
1K
Following
21K
Media
83
Statuses
5K
Software Practice Lead @confluentinc, Committer @ApacheFlink Also: https://t.co/CQ9sy6smov
Seattle
Joined March 2007
We’ve had a blast building this. Hope you like it!
Have you ever found yourself thinking…Flink Watermarks…WTF? This new tool might be for you :) It's a hands-on, scrollytelling walkthrough of what watermarks in #ApacheFlink are, why they matter, and how to use them. Try it out: https://t.co/IS6hd1mwkf
0
1
2
Yesterday we hosted SF's most exclusive demo day. Only 10 teams, all of them with moonshot visions. Here's an inside look at the demos: (🧵)
57
123
2K
I have a new video up on YouTube — https://t.co/p0mnfCZ9tS — about doing complex event processing with #apacheflink. This might sound like an obscure topic, but I have found this technique to be super-powerful, and broadly applicable.
0
0
1
Prequal is a must-read paper that makes YouTube +5-10% faster. They show that load balancing w/ requests-in-flight and latency outperforms weighted round robin on CPU load (and why). A rare paper showing distributed systems research working in production. Here’s how: 1/4
6
40
367
It's not too hard to build a stream processor, right? Operators, network, add checkpoints, done. Making state and snapshots work really well and efficiently has been a continuous effort since we introduced this in @ApacheFlink in 2015. Here is a nice article revisiting all that
alibabacloud.com
This article provides a comprehensive overview of the state management evolution and the Flink 2.0 storage-computing separation architecture based on Alibaba's internal practices.
4
9
78
Watermarks are at the heart of what makes stream processing with #FlinkSQL possible. Join @alpinegizmo as he explains the ins and outs of watermarking in in the latest video from his new course on #ApacheFlink SQL. Check it out on Confluent Developer ➡️ https://t.co/TaqXU3HeYk
0
1
4
You haven't seen complexity in art until you've seen this 600-year-old painting. It's so detailed that modern medicine has been able to diagnose this man's exact type of blindness. And that's where things get mind-bendingly strange — this whole thing is an illusion... 🧵
489
6K
31K
He’s right! @alpinegizmo and I can’t stop, won’t stop practicing for #current24.
0
2
6
Meet the new emerging role: The Data Streaming Engineer. During our day 2 #Current24 Keynote, learn how this role is bringing together the work of app developers, data engineers, data scientists and more—potentially redefining the work you already do.
1
5
8
Blogged: Predicting the Future of Distributed Systems https://t.co/qwEYZ2SMkd
blog.colinbreck.com
There are significant changes happening in distributed systems.
3
57
220
Our pre-#current24 summer meetup series continues on July 25th with some #apacheFlink! 🐿️ Sign up now to catch @sharon_rxie and @alpinegizmo as they dive into timing in #flinkSQL and enrichment patterns. ⤵️ https://t.co/U5CA9t4OzD
meetup.com
Hello Streamers! This summer, we’d like to cordially invite everyone to join us in the Bay Area for a series of four meetups leading up to Current. Join us as we gather t
0
6
9
Adding queue semantics as an API over logs provides the cooperative consumption advantages of traditional queues while fixing some of the queue structure's disadvantages: lack of replay and the need for write amplification. Read more in @vanlightly's blog:
jack-vanlightly.com
With the announcement of KIP-932, Queues for Kafka , I thought it was worthwhile a revisit of the subject of queues vs logs and how we actually can build better queues on top of logs.
0
4
11
1/ There is a mini open source drama, this time because RedPanda, a company with a source-available Apache Kafka clone, bought an open source connector framework and made licensing and trademark changes to thwart other startup competition. (a thread)
1
34
209
I've spent the past ~3 weeks going through the entire history of deep learning and reimplementing all the core breakthroughs. It has completely changed my beliefs about deep learning progress and where we're headed. Progress tracker in thread (all resources at the end) 👇
54
363
3K
On July 25th, we're inviting #apacheFlink to our #apacheKafka party. Hear from @alpinegizmo and Sharon Xie. https://t.co/U5CA9t4gK5
meetup.com
Hello Streamers! This summer, we’d like to cordially invite everyone to join us in the Bay Area for a series of four meetups leading up to Current. Join us as we gather t
1
4
5
It’s no secret that LLM training data is running out. How close are we to the limit? To answer that, here's an estimate of the total amount of text in the world from every major source:
75
345
2K
👨🍳 Looking for practical #ApacheFlink use case examples? We prepared a cookbook with a collection of best practices, with real-life use cases including: 🌰 Joining and deduplicating data 🌰 Deserializing JSON from #Kafka Plus many more! Start cooking: https://t.co/oFD5QGZS8g
0
4
8
❌ Delete the duplicates! With exactly-once semantics, each message is delivered precisely once. But how does it work? Learn about the concept and how #ApacheFlink enables exactly-once processing for your real-time streaming data with our new video:
0
4
8
Looking for practical #ApacheFlink use case examples? Our experts, @alpinegizmo, @MartijnVisser82, and Chesnay Schepler, have gathered a collection of recipes 🧑🍳 to help you tackle a variety of common on-premise challenges. Explore the resources: https://t.co/oFD5QGZS8g
0
3
5
Thrilled to share that I'll be talking about using #ApacheFlink for data enrichment at #Current24 in Austin, Sept 17-18: https://t.co/1X5GL6LDxY Hope to see you there!
current.confluent.io
The Data Streaming Event. Now in Bengaluru, London and New Orleans.
0
0
3