Sameer Agarwal
@sagrw
Followers: 779
Following: 70
Media: 5
Statuses: 124
Co-Founder & CTO @DeductiveAI. Previously led query engines @Facebook, open-source @ApacheSpark @Databricks, created #BlinkDB, PhD in Databases @UCBerkeley
Palo Alto, CA
Joined April 2009
Reasoning over extremely large amounts of data under uncertainty is one of the most fascinating problems in computer science. In this first of many under-the-hood posts, we describe how we think about incident investigation at @DeductiveAI as an online hypothesis-ranking
deductive.ai
Learn how Bayesian inference helps AI agents reason about production incidents by ranking root-cause hypotheses, updating beliefs incrementally, and converging under noisy, contradictory signals.
0
3
3
Every major outage I have seen had two root causes: a small bug in the system and a big assumption in someone’s head. We always over-index on fixing the bug and under-index on exposing the assumption.
0
0
2
Over the past year, we’ve been focused on a simple idea: engineers shouldn’t have to sift through dashboards, logs, and code to understand why something broke. Working with several amazing customers like @DoorDash, @Foursquare, and @Kumo_ai_team, we’ve seen the same pattern
0
1
5
It was standing room only for the best SRE leaders in SF to learn if they would still have a job in 5 years 🔥🔥🔥 At the InfraSF meet-up last week @mipsytipsy @__Achille__ @sagrw gave incredible ⚡️ talks on the future of o11y - the physics of computing scrapping your plans, the
3
2
41
I’ve been in the AI trenches since 2009, and LLMs are certainly a game-changer. But they also seem to be a warm-up act for the main event—the next cycle of AI innovation, coming in the next 12-18 months. Here are 3 areas we’re looking at to fuel this cycle, where founders can
39
126
757
#EuroSys23 Test-of-Time Award: "BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data" by Sameer Agarwal, Barzan Mozafari, Aurojit Panda, Henry Milner, Samuel Madden, and Ion Stoica. Congratulations!
1
2
11
#ApacheSpark 2.3 is out! A blog post on some of the key features: https://t.co/AghWgT1gYG or download it at
1
60
89
Announcing Microsoft Azure Databricks! A fast, easy and collaborative Apache® Spark™ based analytics platform optimized for Azure. https://t.co/GobjZ5lNvM
#analytics #Azure
2
80
80
Processing a billion rows/s in @ApacheSpark isn't cool anymore. You know what's cool? Processing a trillion rows/s
databricks.com
This blog post descr
0
0
2
Join @sagrw at #SparkSummit East to learn about the new #ApacheSpark features that help deal with bad actors in ETL https://t.co/w6NyXBwOde
0
3
8
Start New Year 2017 with Some Spark! Join @databricks & @workday for #ApacheSpark Meetup. Check it out and RSVP!
0
1
0
Get under the hood of #ApacheSpark 2.0's execution engine with @sagrw at #SparkSummit EU https://t.co/om6CvskxML
0
7
10
Tweet your favorite @ApacheSpark use case & win a free ticket to #SparkSummit in Brussels! #SparkSummitContest
https://t.co/Ij0EjZJb8g
0
8
1
Yelp is keeping up with the times by adding a filter for "PokéStops Nearby"
0
1
5
.@acmsigmod2016 this year will feature office hours with VCs! 💰
0
0
2
🆒 @rxin @sagrw @davies 🎙 Whole-stage code generation in version 2.0 of @ApacheSpark 🚀 https://t.co/TmCwoW8562
0
6
4
Read about why Spark 2.0 is 10x faster than its predecessors!
databricks.com
Discover how Apache Spark can join a billion rows per second on a laptop, showcasing its powerful data processing capabilities.
0
0
0
The emperor asked the vizier how he had accomplished this feat, to which the vizier uttered one word: “hyper-log-log”
0
1
1
#ApacheSpark 2.0 Technical Preview Now Available on Databricks Community Edition. https://t.co/7rKGoiw7wE
0
52
36