Sam Arch 🇦🇺 Profile
Sam Arch 🇦🇺

@SamArchDB

Followers
542
Following
479
Media
2
Statuses
17

PhD Student in Databases @CMUDB with @andy_pavlo, Previously a Compiler Engineer @apple

Pittsburgh, PA
Joined July 2023
@VLDBconf
VLDB 2025 🇬🇧
3 months
🧵Congrats to the #VLDB2025 award winners at @VLDBconf (London)! 🏆 • Best research papers: “Diva” (@niv_dayan @UofT + @KTHuniversity) & “AnyBlox” (@JanaGiceva @tum_db) • Runner-up: “The Key to Effective UDF Optimization” by @CMUDB @andy_pavlo @pateljm
1
5
13
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
1 year
The latest paper from the #1 CMU-DB PhD student @SamArchDB is wild compilation DB magic! He automatically makes UDFs run 300x faster on @SQLServer and 1.3x faster on @duckdb. Code: https://t.co/Fm9n1qY8xu Paper:
github.com
PRISM is a UDF optimization framework that deconstructs a UDF into separate inlinable and outlinable pieces, resulting in simpler queries and faster query plans. - SamArch27/PRISM
@pvldb
PVLDB
1 year
Vol:18 No:1 → The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining https://t.co/dCl3qhCytr
2
31
226
@pvldb
PVLDB
1 year
Vol:18 No:1 → The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining https://t.co/dCl3qhCytr
1
15
60
@SamArchDB
Sam Arch 🇦🇺
1 year
The UDF world tour continues. This week, I'm stopping at UW Madison, Microsoft's Gray Systems Lab, and The University of Washington. See you all there. 10/24 @wiscdb 10/25 @GraySystemsLab 10/25 @uw_db
@SamArchDB
Sam Arch 🇦🇺
1 year
I am flying to the Bay Area to give talks about my new VLDB 2025 paper on UDFs with @andy_pavlo and @pateljm. With our new technique (UDF outlining), queries run up to 1000× faster than FROID. The paper will drop soon. 9/10 @databricks 9/11 @UCBerkeley 9/12 #HTAPSummit2024
2
3
33
@SamArchDB
Sam Arch 🇦🇺
1 year
I am flying to the Bay Area to give talks about my new VLDB 2025 paper on UDFs with @andy_pavlo and @pateljm. With our new technique (UDF outlining), queries run up to 1000× faster than FROID. The paper will drop soon. 9/10 @databricks 9/11 @UCBerkeley 9/12 #HTAPSummit2024
2
11
109
@clattner_llvm
Chris Lattner
1 year
This speaks to me: it’s the essence of building hard things that take time to have impact, but then matter in a big way - you have to love the process, not just the outcome!
@ash_lmb
Ash Lamb
1 year
Do what feels like play to you but looks like work to others.
11
56
482
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
1 year
VLDB'24 Paper #1: Collecting training data for ML models with DBs is $$$/slow. @wanshenl's Boot framework uses @PostgreSQL extensions to cut off redundant queries. Offline training goes from weeks to hours! • Code: https://t.co/4l090D8bZP • Paper:
github.com
Contribute to lmwnshn/boot development by creating an account on GitHub.
@pvldb
PVLDB
1 year
Vol:17 No:11 → Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems https://t.co/4xqZoDtgeh
1
10
68
@eatonphil
Phil Eaton
1 year
Video of @SamArchDB 's talk from NYC Systems August 2024 is now up! Dear UDFs, I Broke Up With You, But Now I'm Ready To Give You A Second Chance. Will You Take Me Back? Sincerely, SQL https://t.co/dy4SgltA6t
1
8
101
@jonobelotti_IO
Jonathon Belotti
1 year
🇦🇺s incoming
@eatonphil
Phil Eaton
1 year
The next NYC Systems talks are Thursday August 15th. Very pleased to have @jonobelotti_IO of @modal_labs speaking about serverless cold starts, and @SamArchDB of @CMUDB speaking about SQL UDFs! https://t.co/TN1JYnhdyt
0
1
8
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
1 year
@_TylerHillery We have three more bombs dropping this month from @lmwnshn + @SamArchDB + William Zhang. @abigale_kim just submitted her VLDB paper last week too. It's going to be a blow out year.
2
7
68
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
1 year
It took three years to finish, but our follow-up to the 2006 "What Goes Around Comes Around" is finally out! Stonebraker and I examine the last 20 years in databases and discuss why relational databases + SQL will continue to remain on top. 📄PDF: https://t.co/ZwTWSxXLWb
24
341
1K
@duckdb
DuckDB
2 years
We are proud to release the first major version of DuckDB, v1.0.0, codenamed "Snow Duck". This version is a culmination of almost six years of research and development. Today we are shipping an innovative database system with a backwards-compatible storage format. Check out our
24
272
995
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
2 years
Here is a short video from my recent interview with @SCSatCMU's media team on the one year anniversary of the infamous @Jeopardy "matrix" scandal:
instagram.com
1
4
8
@abigale_kim
Abigale Kim (@abigalekim.bsky.social)
2 years
I am excited to announce that I will start my PhD in database systems at @wiscdb and work with @xiangyao_yu beginning Fall 2024! I'm super grateful to everyone who has supported me along the way :)
9
4
151
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
2 years
My #1 PhD student @butro successfully completed his PhD defense. Thanks to the committee (@pateljm @justinesherry @samrmadden). Matt's thesis is on accelerating databases with eBPF (telemetry, proxies, OLTP stores). You have 60 days to hire him. Expect fierce competition.
3
9
261