Explore tweets tagged as #ApacheArrow
🚀 "Apache Arrow, l’outil révolutionnaire pour une analyse de données performante et interopérable" Découvrez avec @bluxte comment intégrer Arrow dans Elasticsearch pour des traitements ultra-efficaces et des échanges de données sans pareil. #ApacheArrow #SunnyTech2025
0
6
9
Spice OSS v1.4.0 is out!⚡ . ➡️ @ApacheDataFusio v47 & @ApacheArrow v55 upgrades for 10x faster TopK queries. ➡️ @awscloud Glue connectors for S3 w/ @ApacheIceberg. ➡️ @databricksU2M OAuth for secure user authentication. ➡️ Cron-based schedules for dataset refresh and workers
4
2
6
@ApacheDataFusio @ApacheParquet and @ApacheArrow = the FDAP stack appears to be gaining momentum. Here it is mentioned at community over code Asia. Thanks to @OnlyXuanwo for the picture
0
2
16
Excited for @ApacheArrow + @huggingface🤗. There is no better combo for AI datasets. Better than Pandas (doesn't enable Arrow by default) Even Spark is embracing it!. Why?.😴 Lazy loading 🤝️Zero-copy.⚡ Uses efficient c++ ⏸️ Columnar operations. And:.😍 Super easy
1
2
30
Excited to see continued improvements in embedded query processing in the @ApacheArrow C++ project:
4
7
52
Modern data systems are evolving, with Apache Arrow and Parquet at the heart of this transformation. In short, what are they, and how are they powering other Data Tools?. @ApacheArrow :.Apache Arrow is a columnar in-memory data format designed to standardize how structured data
0
16
108
Variant is coming soon to @ApacheParquet in Rust . Huge thanks to @CMUDB for getting the process started in @ApacheArrow with a great draft PR to kick off Variant support: 🙏🙏🙏 Thank you
1
11
94
Today is officially the launch day for the 2nd Edition of my book, "In-Memory Analytics with Apache Arrow"!! . Gummi Bear the Chihuahua says you should go get yourself a copy and learn all about @ApacheArrow . 1/3
2
8
30
New blog post 🚨 Every data engineer should read it. @kszucs_ (@ApacheArrow PMC) announces how to drastically speed up Parquet files uploads and downloads. Yes, it can easily outspeed S3. Best part: the feature enabling this is open source.Link in 🧵
1
3
24
Our recent migration to @duckdb and @ApacheArrow has shown 5-10x increases in Hex project execution speeds depending on project size and complexity. This means Hex is faster overall due to more efficient data loading and processing. Here’s the behind-the-scenes look at the
2
9
99
You can query a #DeltaLake table using #SQL, #GraphQL, and #REST #API queries! #roapi automatically spins up read-only APIs and query frontends via #datafusion and #apachearrow without requiring you to write a single line of code!. Try it out: #opensource
3
13
49
Introducing Rift: Our compute engine that's 10x faster & cheaper than Spark for feature engineering. Built with @ApacheArrow, @duckdb, and @raydistributed for maximum performance. Learn how we did it: #MLOps #FeatureEngineering.
1
14
34
How did @dremio take on self-serve analytics with a "lakehouse"🤔. Hear from Dremio Founder @tshiran to learn how they made data more accessible through an open source data lakehouse approach using @ApacheArrow & @ApacheIceberg💪. And built a $2B company while doing it😲
1
1
4
Performance regression system works. It helps us find regressions across all the federated data systems that we constantly test with every build of Spice. In our TPC-H tests, @ApacheArrow in-memory is the fastest SQL engine (expected) with @duckdb, both file and in-memory mode a
0
2
9
📢 Don't miss Wes McKinney (@wesmckinn) at Data Council this year!! 📢.(he's one of our community "OG"s 😉). 👉Principal Architect @posit_pbc 👉co-founder @voltrondata @ApacheArrow 👉 co-creator #Python #pandas, 👉author "Python for Data Analysis" and more, and more, and more 😅
1
3
17
.@ApacheArrow ADBC 14 has been released. Apache Arrow is a columnar in-memory analytics layer designed to accelerate #bigdata. The release is now available from here: #opensource
0
1
8