
Apache Hudi
@apachehudi
Followers
3K
Following
298
Media
126
Statuses
493
Official twitter handle of Apache Hudi, an open data lakehouse platform. https://t.co/SXay7oHNah
Joined January 2019
Hudi 1.0 is the most powerful release to date for data lakehouses. Read the blog for details:. Secondary Indexing, Expression Indexes, Partial Updates, Non-blocking Concurrency Control, New LSM timeline, +more: #datalakehouse #opentableformat.
hudi.apache.org
Overview
0
10
34
RT @_xushiyan: โ๏ธ [Blog] Part 2 of building a RAG-based AI recommender ๐ค. A reliable, efficient data lakehouse isn't just a prerequisiteโitโฆ.
0
2
0
Read more about the support implementation from the RFC:
github.com
Upserts, Deletes And Incremental Processing on Big Data. - apache/hudi
0
0
0
๐ก Using SQL ๐๐๐๐ ๐ฉ๐ซ๐จ๐๐๐๐ฎ๐ซ๐๐ฌ can be a great help with inspecting and managing Hudi tables. Back in 2022, Hudi Spark integration added support for CALL procedures. You can perform a wide range of operations:. โ
๐
๐๐ญ๐๐ก ๐ญ๐๐๐ฅ๐ ๐ข๐ง๐๐จ: show table properties,
1
0
4
๐ Exciting news for the #ApacheHudi community!. Weโve enabled GitHub Discussions ๐.A dedicated space to ask questions, share ideas & collaborate. Join the conversation ๐
github.com
Explore the GitHub Discussions forum for apache hudi. Discuss code, ask questions & collaborate with the developer community.
0
0
2
After each write commit to your Hudi table, you can configure a callback function to be invoked, sending commit metadata to Kafka, Pulsar, or a custom HTTP endpoint to trigger downstream processing tailored to your business needs!. ๐Learn from the docs
hudi.apache.org
Apache Hudi provides the ability to post a callback notification about a write commit. This may be valuable if you need
0
0
1
๐ก You can configure ๐๐จ๐ฌ๐ญ-๐๐จ๐ฆ๐ฆ๐ข๐ญ ๐๐๐ฅ๐ฅ๐๐๐๐ค with @apachekafka , @apache_pulsar , and HTTP endpoints. ๐ More in the reply
1
0
3
๐ Follow instructions with more details in the release notes:
hudi.apache.org
Release 1.0.0 (docs)
0
0
0
1๏ธโฃ Restart your writer jobs with Hudi 1.x jar. Don't worryโit'll backward-compatible write to existing 0.x tables as long as you set ๐ต๐ผ๐ผ๐ฑ๐ถ๐ฒ.๐๐ฟ๐ถ๐๐ฒ.๐ฎ๐๐๐ผ.๐๐ฝ๐ด๐ฟ๐ฎ๐ฑ๐ฒ=false; if you're using metadata table or running async table services, disable them first.
1
0
0
๐ก ๐ง๐ถ๐ฝ ๐ผ๐ณ ๐๐ต๐ฒ ๐ฑ๐ฎ๐: Having some globally shared Hudi configs used by every job? .Set Hudi's global config directory and put those into ๐ต๐๐ฑ๐ถ-๐ฑ๐ฒ๐ณ๐ฎ๐๐น๐๐.๐ฐ๐ผ๐ป๐ณ !. #apachehudi #lakehouse #dataengineering
2
0
1
RT @Onehousehq: ๐ A new chapter "Running Hudi in Production" is now available in the early release of "Apache Hudiโข: The Definitive Guide"โฆ.
0
2
0
Join this month's developer sync call with PMC member @_xushiyan on July 23, 5 PM PT to learn about the Rust implementation of @apachehudi with API bindings in Python and C++ !. โญ๏ธ GitHub repo: ๐ Joining instructions:
0
4
6
RT @_xushiyan: ๐ก ๐๐ผ๐ ๐ฑ๐ผ๐ฒ๐ ๐ฑ๐ฎ๐๐ฎ-๐๐ธ๐ถ๐ฝ๐ฝ๐ถ๐ป๐ด ๐๐ผ๐ฟ๐ธ ๐ถ๐ป @apachehudi ?. The metadata table (a multi-modal index system) located within your Hudi tabโฆ.
0
3
0
RT @_xushiyan: ๐ New Blog: Building a RAG-based AI Recommender (Part 1/2). ๐ ๐ What's inside: .โฆ How RAG works endโฆ.
0
9
0
What's more, the 0.4.0 Hudi-rs artifacts are being added to @raydistributed and @daftengine to unlock additional query engine support such as incremental query, time-travel query, and reading MOR tables!.
1
0
1