Apache Hudi

@apachehudi

Followers
3K
Following
298
Media
126
Statuses
493

Official twitter handle of Apache Hudi, an open data lakehouse platform. https://t.co/SXay7oHNah

Joined January 2019
@apachehudi
Apache Hudi
9 months
Hudi 1.0 is the most powerful release to date for data lakehouses. Read the blog for details. Secondary Indexing, Expression Indexes, Partial Updates, Non-blocking Concurrency Control, New LSM timeline, +more. #datalakehouse #opentableformat
hudi.apache.org
Overview
0
10
34
@apachehudi
Apache Hudi
10 hours
💼 We've changed the schedule of office hours to cater for different timezones! 🌎 America-friendly time: Every Wednesday, 9 AM Pacific Time. 🌏 Asia-friendly time: Every Thursday, 10 AM Indian Standard Time. 🔗 Check out the office hours page for meeting links to join!
0
1
1
@apachehudi
Apache Hudi
11 hours
RT @_xushiyan: โœ๏ธ [Blog] Part 2 of building a RAG-based AI recommender ๐Ÿค–. A reliable, efficient data lakehouse isn't just a prerequisiteโ€”itโ€ฆ.
0
2
0
@apachehudi
Apache Hudi
1 day
Read more about the support implementation from the RFC:
github.com
Upserts, Deletes And Incremental Processing on Big Data. - apache/hudi
0
0
0
@apachehudi
Apache Hudi
1 day
💡 Using SQL CALL procedures can be a great help with inspecting and managing Hudi tables. Back in 2022, Hudi Spark integration added support for CALL procedures. You can perform a wide range of operations: ✅ Fetch table info: show table properties,
1
0
4
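A minimal sketch of how such a CALL statement might be assembled before handing it to Spark SQL. The helper `call_procedure_sql` and the table name `trips` are hypothetical, introduced only for illustration; `show_commits` with `table` and `limit` arguments is used here as an example procedure, so check the Hudi procedures docs for the exact names available in your version.

```python
def call_procedure_sql(procedure: str, **named_args) -> str:
    """Render a Spark SQL CALL statement using named arguments (name => value)."""

    def render(value) -> str:
        # Booleans must be checked before ints, since bool is a subclass of int.
        if isinstance(value, bool):
            return "true" if value else "false"
        if isinstance(value, (int, float)):
            return str(value)
        text = str(value)
        if "'" in text:
            # Keep the sketch simple: refuse values that would need escaping.
            raise ValueError("single quotes are not supported in this sketch")
        return f"'{text}'"

    rendered = ", ".join(f"{name} => {render(v)}" for name, v in named_args.items())
    return f"CALL {procedure}({rendered})"


# Hypothetical table name; with a Hudi-enabled SparkSession this would be
# executed as spark.sql(sql).show().
sql = call_procedure_sql("show_commits", table="trips", limit=10)
print(sql)
```

Building the statement as a string keeps the example runnable without a Spark cluster; in a real job the result would simply be passed to `spark.sql`.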
@apachehudi
Apache Hudi
2 days
🚀 Exciting news for the #ApacheHudi community! We've enabled GitHub Discussions 🎉. A dedicated space to ask questions, share ideas & collaborate. Join the conversation 👉
github.com
Explore the GitHub Discussions forum for apache hudi. Discuss code, ask questions & collaborate with the developer community.
0
0
2
@apachehudi
Apache Hudi
2 days
After each write commit to your Hudi table, you can configure a callback function to be invoked, sending commit metadata to Kafka, Pulsar, or a custom HTTP endpoint to trigger downstream processing tailored to your business needs! 👉 Learn from the docs
hudi.apache.org
Apache Hudi provides the ability to post a callback notification about a write commit. This may be valuable if you need
0
0
1
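A rough sketch of what the writer options for an HTTP commit callback could look like. The helper name and the endpoint URL are hypothetical, and the two config keys are written from memory of the Hudi write-commit-callback docs, so treat them as assumptions and confirm them against the linked docs page for your version.

```python
def http_commit_callback_options(callback_url: str) -> dict:
    """Build writer options that turn on the post-commit HTTP callback.

    Assumed keys (verify in the Hudi docs):
      hoodie.write.commit.callback.on       - enables the callback
      hoodie.write.commit.callback.http.url - endpoint that receives commit metadata
    """
    if not callback_url.startswith(("http://", "https://")):
        raise ValueError("callback_url must be an http(s) URL")
    return {
        "hoodie.write.commit.callback.on": "true",
        "hoodie.write.commit.callback.http.url": callback_url,
    }


# Hypothetical endpoint, for illustration only.
callback_opts = http_commit_callback_options("https://example.com/hudi/commits")

# In a Spark job these would be merged into the usual Hudi write options, e.g.
# df.write.format("hudi").options(**table_opts, **callback_opts).mode("append").save(path)
print(callback_opts)
```

Kafka and Pulsar callbacks follow the same pattern with their own callback class and connection settings, which are left out here rather than guessed.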
@apachehudi
Apache Hudi
2 days
💡 You can configure post-commit callback with @apachekafka, @apache_pulsar, and HTTP endpoints. 👇 More in the reply
1
0
3
@apachehudi
Apache Hudi
3 days
👉 Follow instructions with more details in the release notes:
hudi.apache.org
Release 1.0.0 (docs)
0
0
0
@apachehudi
Apache Hudi
3 days
2๏ธโƒฃ Upgrade and resume table service runners with the 1.x jar.3๏ธโƒฃ Upgrade readers with the 1.x jar.4๏ธโƒฃ Now turn on ๐—ต๐—ผ๐—ผ๐—ฑ๐—ถ๐—ฒ.๐˜„๐—ฟ๐—ถ๐˜๐—ฒ.๐—ฎ๐˜‚๐˜๐—ผ.๐˜‚๐—ฝ๐—ด๐—ฟ๐—ฎ๐—ฑ๐—ฒ in writer config, and the migration is done! Time to profit.
1
0
0
@apachehudi
Apache Hudi
3 days
1๏ธโƒฃ Restart your writer jobs with Hudi 1.x jar. Don't worryโ€”it'll backward-compatible write to existing 0.x tables as long as you set ๐—ต๐—ผ๐—ผ๐—ฑ๐—ถ๐—ฒ.๐˜„๐—ฟ๐—ถ๐˜๐—ฒ.๐—ฎ๐˜‚๐˜๐—ผ.๐˜‚๐—ฝ๐—ด๐—ฟ๐—ฎ๐—ฑ๐—ฒ=false; if you're using metadata table or running async table services, disable them first.
1
0
0
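The writer-side part of the migration steps in this thread can be sketched as a two-phase toggle of writer options. The phase names `compat` and `upgrade` and the helper are hypothetical; `hoodie.write.auto.upgrade` comes from the thread, while using `hoodie.metadata.enable` to switch off the metadata table is an assumption to verify against the release notes. Disabling async table services is not modelled here.

```python
AUTO_UPGRADE = "hoodie.write.auto.upgrade"
METADATA_ENABLE = "hoodie.metadata.enable"  # assumed key for the metadata table


def writer_options_for_phase(base_options: dict, phase: str) -> dict:
    """Return a copy of the writer options adjusted for a migration phase.

    compat  - step 1: run the 1.x jar against a 0.x table without upgrading it
    upgrade - step 4: allow the writer to upgrade the table
    """
    options = dict(base_options)  # never mutate the caller's dict
    if phase == "compat":
        options[AUTO_UPGRADE] = "false"
        options[METADATA_ENABLE] = "false"
    elif phase == "upgrade":
        options[AUTO_UPGRADE] = "true"
    else:
        raise ValueError(f"unknown phase: {phase!r}")
    return options


# Hypothetical base options for an existing job.
base = {"hoodie.table.name": "trips", METADATA_ENABLE: "true"}
step1 = writer_options_for_phase(base, "compat")
step4 = writer_options_for_phase(base, "upgrade")
print(step1)
print(step4)
```

Between the two phases, table service runners and readers are moved to the 1.x jar (steps 2 and 3); only then is the `upgrade` phase applied.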
@apachehudi
Apache Hudi
3 days
๐Ÿ’ก ๐˜๐จ๐ฎ ๐œ๐š๐ง ๐ž๐š๐ฌ๐ข๐ฅ๐ฒ ๐ฆ๐ข๐ ๐ซ๐š๐ญ๐ž ๐ญ๐จ ๐‡๐ฎ๐๐ข ๐Ÿ.๐ฑ ๐ญ๐จ ๐ ๐š๐ข๐ง ๐ฆ๐š๐ฃ๐จ๐ซ ๐ฉ๐ž๐ซ๐Ÿ๐จ๐ซ๐ฆ๐š๐ง๐œ๐ž ๐›๐จ๐จ๐ฌ๐ญย ๐Ÿš€. Steps ๐Ÿ‘‡
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
1
3
@apachehudi
Apache Hudi
23 days
By default, /etc/hudi/conf/ is the directory where the file goes. Alternatively, set the env var HUDI_CONF_DIR to choose another directory. Happy lakehousing!
0
0
0
@apachehudi
Apache Hudi
23 days
✅ Perfect for configs like "hoodie.parquet.compression.codec" or "hoodie.datasource.write.hive_style_partitioning", so you can cut down on repeated configs and easily manage a standardized table setup across your data lakes.
1
0
0
@apachehudi
Apache Hudi
23 days
๐Ÿ’ก ๐—ง๐—ถ๐—ฝ ๐—ผ๐—ณ ๐˜๐—ต๐—ฒ ๐—ฑ๐—ฎ๐˜†: Having some globally shared Hudi configs used by every job? .Set Hudi's global config directory and put those into ๐—ต๐˜‚๐—ฑ๐—ถ-๐—ฑ๐—ฒ๐—ณ๐—ฎ๐˜‚๐—น๐˜๐˜€.๐—ฐ๐—ผ๐—ป๐—ณ !. #apachehudi #lakehouse #dataengineering
2
0
1
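The lookup rule described in this thread (use HUDI_CONF_DIR if set, otherwise /etc/hudi/conf/, and look for hudi-defaults.conf there) can be sketched as below. The function name is hypothetical and this only mirrors the rule as stated in the thread; it is not Hudi's own loader.

```python
import os
import posixpath

DEFAULT_CONF_DIR = "/etc/hudi/conf"
CONF_FILE_NAME = "hudi-defaults.conf"


def resolve_hudi_defaults_path(env=None) -> str:
    """Return the path where the global Hudi defaults file is expected.

    HUDI_CONF_DIR, when set and non-empty, overrides the default directory.
    """
    env = os.environ if env is None else env
    conf_dir = env.get("HUDI_CONF_DIR") or DEFAULT_CONF_DIR
    # posixpath keeps the result stable regardless of the host OS.
    return posixpath.join(conf_dir, CONF_FILE_NAME)


default_path = resolve_hudi_defaults_path({})
custom_path = resolve_hudi_defaults_path({"HUDI_CONF_DIR": "/opt/hudi/conf"})
print(default_path)
print(custom_path)
```

Passing the environment in as a plain mapping makes the rule easy to check in isolation; calling the function with no argument reads the real process environment.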
@apachehudi
Apache Hudi
1 month
RT @Onehousehq: 🎉 A new chapter "Running Hudi in Production" is now available in the early release of "Apache Hudi™: The Definitive Guide"…
0
2
0
@apachehudi
Apache Hudi
1 month
Join this month's developer sync call with PMC member @_xushiyan on July 23, 5 PM PT to learn about the Rust implementation of @apachehudi with API bindings in Python and C++! ⭐️ GitHub repo: 👉 Joining instructions:
0
4
6
@apachehudi
Apache Hudi
1 month
RT @_xushiyan: 💡 How does data-skipping work in @apachehudi? The metadata table (a multi-modal index system) located within your Hudi tab…
0
3
0
@apachehudi
Apache Hudi
2 months
RT @_xushiyan: 🚀 New Blog: Building a RAG-based AI Recommender (Part 1/2). 👉 📚 What's inside: ✦ How RAG works end…
0
9
0
@apachehudi
Apache Hudi
2 months
Interested in contributing to the next release? Explore starter issues here:
0
0
0
@apachehudi
Apache Hudi
2 months
What's more, the 0.4.0 Hudi-rs artifacts are being added to @raydistributed and @daftengine to unlock additional query engine support such as incremental query, time-travel query, and reading MOR tables!
1
0
1