
Chengzhi Zhao
@ChengzhiZhao
Followers
105
Following
118
Media
11
Statuses
503
Data Engineer | Data Content Writer | AI | Contributor of Airflow, Flink | Personal Blog https://t.co/4D2l15P0FX | DIYer
Joined July 2011
π Ever wondered how Apache Flink handles late data? Check out my beginner's guide to Watermarks! Learn how Flink ensures accurate event-time processing. π‘π Read more here: #ApacheFlink #DataStreaming #BigData #TechGuide.
chengzhizhao.com
Struggling with late or out-of-order data? Learn how Apache Flink Watermarks work with event time to build accurate, reliable real-time stream processing systems.
0
0
0
π Uncover the truth about #ApacheSpark performance! π Coalesce(1) vs. Repartition(1) β which one wins? π Dive into the details and optimize your Spark jobs! #BigData #DataEngineering #TechBlog. Read more:
chengzhizhao.com
We will discuss a neglected part of Apache Spark Performance between coalesce(1) and repartition(1), and it could be one of the things to be attentive to when you check the Spark job performance.
0
0
0
π Struggling with Airflow schedules? Learn how to master `schedule_interval` in Apache Airflow with this easy guide! β° From basics to advanced tips, weβve got you covered. Check it out now! π #Airflow #DataEngineering #ETL #TechTips.
chengzhizhao.com
The airflow schedule interval could be a challenging concept to comprehend, even for developers work on Airflow for a while find difficult to grasp. A confusing question arises every once a while on...
0
0
0
π Discover how to use R for data analysis to find the perfect Cocomelon video for your kids! ππΆ Learn data-driven parenting with this fun tutorial. Check it out here: #DataScience #Parenting #Cocomelon #RStats.
chengzhizhao.com
I will share my journey on using R for Data Analysis: building an end-to-end solution for exploring trending Cocomelon videos using R from scratch.
0
0
0
π 5 game-changing tips for #DataProfessionals to master self-promotion! From building your brand to leveraging LinkedIn, this guide has it all. π‘ Check it out now: #CareerGrowth #DataScience #PersonalBrand.
chengzhizhao.com
Getting the work done isn't the journey's end. Your work should be your channel to get YOU self-promotion. I will give five tips to get self-promotion as data professionals
0
0
0
π Exciting read! "Data Engineering in 2025: A Practical Guide for New Grads" is here! ππ‘ Learn how to thrive in the AI-first era with key skills, tools, and trends. Perfect for fresh grads! π₯ #DataEngineering #AI #CareerGrowth #TechTrends. Read more:
chengzhizhao.com
Explore how AI in data engineering is shaping the future. This 2025 guide helps new grads build the skills, tools, and mindset to thrive in a cloud-driven, AI-first world.
0
0
0
π Learn how to visualize your monthly expenses in a comprehensive way by creating a Sankey diagram in R! Perfect for tracking spending habits. Check it out here: #DataViz #RStats #PersonalFinance #SankeyDiagram.
chengzhizhao.com
Personal budgeting APP like Mint/Personal Capital/Clarity only provide three limited types of charts. Have you ever wondered if charts are good enough to get better ideas on your monthly income and...
0
0
0
π Built a tool to visualize expenses in a Sankey diagram! πΈπ Check out how I did it and track your spending like a pro. #PersonalFinance #DataViz #TechBlog π
chengzhizhao.com
My main goal is to enable people without programming experience to use the powerful Sankey Diagram by simply uploading the transaction CVS file from the popular site Mint.com.
0
0
0
π Struggling with slow Spark jobs? Learn how to tackle data skew in Apache Spark like a pro! π Discover practical tips & tricks to optimize performance. #BigData #ApacheSpark #DataEngineering #Optimization. Read more here:
chengzhizhao.com
"Why my Spark job is running slow?" is an inevitable question. We will cover how to identify Spark data skew and how to handle data skew with different options, including key salting
0
0
0
π Learn how to create stunning data animations in R with this step-by-step guide! πβ¨ Perfect for visualizing trends & patterns. Check it out here: #DataScience #RStats #DataViz #Animation #Tutorial.
chengzhizhao.com
Have you seen any beautiful racing bar chart data animation on Youtube and wondered how it was built? I will show you how to use gganimate in R to animate data by creating a racing bar chart as an...
0
0
0
π Dive into the ultimate guide for #DataEngineering! From tools to best practices, this resource covers it all. Perfect for beginners & pros alike. Check it out now! π #Tech #BigData #DataScience.
chengzhizhao.com
The data engineering space is evolving. Here are the resources I collected for practical data engineering resource.
0
0
0
π Learn how to snag the best deals in real-time using R & Mage! ππ Check out this guide to automate deal hunting & save big. #DataScience #DealHunting #Automation #RStats.π
chengzhizhao.com
How to find the best deals and coupons promptly can save you money and time. We can quickly build a weekend project that automatically finds the best deals on time with R and Mage
0
0
0
π Exciting read! "Data Engineering in 2025: A Practical Guide for New Grads" is your roadmap to thriving in the AI-first era. Learn key skills, tools, and trends to stay ahead! ππ§ #DataEngineering #AI #CareerGrowth #NewGrads Read more:
chengzhizhao.com
Explore how AI in data engineering is shaping the future. This 2025 guide helps new grads build the skills, tools, and mindset to thrive in a cloud-driven, AI-first world.
0
0
0
π Beyond basic prompts! Discover how LLM MCP tackles real-world challenges with the Airflow 3.0 auto-update example. Learn advanced techniques & practical insights! π₯ #AI #MachineLearning #Airflow #TechInnovation. Read more:
chengzhizhao.com
Learn how LLM + MCP synergy revolutionizes complex tasks. An Apache Airflow 3.0 case study demonstrates auto-updating DAGs and overcoming AI limitations.
0
0
0
π Boost your #ApacheSpark performance! Learn how to optimize the UNION operator for faster query speeds. Check out these pro tips now! π #BigData #DataEngineering #PerformanceOptimization.
chengzhizhao.com
We will focus on the Apache Spark Union Operator Performance with examples, show you the physical query plan, and share techniques for optimization in this story.
0
0
0
π The AI Wake-Up Call for Data Engineers! Discover why LLMs & MCP are game-changers in data engineering. Don't get left behindβread now! π #AI #DataEngineering #LLMs #MachineLearning #TechTrends.
chengzhizhao.com
AI isn't coming for data engineering β it's becoming part of it. In this post, I explore how Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Model Context Protocol (MCP) are...
0
0
0
Learn how to create stunning data animations in R with this step-by-step guide! πβ¨ Perfect for visualizing trends & patterns. Check it out here: #DataScience #RStats #DataViz #Tutorial.
chengzhizhao.com
Have you seen any beautiful racing bar chart data animation on Youtube and wondered how it was built? I will show you how to use gganimate in R to animate data by creating a racing bar chart as an...
0
0
0
π Boost your #ApacheSpark performance with PySpark examples & discover new 4.0 features! Learn tuning tips & tricks in this ultimate guide. π₯ Check it out: #BigData #DataScience #PySpark #PerformanceTuning.
chengzhizhao.com
The ultimate guide to Apache Spark. Learn performance tuning with PySpark examples, fix common issues like data skew, and explore new Spark 4.0 features.
0
0
0
π₯ Data Engineering is heating up in June 2025! Check out the latest trends & innovations shaping the future. Donβt miss out! #DataEngineering #TechTrends #BigData #Innovation. Read more:
chengzhizhao.com
Stay current with the essential data engineering news from June 2025. This monthly roundup covers the biggest announcements from Databricks' Data + AI Summit, new Snowflake features, Apache Flink...
0
0
0
π 6 Side Project Ideas for Data Engineers! Whether you're new or experienced, these projects will sharpen your skills & boost your portfolio. Check it out here: #DataEngineering #SideProjects #CareerGrowth #TechSkills.
chengzhizhao.com
Data engineers can work on some side projects to get experience. Those projects could initiate impressive discussions to help you land a dream job. We will introduce 6 data engineering side project...
0
0
0