DATA ENGINEERING
@dataengwithaws
Followers
3K
Following
4K
Media
20
Statuses
10K
upgrade your knowledge in DATA ENGINEERING with A W S
Joined August 2023
π· What we learned this week at @dataengwithaws π· Where to start as a beginner π· AWS tools for smarter pipelines π· Personal stories from the trenches π· How to build a modern lakehouse stack
0
0
3
"Fun Friday / Weekly Wrap" Light content, summarize week, build community feel. data eng tip before the weekend: take time to document your pipeline. yes even if itβs ugly. especially if itβs ugly. future you will thank you π·
4
0
6
(Summary thread): π· What we learned this week at @dataengwithaws π· Where to start as a beginner π· AWS tools for smarter pipelines π· Personal stories from the trenches π· How to build a modern lakehouse stack Follow for more next week π· and donβt forget to touch grass
1
0
1
"Fun Friday / Weekly Wrap" Light content, summarize week, build community feel. data eng tip before the weekend: take time to document your pipeline. yes even if itβs ugly. especially if itβs ugly. future you will thank you π·
0
0
1
Want a Notion board with step-by-step AWS projects? thinking of putting one together for free. Like this tweet if that sounds helpful π· might just drop it soon π·
0
1
2
"Deep Dive Thread Day" Showcase expertise, provide value, attract retweets. π· Building a Data Lake on AWS (2025 Edition) Tired of theory? Hereβs a real-world stack that works π· Ingest β Kafka / Kinesis Buffer β S3 (raw zone) Process β Glue / Lambda / EMR Store β S3
0
1
1
(Engagement tweet): Data engineers, whatβs one AWS service you wish you learned earlier? Iβll start: π· Step Functions. Saved me from messy spaghetti code flows. reply and share yours π·
0
0
3
"Real Talk Wednesday" Show vulnerability, personal story, engagement driver. I failed my first data pipeline project. like BAD. no retries broke on nulls cost $700 overnight but I learned more in those 3 days than any course ever taught me
0
0
2
One underrated AWS combo for beginners: π· CloudWatch + SNS Monitor logs or ETL job failures Automatically send alerts to email/Slack It's like having a robot yell at you when stuff breaks π· and honestly.. we all need that
0
0
2
"Tips & Tools Tuesday" Share actionable tips and resources for current & aspiring data engineers. If youβre still writing batch jobs for everything, try this instead: π· Lambda + EventBridge for real-time triggers π· Glue streaming jobs for incremental ETL π· Kinesis for
0
0
0
(Visual + Short Thread): π· 5 AWS services every beginner data engineer should mess with: (a tiny thread) S3 β your data lake starts here Glue β managed ETL + crawlers Lambda β serverless magic Athena β SQL on files π· CloudWatch β logs or it didnβt happen Just play with them.
0
0
3
"Foundations & Vibes" Attract beginners, build emotional connection, start light. data engineering seems complex when you're just starting out. pipelines, spark, cloud, kafka.. itβs a lot but here's the truth β you don't need to learn everything at once. start small. pick a tool.
0
0
0
π¬ Did you get both answers right? Let me know in the replies! π§΅π #AWS #Serverless #EventDriven (5/5) Let me know if you need any tweaks before posting! π
0
0
0
π― Key Takeaway: Lambda + SQS = Event-driven powerhouse πͺ πΉ Decouples producers & consumers πΉ Handles spikes in workload πΉ Scales automatically This duo is the backbone of many serverless architectures! (4/5)
1
0
1
π Why not the others? β SNS β Good for pub/sub but not core to event-driven workflows. β DynamoDB β Useful for data storage, but not essential for event processing. β EC2 β Provides compute, but not serverless like Lambda. β EBS β Block storage, not related to event-driven
1
0
0
π‘ Correct Answers: A) Lambda & E) SQS Why? π€ β
Lambda β Runs code in response to events, no servers to manage. β
SQS β Message queue that helps decouple producers & consumers. Together, they enable scalable & fault-tolerant event-driven architectures. (2/5)
1
0
0
β‘ Question: Which TWO AWS services are essential for building an event-driven architecture? A) Lambda B) SNS C) DynamoDB D) EC2 E) SQS F) EBS Drop your two answers below! π (1/5)
1
0
0
π¬ Did you get it right? Drop a β
in the comments if you did! If not, let me know what confused you. Let's learn together! π #AWS #Cloud #DataEngineering (4/4)
1
0
0
π Why not the others? β DynamoDB β NoSQL key-value & document DB, but not optimized for graph queries. β RDS β Relational database, not ideal for graph-based relationships. β EC2 β Just provides compute capacity, not a DB service. Neptune specializes in graph relationships!
1
0
0
π‘ Answer: B) Amazon Neptune Amazon Neptune is a fully managed graph database service designed for applications that need to analyze highly connected data. Use cases: πΉ Social networking πΉ Recommendation engines πΉ Fraud detection πΉ Knowledge graphs (2/4)
1
0
0