Javier Montero
@DataSlugger
Followers
260
Following
5K
Media
505
Statuses
4K
Sr. Data Engineer 🛠️| Baseball ❤️| ETL & Warehousing | Python Dev | @Medium Writer | Passionate ⚙ raw data ➡️ insights | Let's connect & explore data together!
Houston, TX
Joined June 2009
So excited that my ETL pipeline on #GitHub Actions started working without errors. I'll share more details in a blog post in a few weeks about my #Python ETL code, #Datavisualization & #PostgreSQL cloud DB.
1
0
5
Your Dockerfile:
- FROM ubuntu:latest
- Installs 47 packages
- Runs as root
- 2.3GB image size
- Rebuild takes 15 minutes
- Security scan shows 89 vulnerabilities

Your app:
- A Python script
- 3 dependencies
- Could run on Alpine in a 50MB image

You're not using containers.
141
251
5K
This week we wrapped up our @databricks series! We covered:
1. Importing Data from flat files and connecting to Data Sources
2. Analyzing data using SQL Editor and Notebooks
3. Visualizing and Creating Dashboards
4. Using AI to code and talk with our Databases
You can learn all
11
75
447
People focus too much on their career and not enough on their personal lives. We weren't meant to work 9-5, go home to watch TV all night, and repeat for 40+ years! We were made for connection and people. I want you to have a great career and be very successful, but that doesn't
8
31
195
You only need to read four books to truly get what’s going on in ML and data engineering:
- Fundamentals of Data Engineering by Joe Reis
- Designing Data Intensive Applications by Martin Kleppmann
- AI engineering by Chip Huyen
- Designing Machine Learning Systems by Chip
17
161
1K
If you want to get started with system design, learn these 12 concepts... 🧵
18
86
569
The best engineers I know delete more code than they write.
Junior engineers add features. Senior engineers remove complexity.
Every line of code you write is a liability. It needs to be maintained. It can break. It adds cognitive load to anyone who reads it later.
The best pull
44
37
403
My favourite stack for data projects (at the moment):
- Airflow - scheduling/orchestration of pipelines
- dbt - SQL-based transformations with testing + lineage
- Postgres - database
- Docker - containerisation
- Apache Superset - BI/visualisation
The best bit? All open source.
2
9
72
A common mistake I see: People treat message queues and streams as if they’re interchangeable. They’re not. Here’s the difference 👇
Message Queue (MQ)
- Designed for delivery guarantees (at-least-once, exactly-once).
- Great for sporadic workloads and asynchronous tasks.
-
15
109
997
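The queue-versus-stream distinction raised in the post above can be illustrated with a small toy model. This is only a sketch under stated assumptions: the post is cut off before it describes streams, so the stream half reflects the commonly described append-only-log model rather than the author's own wording. `ToyQueue` and `ToyStream` are hypothetical names and do not correspond to any real broker's API.

```python
from collections import deque


class ToyQueue:
    """Hypothetical in-memory queue: a message is gone once it is consumed."""

    def __init__(self):
        self._messages = deque()

    def publish(self, msg):
        self._messages.append(msg)

    def consume(self):
        # Removing the message means no other consumer will ever see it.
        return self._messages.popleft() if self._messages else None


class ToyStream:
    """Hypothetical in-memory stream: an append-only log with per-consumer offsets."""

    def __init__(self):
        self._log = []
        self._offsets = {}

    def publish(self, msg):
        self._log.append(msg)

    def consume(self, consumer_id):
        # Each consumer advances its own offset; the log itself is untouched.
        offset = self._offsets.get(consumer_id, 0)
        if offset >= len(self._log):
            return None
        self._offsets[consumer_id] = offset + 1
        return self._log[offset]


mq = ToyQueue()
stream = ToyStream()
for event in ["a", "b"]:
    mq.publish(event)
    stream.publish(event)

# Queue: the two messages are handed out once, then the queue is empty.
queue_reads = [mq.consume(), mq.consume(), mq.consume()]

# Stream: two independent consumers each read the full history.
stream_reads_x = [stream.consume("x"), stream.consume("x")]
stream_reads_y = [stream.consume("y"), stream.consume("y")]
```

In this toy model the practical difference is retention: the queue forgets a message after delivery, while the stream keeps it so that a new consumer can replay from the start.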
One of the often overlooked parts of being an EM is project management. You may not always have a dedicated project manager on your team, which means you have the ✨distinct pleasure✨ of filling that role. 🐞 New episode: Project Management as an EM https://t.co/wGZ4zaLojC
ladybug.dev
As an engineering manager, project management isn’t just a skill—it’s part of the job. In this episode, we unpack what effective project management looks like from the EM seat.
1
8
14
Format your SQL code in SQL Server Management Studio 22 with the free SQL Formatter from @madskristensen - I just got a PR approved to enable this (for SSMS) - download from VSIX Gallery #sqlserver #ssms
https://t.co/YG3u8DdIxP
vsixgallery.com
Adds support for the Format Document command for SQL files. Use Ctrl+K, Ctrl+D to execute the command to format the document. Configure the formatting behavior through options.
3
14
114
Did you know you can open a Fabric Warehouse in Visual Studio Code by simply clicking on the "Open in Visual Studio Code" button in the Warehouse Explorer? This is a game-changer for pro-developers! 🎉 #FabricWarehouse #MicrosoftFabric
1
1
10
It's pretty!
Announcing the release of SSMS 22 Preview 1
- New accessible themes
- Zoom results grid
- Improvements to connection dialog
- and much more
https://t.co/R30kVechBE
1
1
9
It's hard to believe when developers say they are firing many questions at LLMs to review and analyze their code and using the agent mode in their Visual Studio IDEs, but then claim in the same sentence that their code is free of AI assistance.
0
0
1
In my experience: Data Cleaning vs Data Transformation. Initially, I viewed data cleaning and transformation as similar tasks since both prepare data. However, I soon discovered that each serves a unique purpose, enriching my understanding and appreciation for the intricacies
0
0
0
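A small sketch may help separate the two ideas in the post above. It assumes one common reading: cleaning fixes quality problems (stray whitespace, missing values, duplicates) without changing what a record means, while transformation derives new fields for analysis. The sample rows, the function names `clean` and `transform`, and the choice to treat a missing hit count as 0 are all hypothetical and not taken from the post.

```python
# Hypothetical raw box-score rows, as they might arrive from a flat file.
raw_rows = [
    {"player": " Altuve ", "hits": "3", "at_bats": "4"},
    {"player": "Alvarez", "hits": "", "at_bats": "5"},
    {"player": " Altuve ", "hits": "3", "at_bats": "4"},  # exact duplicate
]


def clean(rows):
    """Cleaning: trim whitespace, cast types, fill blanks, drop duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        # Assumption: a blank count is treated as 0; other policies are possible.
        record = (
            row["player"].strip(),
            int(row["hits"] or 0),
            int(row["at_bats"] or 0),
        )
        if record not in seen:
            seen.add(record)
            cleaned.append(
                {"player": record[0], "hits": record[1], "at_bats": record[2]}
            )
    return cleaned


def transform(rows):
    """Transformation: derive a new analytical field from already clean rows."""
    return [
        {
            "player": r["player"],
            "batting_avg": round(r["hits"] / r["at_bats"], 3) if r["at_bats"] else None,
        }
        for r in rows
    ]


cleaned_rows = clean(raw_rows)
report_rows = transform(cleaned_rows)
```

Keeping the two steps in separate functions makes it possible to test data quality rules independently of the business logic that shapes the final output.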
Today, I interviewed a candidate for a Junior Machine Learning Engineer position. I began with basic questions, but the candidate used buzzwords to explain concepts. When I probed deeper into those terms, she couldn’t provide clear explanations. I asked her to write code for
133
120
3K
I have just ported the SQL Formatter by @madskristensen to SSMS 21! Format your SQL code based on the SQL ScriptDom formatting engine (and yes, raise your voice to fix its shortcomings) - download and install from here: https://t.co/HvIS4PDFyk
vsixgallery.com
6
19
112
📢 New in Visual Studio: Bring Your Own Model to Chat We’re expanding flexibility in Copilot for Visual Studio. You can now connect API keys from providers like OpenAI, Anthropic, and Google to run Chat on the model of your choice. This update gives you: ✅ Choice – Access the
7
39
193