DataguyPhill Profile Banner
Phillip Manywanda Profile
Phillip Manywanda

@DataguyPhill

Followers
2
Following
17
Media
137
Statuses
310

Joined April 2024
Don't wanna be here? Send us removal request.
@DataguyPhill
Phillip Manywanda
10 months
@TDataImmersed @DabereNnamani βœ… Cleaned messy data βœ… Uncovered job trends βœ… Created powerful visuals EDA & Visualization are πŸ”‘ for Data Science! Want to see everything? Check out my notebook: 🌐 https://t.co/AYkbQwLt05 Which visualization do you use most? Let’s discuss! πŸš€πŸ
anaconda.com
0
0
0
@DataguyPhill
Phillip Manywanda
10 months
@TDataImmersed @DabereNnamani πŸ”₯ Seaborn for Advanced Plots Heatmap: Correlation between key variables πŸ”₯ Box Plot: Job title vs company ratings 🎭 Pair Plot: Relationships between salary, rating & founding year Aesthetics + Insights = πŸ’‘
0
0
0
@DataguyPhill
Phillip Manywanda
10 months
@TDataImmersed @DabereNnamani πŸ“‰ Matplotlib for EDA Histogram: Salary distribution πŸ’° Bar Chart: Top locations for Data Science jobs πŸ—ΊοΈ Line Plot: Salary trends by company size 🏒 Visualizing data brings numbers to life! πŸ”₯
0
0
0
@DataguyPhill
Phillip Manywanda
10 months
@TDataImmersed @DabereNnamani πŸ“Š EDA = Knowing Your Data Summary stats for Rating, Salary, and Revenue Identified top job titles & their average ratings Analyzed salary trends by company size EDA helps spot patterns & anomalies fast! πŸš€
0
0
0
@DataguyPhill
Phillip Manywanda
10 months
@TDataImmersed @DabereNnamani 🧼 Data Cleaning is the foundation of good analysis! Handled missing values πŸ•΅οΈ Extracted & cleaned Salary Estimate πŸ’° Standardized Company Names & Locations πŸ“ Data cleaning = better insights! βœ…
0
0
0
@DataguyPhill
Phillip Manywanda
10 months
πŸš€ Week 6 was all about Exploratory Data Analysis (EDA) & Visualization! I cleaned, analyzed, and visualized an uncleaned dataset of Data Science jobs using Pandas, Matplotlib & Seaborn. Let's break it down! 🧡 @TDataImmersed #TDI @DabereNnamani
5
0
0
@DataguyPhill
Phillip Manywanda
11 months
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
Wrap-Up & Full Notebook βœ… Data cleaned βœ… New features created βœ… Data merged βœ… Insights uncovered This was real-world data prep at its finest! Check out my full notebook here: 🌐 h https://anaconda.cloud/share/notebooks/bab3f1ea-092c-4be5-ac0d-4b16fad8224e/overview
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
String Cleaning & Deck Extraction πŸ”‘ Text manipulation in Pandas I extracted the deck from the Cabin column to analyze survival rates by deck. πŸ“· Question ➑️ πŸ“· My Solution Text data isn’t always cleanβ€”Pandas makes it easy!
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
πŸ”„ Merge vs. Concatenate? merge() = Joins datasets on a key (like PassengerId) concat() = Stacks datasets (vertically or horizontally) πŸ“· Question ➑️ πŸ“· My Solution These techniques help when dealing with multiple data sources!
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
Creating New Features πŸ› οΈ Feature Engineering I added: βœ… FamilySize = (sibsp + parch + 1) βœ… FarePerPerson = Fare Γ· FamilySize πŸ“· Question ➑️ πŸ“· My Solution Why? These features give new insights into passengers’ social & economic backgrounds!
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
πŸ’° Outliers distort averages! I detected extreme fare prices using the IQR method and capped them instead of removing. πŸ“· Question ➑️ πŸ“· My Solution Capping ensures we keep all data while limiting extreme values! πŸ›³οΈ
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
πŸ‘€ Data transformation step! Instead of 1, 2, 3, I converted Pclass into "1st Class", "2nd Class", "3rd Class" for better readability. πŸ“· Question ➑️ πŸ“· My Solution Why? Clear labels improve data storytelling! πŸ“Š
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
πŸ” Duplicate records skew analysis! Using drop_duplicates(), I checked and removed any duplicates in Titanic data. πŸ“· Question ➑️ πŸ“· My Solution Have you ever encountered duplicate headaches? 🀯
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
You may not know what to do with missing values... πŸ€” Drop or Fill? dropna() – Remove missing data (good if there’s little missing) fillna() – Replace missing values (mean, median, etc.) I used the median for Age to avoid outliers! πŸ“·
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
Finding Missing Data πŸ” Identifying missing values in the Titanic dataset using Pandas: πŸ“· Question ➑️ πŸ“· My Solution Missing values can break analysisβ€”step 1 is always detection!
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
🧼 Why is data cleaning important? Missing values can bias analysis πŸ“‰ Duplicates distort insights πŸ”„ Outliers skew statistics πŸ“Š A clean dataset = better decisions! βœ…
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
πŸš€ Week 5 was all about Data Cleaning & Transformation with Pandas! From handling missing values to merging DataFrames, this was a deep dive into real-world data prep. Let’s break it down! πŸ§΅πŸ‘‡
11
0
0
@DataguyPhill
Phillip Manywanda
11 months
@DabereNnamani @TDataImmersed @JacobAjala That wraps up my Week 3 highlights! 🐍 Want to explore the complete code and dive into more details? Check it out here: 🌐 https://t.co/VRhcj8ulP5 What was your favorite part? Let’s discuss! ✨
anaconda.com
0
0
0
@DataguyPhill
Phillip Manywanda
11 months
@DabereNnamani @TDataImmersed @JacobAjala πŸ“Š NumPy Adventures NumPy made math magical! I: Built and manipulated 1D/2D arrays Found fare stats (min, max, mean) for Titanic data Explored indexing and random arrays 🎲✨ πŸ“· Questions ➑️ πŸ“· My Solutions How do YOU use NumPy? Let me know! 🐍
0
0
0