KDnuggets
@kdnuggets
Followers
220K
Following
876
Media
32K
Statuses
85K
Data Science • Machine Learning • AI • Analytics • Founded by Gregory Piatetsky-Shapiro • Edited by @mattmayo13 • KD stands for Knowledge Discovery
San Juan, PR
Joined February 2009
In 2025, analysts shifted from building monolithic dashboards to creating composable data products, turning repeated analyses into reusable building blocks. #statology
statology.org
In 2025, analysts shifted from building monolithic dashboards to creating composable data products, turning repeated analyses into reusable building blocks.
6
1
11
Learn Docker by doing with five beginner-friendly projects covering hosting, multi-container apps, CI, and monitoring.
kdnuggets.com
Learn Docker by doing with five beginner-friendly projects covering hosting, multi-container apps, CI, and monitoring.
0
1
0
Discover the data skills that truly mattered to hiring managers in 2025 beyond technical tools. #statology
statology.org
Discover the data skills that truly mattered to hiring managers in 2025 beyond technical tools.
6
0
9
In 2025, “using AI” no longer just means chatting with a model, and you’ve probably already noticed that shift yourself. #teachthemachine
machinelearningmastery.com
Explore the top 5 LLM models powering autonomous AI agents in 2025, from OpenAI o1 to open-source alternatives.
0
1
1
Best OCR and vision language models you can run locally that transform documents, tables, and diagrams into flawless markdown copies with benchmark-crushing accuracy.
kdnuggets.com
Best OCR and vision language models you can run locally that transform documents, tables, and diagrams into flawless markdown copies with benchmark-crushing accuracy.
0
2
7
This article is divided into three parts; they are: • Understanding the Architecture of Llama or GPT Model • Creating a Llama or GPT Model for Pretraining • Variations in the Architecture The architectur... #teachthemachine
machinelearningmastery.com
Natural language generation (NLG) is challenging because human language is complex and unpredictable. A naive approach of generating words randomly one by one would not be meaningful to humans....
6
0
6
How can we reason with uncertainty and make smarter decisions from data? This article explains the key probability ideas in data science.
kdnuggets.com
How can we reason with uncertainty and make smarter decisions from data? This article explains the key probability ideas in data science.
4
0
8
Learn how to use conditional formatting in Excel with practical examples from basic highlighting to advanced formula-driven rules. #statology
statology.org
Learn how to use conditional formatting in Excel with practical examples from basic highlighting to advanced formula-driven rules.
1
1
3
How AI Cuts Costs and Adds Value for Data Science Workflows (Sponsored) #teachthemachine
0
1
0
Looking ahead to 2026, the most impactful trends are not flashy frameworks but structural changes in how data pipelines are designed, owned, and operated.
kdnuggets.com
Looking ahead to 2026, the most impactful trends are not flashy frameworks but structural changes in how data pipelines are designed, owned, and operated.
1
0
1
Data leakage is an often accidental problem that may happen in machine learning modeling. #teachthemachine
machinelearningmastery.com
In this article, you will learn what data leakage is, how it silently inflates model performance, and practical patterns for preventing it across common workflows.
5
1
9
This article explains how Gistr transforms the way data professionals interact with their most valuable asset: their accumulated knowledge.
kdnuggets.com
This article explains how Gistr transforms the way data professionals interact with their most valuable asset: their accumulated knowledge.
4
0
6
This is a list of top LLM and VLMs that are fast, smart, and small enough to run locally on devices as small as a Raspberry Pi or even a smart fridge.
kdnuggets.com
This is a list of top LLM and VLMs that are fast, smart, and small enough to run locally on devices as small as a Raspberry Pi or even a smart fridge.
2
1
5
Discover which statistical methods dominate healthcare, finance, tech, retail, and sports analytics, and why these industry-method pairings became indispensable standards. #statology
statology.org
Discover which statistical methods dominate healthcare, finance, tech, retail, and sports analytics, and why these industry-method pairings became indispensable standards.
1
1
2
Machine learning models possess a fundamental limitation that often frustrates newcomers to natural language processing (NLP): they cannot read. #teachthemachine
machinelearningmastery.com
In this article, you will learn practical ways to convert raw text into numerical features that machine learning models can use, ranging from statistical counts to semantic and contextual embeddings.
0
0
3
"Maybe next quarter" is the most expensive phrase in data strategy. Silos inflate time-to-insight, waste top-tier talent, and kill models before they're built. Here’s how cloud + AI automation eliminate the silo tax: https://t.co/xqXo0PKTsY In #partnership with Ingram Micro
6
0
7
This article is divided into four parts; they are: • How Logits Become Probabilities • Temperature • Top- k Sampling • Top- p Sampling When you ask an LLM a question, it outputs a vector of logits. #teachthemachine
machinelearningmastery.com
Large Language Models (LLMs) can produce varied, creative, and sometimes surprising outputs even when given the same prompt. This randomness is not a bug but a core feature of how the model samples...
7
0
8
Your data scientists aren't slow. Your data architecture is. Up to 80% of their time is lost to cleaning, wrangling, and reconciling siloed data. That's not a delay cost. It's an innovation tax. Read more on the real cost of inaction: https://t.co/xqXo0PKTsY In #partnership with
3
0
3
From daily weather measurements or traffic sensor readings to stock prices, time series data are present nearly everywhere. #teachthemachine
machinelearningmastery.com
Training and comparing two robust deep learning architecture for a single, common time series analysis task: all step-by-step.
1
0
1
Spending too much time on repetitive tasks? These Python scripts will help you automate the mundane stuff that drains your productivity.
kdnuggets.com
Spending too much time on repetitive tasks? These Python scripts will help you automate the mundane stuff that drains your productivity.
0
1
1