🦜 Towards Data Science - Medium
@towardsdatascience.com.source.rss----7f60cf5620c9---4@rss-parrot.net
I'm an automated parrot! I relay a website's RSS feed to the Fediverse. Every time a new post appears in the feed, I toot about it. Follow me to get all new posts in your Mastodon timeline!
Brought to you by the RSS Parrot.
---
Your home for data science. A Medium publication sharing concepts, ideas and codes. - Medium
Your feed and you don't want it here? Just
e-mail the birb.
Which Regularizer Should You Actually Use? Lessons from 134,400 Simulations
https://towardsdatascience.com/which-regularizer-should-you-actually-use-lessons-from-134400-simulations/
Published: May 2, 2026 15:00
A practitioner's decision framework for Ridge, Lasso, and ElasticNet based on three quantities you can compute before fitting a model
The post Which Regularizer Should You Actually Use? Lessons from 134,400 Simulations appeared first on Towards Data…
How a 2021 Quantization Algorithm Quietly Outperforms Its 2026 Successor
https://towardsdatascience.com/how-a-2021-quantization-algorithm-quietly-outperforms-its-2026-successor/
Published: May 2, 2026 13:00
One scale parameter determines accuracy in rotation-based vector quantization.
The post How a 2021 Quantization Algorithm Quietly Outperforms Its 2026 Successor appeared first on Towards Data Science.
How to Get Hired in the AI Era
https://towardsdatascience.com/how-to-get-hired-in-the-ai-era/
Published: May 1, 2026 16:30
What people actually look for when hiring juniors that stand out.
The post How to Get Hired in the AI Era appeared first on Towards Data Science.
Churn Without Fragmentation: How a Party-Label Bug Reversed My Headline Finding
https://towardsdatascience.com/fractured-local-britain-voter-volatility-in-english-councils-2018-2022/
Published: May 1, 2026 15:00
A data quality case study from English local elections on categorical normalisation, metric validation, and why raw labels should never define analytical groups.
The post Churn Without Fragmentation: How a Party-Label Bug Reversed My Headline Finding…
Ghost: A Database for Our Times?
https://towardsdatascience.com/ghost-a-database-for-our-times/
Published: May 1, 2026 13:30
The first database built for AIÂ Agents
The post Ghost: A Database for Our Times? appeared first on Towards Data Science.
Why Powerful Machine Learning Is Deceptively Easy
https://towardsdatascience.com/why-powerful-ml-is-deceptively-easy/
Published: May 1, 2026 12:00
Or why what appears powerful can be methodologically fragile
The post Why Powerful Machine Learning Is Deceptively Easy appeared first on Towards Data Science.
A Gentle Introduction to Stochastic Programming
https://towardsdatascience.com/a-gentle-introduction-to-stochastic-programming/
Published: April 30, 2026 16:30
How to make decisions when your spreadsheet is lying about the future
The post A Gentle Introduction to Stochastic Programming appeared first on Towards Data Science.
Proxy-Pointer RAG: Multimodal Answers Without Multimodal Embeddings
https://towardsdatascience.com/proxy-pointer-rag-multimodal-answers-without-multimodal-embeddings/
Published: April 30, 2026 15:00
Structure is all you need
The post Proxy-Pointer RAG: Multimodal Answers Without Multimodal Embeddings appeared first on Towards Data Science.
How to Study the Monotonicity and Stability of Variables in a Scoring Model using Python
https://towardsdatascience.com/how-to-study-the-monotonicity-and-stability-of-variables-in-a-scoring-model-using-python/
Published: April 30, 2026 13:30
How can you validate that your variables tell a consistent risk?
The post How to Study the Monotonicity and Stability of Variables in a Scoring Model using Python appeared first on Towards Data Science.
Why AI Engineers Are Moving Beyond LangChain to Native Agent Architectures
https://towardsdatascience.com/why-ai-engineers-are-moving-beyond-langchain-to-native-agent-architectures/
Published: April 30, 2026 12:00
Frameworks accelerated the first wave of LLM apps, but production demands a different architecture.
The post Why AI Engineers Are Moving Beyond LangChain to Native Agent Architectures appeared first on Towards Data Science.
4 YAML Files Instead of PySpark: How We Let Analysts Build Data Pipelines Without Engineers
https://towardsdatascience.com/4-yaml-files-instead-of-pyspark-how-we-let-analysts-build-data-pipelines-without-engineers/
Published: April 29, 2026 16:30
How we replaced Python pipelines with dlt, dbt, and Trino — and cut delivery time from weeks to one day.
The post 4 YAML Files Instead of PySpark: How We Let Analysts Build Data Pipelines Without Engineers appeared first on Towards Data Science.
Ensembles of Ensembles of Ensembles: A Guide to Stacking
https://towardsdatascience.com/ensembles-of-ensembles-of-ensembles/
Published: April 29, 2026 15:00
The best machine learning model is not one model
The post Ensembles of Ensembles of Ensembles: A Guide to Stacking appeared first on Towards Data Science.
Agentic AI: How to Save on Tokens
https://towardsdatascience.com/agentic-ai-how-to-save-on-tokens/
Published: April 29, 2026 13:30
Caching, lazy-loading, routing, compaction, and more
The post Agentic AI: How to Save on Tokens appeared first on Towards Data Science.
System Design Series: Apache Flink from 10,000 Feet, and Building a Flink-powered Recommendation Engine
https://towardsdatascience.com/system-design-series-apache-flink-from-10000-feet-and-building-a-flink-powered-recommendation-engine/
Published: April 29, 2026 12:00
A deep dive into how Apache Flink works, why it exists, and learning it while building a real-time recommendation engine
The post System Design Series: Apache Flink from 10,000 Feet, and Building a Flink-powered Recommendation Engine appeared first on…
Let the AI Do the Experimenting
https://towardsdatascience.com/let-the-ai-do-the-experimenting/
Published: April 28, 2026 16:30
Using autoresearch to optimise marketing campaigns under budget constraints
The post Let the AI Do the Experimenting appeared first on Towards Data Science.
Correlation Doesn’t Mean Causation! But What Does It Mean?
https://towardsdatascience.com/correlation-doesnt-mean-causation-but-what-does-it-mean/
Published: April 28, 2026 15:00
What does correlation tells us?
The post Correlation Doesn’t Mean Causation! But What Does It Mean? appeared first on Towards Data Science.
The Next Frontier of AI in Production Is Chaos Engineering
https://towardsdatascience.com/the-next-frontier-of-ai-in-production-is-chaos-engineering/
Published: April 28, 2026 13:30
Blast-radius control tells you how much to break. Intent tells you what breaking it will teach. Only one of these has mature tooling.
The post The Next Frontier of AI in Production Is Chaos Engineering appeared first on Towards Data Science.
PyTorch NaNs Are Silent Killers — So I Built a 3ms Hook to Catch Them at the Exact Layer
https://towardsdatascience.com/pytorch-nans-are-silent-killers-i-built-a-3ms-hook-to-catch-them-at-the-exact-layer/
Published: April 28, 2026 12:00
NaNs don’t crash your training — they quietly destroy it.
After losing hours to a silent failure in a ResNet training run, I built a lightweight detector that pinpoints the exact layer and batch where things break. Using forward hooks and gradient checks,…
A Career in Data Is Not Always a Straight Line, and That’s Okay
https://towardsdatascience.com/a-career-in-data-is-not-always-a-straight-line-and-thats-okay/
Published: April 27, 2026 16:30
Sabrine Bendimerad on why flexibility is a crucial data science skill, the risks of outsourcing human thinking to AI agents, and the changing terrain of career paths today.
The post A Career in Data Is Not Always a Straight Line, and That’s Okay appeared…
How Spreadsheets Quietly Cost Supply Chains Millions
https://towardsdatascience.com/how-spreadsheets-quietly-cost-supply-chains-millions/
Published: April 27, 2026 14:00
A simulation of how a single forecast change moves through five planning teams, and why most retailers lose money in the gap between Sales and Stores.
The post How Spreadsheets Quietly Cost Supply Chains Millions appeared first on Towards Data Science.
Comparing Explicit Measures to Calculation Groups in Tabular Models
https://towardsdatascience.com/comparing-explicit-measures-to-calculation-groups-in-tabular-models/
Published: April 27, 2026 12:00
With the advent of UDFs and their combination with calculation groups, I see a lot of discussion about not creating explicit measures but instead offering calculation groups to report creators.
The post Comparing Explicit Measures to Calculation Groups in…
Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning
https://towardsdatascience.com/bytes-speak-all-languages-cross-script-name-retrieval-via-contrastive-learning/
Published: April 26, 2026 15:00
Why learn 8 scripts when you can learn 256 bytes?
The post Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning appeared first on Towards Data Science.
I Reduced My Pandas Runtime by 95% — Here’s What I Was Doing Wrong
https://towardsdatascience.com/i-reduced-my-pandas-runtime-by-95-heres-what-i-was-doing-wrong/
Published: April 26, 2026 13:00
Most slow Pandas code "works", until it doesn't. Learn how to spot hidden bottlenecks, avoid costly row-wise operations, and know when Pandas is no longer enough.
The post I Reduced My Pandas Runtime by 95% — Here’s What I Was Doing Wrong appeared first on…