ML-Affairs Blog
Production notes on ML systems, streaming architectures, and the gap between research and reality
ML Engineering Needs A Taxonomy
Why ML Engineer, Data Scientist, Data Engineer, and Software Engineer are still used too ambiguously, why that damages hiring, ownership, and delivery quality, and why the industry needs clearer role definitions.
Coding Got Cheap. Verification Did Not.
Why AI coding tools increase code supply faster than teams can verify it, and why smaller PRs, merge queues, property-based tests, static analysis, and explicit guarantees matter more than hype.
Kafka Streams vs Flink Is The Wrong Question
A production-minded comparison of Kafka Streams and Flink that focuses on state, recovery, rescaling, and platform boundaries.
PyFlink In 2026: Better Than Its Reputation, Still Not Frictionless
Why PyFlink becomes attractive once Python training and Java prediction start drifting apart, and where the JVM/runtime boundary still costs you.
From Model Validation To Pipeline Validation
Why retrospective ML evaluation needs pipeline validation, not just model validation, and how future leakage can distort backtests.
Harmonizing Avro and Python: A Dance of Data Classes
Why Avro schema discipline matters in data engineering and how to generate Python data classes from Avro schemas.
Agile In Action: Bridging Data Science and Engineering
What Agile looked like to me in 2023 at Vortexa: helping data scientists and engineers learn together, communicate clearly, and ship ML systems that can survive production.
Dynamic(i/o) Why you should start your ML-Ops journey with wrapping your I/O
Why ML pipelines need I/O abstraction, and how wrapping file and storage concerns improves maintainability, testing, and delivery.
Complete Guide to Python Envs (MacOS)
A practical guide to Python environments on macOS, including system Python pitfalls and the setup patterns that actually work.
A BREXIT NLP Dataset!
How I built a BREXIT-related NLP dataset and why real-world labeling and collection choices matter in applied NLP.
Style Transfer in Heraklion
An intuitive explanation of neural style transfer, grounded in Heraklion imagery and the core mechanics behind the model.
Agile Data Science
Why agile matters in data science work, where experimentation meets delivery, and how teams can move from prototypes to value.
AWS ML Certification
A practical guide to the AWS Machine Learning Specialty exam, including difficulty, study scope, and how long preparation usually takes.
Just do it!
A short reflection on why public writing matters, why criticism is not the point, and why you should publish before you feel ready.
