Book Learning Path

01

Operators vs Architects

System thinking for machine learning

Draw two diagrams for any ML system you have used or built: the model view and the system view. Compare both and identify where you are still operating as an operator rather than an architect.

Open lab

02

Competitions vs Business

Moving from leaderboard thinking to product outcomes

Take a competition-style ML task and rewrite it as a business system. Define the user, decision, metric, failure mode, and production feedback loop.

Open lab

03

Feature Engineering

Turning raw data into useful model signal

Choose a dataset and design three feature groups: raw, transformed, and behavioural/contextual. Explain which feature could leak future information and how you would prevent it.

Open lab

04

Data Is Geometry

Understanding datasets through structure and distance

Run a PCA or UMAP-style analysis on a dataset and explain what the projection reveals, what it hides, and which clusters require additional investigation.

Open lab

05

Correlation, Causation, and Why Your Model Lies

Leakage, bias, and misleading patterns

Audit a model or dataset for leakage and misleading correlation. Find one feature or segment where the model could look good in training and fail in reality.

Open lab

06

The Interpretability Problem

Explaining model behaviour without pretending certainty

Prepare a model explanation for a technical reviewer and a non-technical stakeholder. Show what the explanation proves and what it does not prove.

Open lab

07

Evaluation as a Discipline

Evaluation beyond one headline metric

Build an evaluation scaffold with global metrics, segment metrics, and failure examples. Find where the model performs acceptably overall but poorly for a specific slice.

Open lab

08

Building Systems, Not Demos

Production readiness and operational thinking

Design a minimal production-ready ML pipeline: feature contracts, model registry, readiness gate, monitoring, and rollback path.

Open lab

09

The Modern ML Stack

Choosing tools by system responsibility

Assemble a minimal ML stack for one project. Explain what handles tracking, serving, monitoring, data quality, and experiment review.

Open lab

10

RAG Done Correctly

Retrieval as an evaluated system

Design a minimal RAG pipeline with chunking, embeddings, retrieval, and reranking. Define how you would measure Recall@5 and where the pipeline can fail.

Open lab

11

Agent Architectures: Hype vs Reality

Safe agent loops and control boundaries

Design a safe agent loop with hard limits, allow-list, logging, and fallback. Compare it with a simple prompt chain for the same task.

Open lab

12

Fine-Tuning vs Prompting

Cost, quality, and operational trade-offs

Compare a prompted baseline with a hypothetical fine-tuned variant. Define the evaluation protocol, expected quality gain, and ROI crossover point.

Open lab

13

Quantization and Efficient Inference

Latency, throughput, cost, and quality

Compare FP16, INT8, and INT4/AWQ as deployment choices. Explain the trade-off between speed, memory, and output quality for one use case.

Open lab

14

Building Your ML Machine

Local, cloud, and hybrid compute choices

Estimate your GPU usage pattern and decide between buying, renting, or hybrid compute. Write a break-even logic for your case.

Open lab

15

API Dependence vs Computational Sovereignty

Provider risk, routing, and auditability

Design a data classification policy and gateway routing rules for an AI product. Include audit logging and a fallback path for provider changes.

Open lab

16

The Math You Actually Need

Practical mathematical reasoning for ML engineers

Solve three practical reasoning tasks: tensor shapes in attention, A/B test power, and gradient debugging. Explain the engineering consequence of each result.

Open lab

17

The Data Science Layer

Data audit before model ambition

Run a data science audit: leakage, class imbalance, temporal split violations, missingness, and baseline sanity checks.

Open lab

18

Competitive Programming and ML Thinking

Algorithmic discipline inside ML systems

Translate an ML retrieval or ranking task into an algorithmic problem. Compare brute-force search with indexed search by latency and recall.

Open lab

19

How Big Tech Evaluates Engineers

Structured ML system design communication

Take one ML system design prompt and answer it through six steps: objective, data, model, evaluation, serving, monitoring.

Open lab

20

What You Build Next

Turning the book into a concrete project path

Choose one project path after the book. Define the first two weeks: objective, dataset, prototype, evaluation, and evidence you will collect.

Open lab

Machine Learning companion

Your next steps

Operators vs Architects

Competitions vs Business

Feature Engineering

Data Is Geometry

Correlation, Causation, and Why Your Model Lies

The Interpretability Problem

Evaluation as a Discipline

Building Systems, Not Demos

The Modern ML Stack

RAG Done Correctly

Agent Architectures: Hype vs Reality

Fine-Tuning vs Prompting

Quantization and Efficient Inference

Building Your ML Machine

API Dependence vs Computational Sovereignty

The Math You Actually Need

The Data Science Layer

Competitive Programming and ML Thinking

How Big Tech Evaluates Engineers

What You Build Next