Posts
All the articles I've posted.
GenAI Interview Prep
Deep-dive series for ML engineers preparing for GenAI and LLM engineering roles.
GenZ to AI Enz
A series on AI engineering for CS students and early-career engineers.
-
Bias, Variance, and the Tradeoff Every Model Faces
Why models fail in two opposite ways — being too rigid or too sensitive — and how to find the sweet spot between them.
-
Dropout and Overfitting: Teaching a Network Not to Cheat
What overfitting is, why it happens, and how dropout stops a network from memorising the training data.
-
Transformer Architecture & Key Design Decisions
A deep dive into the transformer architecture, why decoder-only models won, and the key design decisions — RoPE, GQA, Flash Attention, MoE — that define every modern LLM.
-
Normalization: BatchNorm, LayerNorm, and Why Transformers Need a Different One
Why activations drift as they pass through deep networks, and how BatchNorm and LayerNorm fix it in different ways.