GenAI Interview Prep
A deep-dive series for experienced ML engineers preparing for GenAI and LLM engineering roles — from transformer internals to production inference, RAG, agents, and system design.
Transformer Architecture & Key Design Decisions
A deep dive into the transformer architecture, why decoder-only models won, and the key design decisions (RoPE, GQA, Flash Attention, MoE) that shape most modern LLMs.