Tag: llm
All the articles with the tag "llm".
-
Transformer Architecture & Key Design Decisions
A deep dive into the transformer architecture, why decoder-only models won, and the key design decisions — RoPE, GQA, Flash Attention, MoE — that define every modern LLM.
-
GenZ to AI Enz: Series Index
Full table of contents for the GenZ to AI Enz series - every post and walkthrough in order.
-
GenZ to AI Enz: A Roadmap for CS Grads Breaking into AI
A complete series taking CS students and early-career engineers from zero ML knowledge to building real AI systems with LLMs and agents.
-
Fine-tuning Phi-2 with DPO on the Anthropic HH Dataset
Fine-tuning Microsoft's Phi-2 using Direct Preference Optimization (DPO) on the Anthropic Helpful and Harmless dataset with LoRA and 8-bit quantization.