Start Here: Learning Paths

If you are new to this site, the archive can feel large. This page groups the main books, articles, courses, and reference pages into a few learning paths so you can pick the route that matches what you want to understand next.

Build LLMs from scratch: architecture, tokenization, training, and PyTorch implementation.

Reasoning models and advanced training: reinforcement learning, inference-time scaling, distillation, and evaluation.

Compare LLM architectures: attention variants, MoE, KV cache, and modern model families.

Learn practical ML and PyTorch: applied machine learning, deep learning, and course material.

Learning Paths

Start with Build a Large Language Model (From Scratch) for the book and course links.
Use the LLMs-from-scratch code repository as the main implementation companion.
Read the BPE tokenizer article before diving into tokenization details.
Work through self-attention from scratch to build the core mental model.
Add the KV cache article once you start thinking about inference efficiency.

Read Understanding Reasoning LLMs for the high-level map of reasoning methods.
Read the inference-time scaling overview for test-time compute methods.
Read the RL for reasoning article for GRPO, reinforcement learning, and advanced post-training ideas.
Start with Build a Reasoning Model (From Scratch) for the book and repository links.
Use the reasoning-from-scratch repository as the main implementation companion.

Browse the LLM Architecture Gallery for a visual overview of model families.
Read The Big LLM Architecture Comparison for the main architecture narrative.
Read the visual guide to attention variants for MHA, GQA, MLA, sparse attention, and hybrids.
Read the workflow article to see how to inspect new open-weight releases.
Read the gallery announcement for context on how the reference collection is organized.

Use Machine Learning with PyTorch and Scikit-Learn for a full applied path.
Try PyTorch in One Hour for a compact refresher.
Study the university deep learning course for a longer lecture series and accompanying course material.
Use the deep learning resources page for older but still useful model notebooks and references.
Read Machine Learning Q and AI for concise explanations of common modern ML and AI concepts.

Shortcuts

Latest blog and notes archive
All books
LLM Architecture Gallery
Courses
More resources