arXiv frequently returns 503 to default Python urllib clients. Set a
real User-Agent header and retry on 429/5xx with exponential backoff.
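A minimal sketch of the approach described above, using only the standard library. The User-Agent string, retry count, and delay values are assumptions chosen for illustration; the commit does not state what fetch_arxiv.py actually uses, and the `opener`/`sleep` parameters exist only so the retry logic can be exercised without network access.

```python
import time
import urllib.error
import urllib.request

# Assumed value: any descriptive, non-default User-Agent should do.
USER_AGENT = "course-fetch-arxiv/0.1 (mailto:you@example.com)"


def backoff_delays(max_retries=5, base=1.0, cap=60.0):
    """Waits before each retry: base, 2*base, 4*base, ... capped at `cap` seconds."""
    return [min(cap, base * 2 ** attempt) for attempt in range(max_retries)]


def should_retry(status):
    """Retry on 429 (rate limited) and on any 5xx server error."""
    return status == 429 or 500 <= status < 600


def fetch(url, max_retries=5, base=1.0,
          opener=urllib.request.urlopen, sleep=time.sleep):
    """GET `url` with an explicit User-Agent, retrying on 429/5xx."""
    request = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    delays = backoff_delays(max_retries, base)
    for attempt in range(max_retries + 1):
        try:
            with opener(request, timeout=30) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # Give up on non-retryable statuses or once retries are exhausted.
            if not should_retry(err.code) or attempt == max_retries:
                raise
            sleep(delays[attempt])
```

Calling `fetch(url)` with the defaults waits 1, 2, 4, 8, then 16 seconds between attempts before re-raising the last error.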
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- 03-rag, 04-semantic-search: env-var-before-imports fix in build/query scripts
- 03-rag: new libraries section, fetch_arxiv.py, exercises for larger corpus
  and finding current SOTA models, formal references (Lewis, Booth)
- 04-semantic-search: libraries pointer back to Part III, larger corpus
  subsection, model-update exercise, formal references
- 06-neural-networks: add Nielsen reference (recommended by student)
- README: vocab.md link, agentic systems in description, Ollama prereq for 02-05
- New: vocab.md (glossary organized by section)
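To illustrate why the env-var-before-imports ordering matters, here is a small self-contained sketch. It assumes the fix concerns a library that reads its configuration once, at import time. `demo_lib` and `DEMO_PARALLELISM` are hypothetical stand-ins, not names from the build/query scripts.

```python
import importlib
import os
import sys
import tempfile
from pathlib import Path

# Hypothetical stand-in for a library that reads an env var at import time.
LIB_SOURCE = (
    "import os\n"
    'PARALLELISM = os.environ.get("DEMO_PARALLELISM", "true")\n'
)


def import_demo_lib(set_env_first):
    """Import demo_lib with the env var set before or after the import."""
    os.environ.pop("DEMO_PARALLELISM", None)
    sys.modules.pop("demo_lib", None)
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "demo_lib.py").write_text(LIB_SOURCE)
        sys.path.insert(0, tmp)
        try:
            if set_env_first:
                os.environ["DEMO_PARALLELISM"] = "false"
            importlib.invalidate_caches()
            demo_lib = importlib.import_module("demo_lib")
            if not set_env_first:
                # Too late: the module already captured the default.
                os.environ["DEMO_PARALLELISM"] = "false"
            return demo_lib.PARALLELISM
        finally:
            sys.path.remove(tmp)
            sys.modules.pop("demo_lib", None)
            os.environ.pop("DEMO_PARALLELISM", None)
```

With `set_env_first=True` the module sees the override; with `False` it keeps its default, which is the failure mode the reordering avoids.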
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The LLM arc completes at section 05 (agentic systems), with
neural networks as a standalone ML deep-dive in section 06.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Scripts use relative paths for data files, so they must be run
from their own directory. Also link to PYTORCH.md for GPU setup.
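The following sketch shows the behaviour this note describes: a relative path resolves against the current working directory, not against the script's location. The file name `data/corpus.txt` is a hypothetical example, and the directory layout is recreated in a temporary folder.

```python
import os
import tempfile
from pathlib import Path

DATA_FILE = Path("data/corpus.txt")  # hypothetical relative data path


def data_file_visible_from(directory):
    """Report whether DATA_FILE resolves when the process runs in `directory`."""
    previous = os.getcwd()
    os.chdir(directory)
    try:
        return DATA_FILE.exists()
    finally:
        os.chdir(previous)


with tempfile.TemporaryDirectory() as repo_root:
    module_dir = Path(repo_root, "03-rag")
    (module_dir / "data").mkdir(parents=True)
    (module_dir / DATA_FILE).write_text("example\n")
    found_from_module_dir = data_file_visible_from(module_dir)
    found_from_repo_root = data_file_visible_from(repo_root)
```

Running from the module directory finds the file; running from the repository root does not.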
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Fix checkpoint directory name in 01-nanogpt
- Add generative text references (OUTPUT, Love Letters)
- Add PYTORCH.md troubleshooting (MPS, CUDA, WSL)
- Minor spacing fix in 02-ollama
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Use HTTPS for arXiv API (was returning 301 on HTTP)
- Point module 01 preliminaries to root uv sync instead of separate venv
- Clone nanoGPT into 01-nanogpt/ and add to .gitignore
- Add llama3.1:8B to module 02 models table
- Various editorial updates to modules 01 and 02
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>