13 commits

Author SHA1 Message Date
Eric Furst
6a03a0b9d1 fetch_arxiv: retry on transient errors, set User-Agent
arXiv frequently returns 503 to default Python urllib clients. Set a
real User-Agent header and retry on 429/5xx with exponential backoff.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-05-01 10:25:40 -04:00
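The retry behaviour described in this commit might look roughly like the sketch below. It is not the code from fetch_arxiv.py: the function name, the User-Agent string, the attempt count, and the delays are all assumptions made for illustration. The opener and sleep parameters exist only so the logic can be exercised without network access.

```python
import time
import urllib.error
import urllib.request

# Hypothetical identifying string; the real script's value is not shown in the log.
USER_AGENT = "llm-workshop-fetch/0.1 (mailto:you@example.org)"


def is_retryable(status):
    """Treat 429 (rate limited) and any 5xx response as transient."""
    return status == 429 or 500 <= status < 600


def fetch_with_retry(url, max_attempts=5, base_delay=1.0,
                     opener=urllib.request.urlopen, sleep=time.sleep):
    """Fetch url, retrying transient HTTP errors with exponential backoff.

    max_attempts must be at least 1. Non-HTTP failures such as
    urllib.error.URLError are not retried in this sketch.
    """
    request = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    for attempt in range(max_attempts):
        try:
            with opener(request, timeout=30) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # Give up on permanent errors or once the attempts are used up.
            if not is_retryable(err.code) or attempt == max_attempts - 1:
                raise
        # Wait base_delay, then 2x, 4x, ... before the next attempt.
        sleep(base_delay * 2 ** attempt)
```

With the defaults above, a call such as fetch_with_retry("https://export.arxiv.org/api/query?search_query=all:rheology") would wait 1, 2, 4, and 8 seconds between its five attempts before re-raising the last error.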
Eric Furst
59e5f86884 Sync RAG and semantic-search updates from che-computing
- 03-rag, 04-semantic-search: env-var-before-imports fix in build/query scripts
- 03-rag: new libraries section, fetch_arxiv.py, exercises for larger corpus
  and finding current SOTA models, formal references (Lewis, Booth)
- 04-semantic-search: libraries pointer back to Part III, larger corpus
  subsection, model-update exercise, formal references
- 06-neural-networks: add Nielsen reference (recommended by student)
- README: vocab.md link, agentic systems in description, Ollama prereq for 02-05
- New: vocab.md (glossary organized by section)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-28 12:05:08 -04:00
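The "env-var-before-imports" fix mentioned above is not spelled out in the log. One plausible reading is that some libraries read environment variables once, at import time, so a variable set after the import has no effect. The sketch below illustrates that general behaviour with a stand-in module; DEMO_PARALLELISM and demo_library are hypothetical names, not the variables or libraries used in the build/query scripts.

```python
import os
import types

# Source of a stand-in "library" that reads its setting when it is imported.
LIB_SOURCE = (
    "import os\n"
    "PARALLEL = os.environ.get('DEMO_PARALLELISM', 'true')\n"
)


def import_demo_library():
    """Simulate a fresh import of a library that configures itself at import."""
    module = types.ModuleType("demo_library")
    exec(LIB_SOURCE, module.__dict__)
    return module


# Wrong order: import first, set the variable afterwards.
os.environ.pop("DEMO_PARALLELISM", None)
late = import_demo_library()
os.environ["DEMO_PARALLELISM"] = "false"   # too late, the module already read it

# Right order: the variable is set before the import runs.
early = import_demo_library()

print(late.PARALLEL, early.PARALLEL)
```

In a real script the same ordering would mean placing the os.environ assignment above the import lines of the affected library.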
Eric Furst
b37661e983 Update description and Ollama prerequisite for tool-use section
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-11 10:58:31 -04:00
Eric Furst
cab2ebfd9d Reorder: tool use is now 05, neural networks is 06
The LLM arc completes at section 05 (agentic systems), with
neural networks as a standalone ML deep-dive in section 06.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-11 10:54:03 -04:00
Eric Furst
aee8ecd7b8 Add PyTorch note and cd-into-directory instructions
Scripts use relative paths for data files, so they must be run
from their own directory. Also link to PYTORCH.md for GPU setup.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-10 20:44:24 -04:00
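The reason for the cd-into-directory instruction can be shown with a small sketch: a relative file name is resolved against the current working directory, not against the location of the script. The directory and file names below are hypothetical stand-ins created in a temporary folder.

```python
import os
import tempfile
from pathlib import Path

DATA_NAME = "demo_data.txt"  # hypothetical data file name


def can_see_data():
    """True if the relative name points at an existing file from the cwd."""
    return Path(DATA_NAME).exists()


with tempfile.TemporaryDirectory() as tmp:
    module_dir = Path(tmp) / "demo-module"   # stand-in for a module directory
    module_dir.mkdir()
    (module_dir / DATA_NAME).write_text("example\n")

    start = Path.cwd()
    try:
        os.chdir(tmp)                  # run from the parent directory
        seen_from_parent = can_see_data()
        os.chdir(module_dir)           # run after cd-ing into the module
        seen_from_module = can_see_data()
    finally:
        os.chdir(start)                # restore the original directory

print(seen_from_parent, seen_from_module)
```

The data file is found only in the second case, which matches the advice in the commit message to run each script from its own directory.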
Eric Furst
794cdaea0d Sync changes from che-computing
- Fix checkpoint directory name in 01-nanogpt
- Add generative text references (OUTPUT, Love Letters)
- Add PYTORCH.md troubleshooting (MPS, CUDA, WSL)
- Minor spacing fix in 02-ollama

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-10 09:50:42 -04:00
Eric
564e75b824 Minor edits
2026-04-07 07:46:42 -04:00
Eric
2902e34256 NN lecture updates
- add noisy data fit to README
- add noisy data notebook
- add noisy standalone python script
- References and edits to README
2026-04-06 15:54:41 -04:00
Eric
896570f71c Cleanup edits to module 01 and 05 walkthroughs.
2026-04-02 12:55:14 -04:00
Eric
e10e411e41 Update module docs: fix arXiv URL, uv setup, nanoGPT clone path
- Use HTTPS for arXiv API (was returning 301 on HTTP)
- Point module 01 preliminaries to root uv sync instead of separate venv
- Clone nanoGPT into 01-nanogpt/ and add to .gitignore
- Add llama3.1:8B to module 02 models table
- Various editorial updates to modules 01 and 02

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-01 22:25:42 -04:00
Eric
7e4f0fb80b Add uv for dependency management and update workshop materials
2026-03-31 12:03:34 -04:00
Eric
4c88157a8e Update venv directory
2026-03-28 20:59:25 -04:00
Eric
1604671d36 Initial commit: LLM workshop materials
Five modules covering nanoGPT, Ollama, RAG, semantic search, and neural networks.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-28 07:11:01 -04:00