
LLMs for Engineers

CHEG 667-013 — Chemical Engineering with Computers
Department of Chemical and Biomolecular Engineering, University of Delaware

A hands-on workshop on Large Language Models and machine learning for engineers. Learn how to train a GPT from scratch, run local models, and build retrieval-augmented generation systems, then connect these tools back to the underlying machine learning methods by implementing a simple neural network.

Sections

  01. nanoGPT: Train a small transformer on Shakespeare. Explore model parameters, temperature, and text generation.
  02. Local models with Ollama: Run pre-trained LLMs locally. Summarize documents, query arXiv, generate code, build custom models.
  03. Retrieval-Augmented Generation: Build a RAG system: chunk documents, embed them, and query with an LLM grounded in your own data.
  04. Advanced retrieval: Hybrid BM25 + vector search with cross-encoder re-ranking. Compares summarization versus raw retrieval.
  05. Building a neural network: Implement a one-hidden-layer network from scratch in numpy, then in PyTorch. Fits C_p(T) data for N₂.
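To give a flavor of section 05, here is a rough sketch (illustrative only, not the workshop's own code) of fitting a one-hidden-layer network with plain gradient descent in numpy. A synthetic quadratic stands in for the N₂ C_p(T) data used in the actual module:

```python
import numpy as np

# Toy one-hidden-layer network trained by full-batch gradient descent.
# A synthetic quadratic stands in for the workshop's C_p(T) data.
rng = np.random.default_rng(0)
x = np.linspace(-1.0, 1.0, 64).reshape(-1, 1)
y = x**2  # stand-in target function

n_hidden = 16
W1 = rng.normal(0.0, 0.5, (1, n_hidden))
b1 = np.zeros(n_hidden)
W2 = rng.normal(0.0, 0.5, (n_hidden, 1))
b2 = np.zeros(1)

lr = 0.1
for _ in range(5000):
    h = np.tanh(x @ W1 + b1)       # hidden activations, shape (64, 16)
    y_hat = h @ W2 + b2            # linear output, shape (64, 1)
    err = y_hat - y
    # Backpropagate the mean-squared-error gradient
    gW2 = h.T @ err / len(x)
    gb2 = err.mean(axis=0)
    dh = (err @ W2.T) * (1.0 - h**2)   # tanh'(z) = 1 - tanh(z)^2
    gW1 = x.T @ dh / len(x)
    gb1 = dh.mean(axis=0)
    W1 -= lr * gW1; b1 -= lr * gb1
    W2 -= lr * gW2; b2 -= lr * gb2

mse = float((err**2).mean())
print(f"final MSE: {mse:.5f}")
```

The same model is only a few lines in PyTorch, which is the comparison section 05 draws.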

Prerequisites

  • A terminal (macOS/Linux, or WSL on Windows)
  • Python 3.10+
  • Basic comfort with the command line
  • Ollama (sections 02–04)

Getting started

Clone this repository and work through each section in order:

git clone https://lem.che.udel.edu/git/furst/llm-workshop.git
cd llm-workshop

Each section has its own README.md with a full walkthrough, exercises, and any code or data needed.
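The core retrieval idea in sections 03 and 04 can be sketched in a few lines (illustrative only, not the workshop code): turn each chunk into a vector, then rank chunks by cosine similarity to the query. Here a hashed bag-of-words stands in for a real embedding model:

```python
import zlib
import numpy as np

def embed(text, dim=64):
    """Hashed bag-of-words vector, normalized to unit length.
    A stand-in for a real embedding model."""
    v = np.zeros(dim)
    for word in text.lower().split():
        v[zlib.crc32(word.strip(".,").encode()) % dim] += 1.0
    norm = np.linalg.norm(v)
    return v / norm if norm else v

chunks = [
    "Transformers use attention to relate tokens in a sequence.",
    "Heat capacity of nitrogen varies with temperature.",
    "Ollama runs pre-trained language models locally.",
]
query = "run models locally with ollama"
# Cosine similarity = dot product of unit vectors
scores = [float(embed(query) @ embed(c)) for c in chunks]
best = chunks[int(np.argmax(scores))]
print(best)
```

In the workshop itself the retrieved chunks are passed to an LLM as context, which is what grounds its answers in your own data.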

Python environment

Install uv (a fast Python package manager), then:

uv sync

This creates a .venv/ virtual environment and installs all dependencies from the lock file. To run scripts:

uv run python 05-neural-networks/nn_torch.py

Or activate the environment directly:

source .venv/bin/activate
python 05-neural-networks/nn_torch.py

License

MIT

Author

Eric M. Furst, University of Delaware