Qwen3 From Scratch (#678)

* Qwen3 From Scratch * rev other file * upd * upd * upd * url fixes
2026-04-10 12:33:42 +00:00 · 2025-06-19 18:44:38 -05:00
parent e700c66b7a
commit 3d4bce6d57
10 changed files with 2640 additions and 6 deletions
--- a/README.md
+++ b/README.md
@@ -121,6 +121,7 @@ Several folders contain optional materials as a bonus for interested readers:
  - [Building a User Interface to Interact With the Pretrained LLM](ch05/06_user_interface)
  - [Converting GPT to Llama](ch05/07_gpt_to_llama)
  - [Llama 3.2 From Scratch](ch05/07_gpt_to_llama/standalone-llama32.ipynb)
+  - [Qwen3 From Scratch](ch05/11_qwen3/standalone-qwen3.ipynb)
  - [Memory-efficient Model Weight Loading](ch05/08_memory_efficient_weight_loading/memory-efficient-state-dict.ipynb)
  - [Extending the Tiktoken BPE Tokenizer with New Tokens](ch05/09_extending-tokenizers/extend-tiktoken.ipynb)
  - [PyTorch Performance Tips for Faster LLM Training](ch05/10_llm-training-speed)