Qwen3 and evaluation bonus materials (#869)

2026-04-10 12:33:42 +00:00 · 2025-10-08 18:22:19 -05:00
parent 7bd263144e
commit 1164cb3e8f
1 changed files with 11 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -194,6 +194,17 @@ Several folders contain optional materials as a bonus for interested readers:
  - [Direct Preference Optimization (DPO) for LLM Alignment](ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb)
  - [Building a User Interface to Interact With the Instruction Finetuned GPT Model](ch07/06_user_interface)

+More bonus material from the [reasoning from scratch](https://github.com/rasbt/reasoning-from-scratch) repository:
+
+- **Qwen3 (from scratch) basics**
+  - [Qwen3 source code walkthrough](https://github.com/rasbt/reasoning-from-scratch/blob/main/chC/01_main-chapter-code/chC_main.ipynb) 
+  - [Optimized Qwen3](https://github.com/rasbt/reasoning-from-scratch/tree/main/ch02/03_optimized-LLM)
+- **Evaluation**
+  - [Verifier-based evaluation (MATH-500)](https://github.com/rasbt/reasoning-from-scratch/tree/main/ch03)
+  - [Multiple-choice evaluation (MMLU)](https://github.com/rasbt/reasoning-from-scratch/blob/main/chF/02_mmlu)
+  - [LLM leaderboard evaluation](https://github.com/rasbt/reasoning-from-scratch/blob/main/chF/03_leaderboards)
+  - [LLM-as-a-judge evaluation](https://github.com/rasbt/reasoning-from-scratch/blob/main/chF/04_llm-judge)
+
 <br>
 &nbsp;