Qwen3 and Llama3 equivalency tests with HF transformers (#768)

* Qwen3 and Llama3 equivalency tests with HF transformers
* update
2026-04-10 12:33:42 +00:00 · 2025-08-14 18:36:07 -05:00
parent 2e3205f747
commit 07c3122b5c
6 changed files with 199 additions and 8 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -1,4 +1,3 @@
-
 # Configs and keys
 ch05/07_gpt_to_llama/config.json
 ch07/02_dataset-utilities/config.json
@@ -78,6 +77,11 @@ ch07/01_main-chapter-code/gpt2-medium355M-sft-standalone.pth
 ch07/01_main-chapter-code/Smalltestmodel-sft-standalone.pth
 ch07/01_main-chapter-code/gpt2/

+Qwen3-0.6B-Base/
+Qwen3-0.6B/
+tokenizer-base.json
+tokenizer.json
+
 # Datasets
 the-verdict.txt