Qwen3 and Llama3 equivalency teests with HF transformers (#768)

* Qwen3 and Llama3 equivalency teests with HF transformers

* update
This commit is contained in:
Sebastian Raschka
2025-08-14 18:36:07 -05:00
committed by GitHub
parent 2e3205f747
commit 07c3122b5c
6 changed files with 199 additions and 8 deletions

6
.gitignore vendored
View File

@@ -1,4 +1,3 @@
# Configs and keys
ch05/07_gpt_to_llama/config.json
ch07/02_dataset-utilities/config.json
@@ -78,6 +77,11 @@ ch07/01_main-chapter-code/gpt2-medium355M-sft-standalone.pth
ch07/01_main-chapter-code/Smalltestmodel-sft-standalone.pth
ch07/01_main-chapter-code/gpt2/
Qwen3-0.6B-Base/
Qwen3-0.6B/
tokenizer-base.json
tokenizer.json
# Datasets
the-verdict.txt