mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
Qwen3 and Llama3 equivalency teests with HF transformers (#768)
* Qwen3 and Llama3 equivalency teests with HF transformers * update
This commit is contained in:
committed by
GitHub
parent
2e3205f747
commit
07c3122b5c
6
.gitignore
vendored
6
.gitignore
vendored
@@ -1,4 +1,3 @@
|
||||
|
||||
# Configs and keys
|
||||
ch05/07_gpt_to_llama/config.json
|
||||
ch07/02_dataset-utilities/config.json
|
||||
@@ -78,6 +77,11 @@ ch07/01_main-chapter-code/gpt2-medium355M-sft-standalone.pth
|
||||
ch07/01_main-chapter-code/Smalltestmodel-sft-standalone.pth
|
||||
ch07/01_main-chapter-code/gpt2/
|
||||
|
||||
Qwen3-0.6B-Base/
|
||||
Qwen3-0.6B/
|
||||
tokenizer-base.json
|
||||
tokenizer.json
|
||||
|
||||
# Datasets
|
||||
the-verdict.txt
|
||||
|
||||
|
||||
Reference in New Issue
Block a user