Gemma 3 270M from scratch

This commit is contained in:
rasbt
2025-08-16 19:49:38 -05:00
parent 27fa95d24b
commit 8fd29ed079
8 changed files with 1391 additions and 0 deletions

2
.gitignore vendored
View File

@@ -77,6 +77,8 @@ ch07/01_main-chapter-code/gpt2-medium355M-sft-standalone.pth
ch07/01_main-chapter-code/Smalltestmodel-sft-standalone.pth
ch07/01_main-chapter-code/gpt2/
gemma-3-270m/
gemma-3-270m-it/
Qwen3-0.6B-Base/
Qwen3-0.6B/
tokenizer-base.json