fixes for code (#206)

* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
This commit is contained in:
Daniel Kleine
2024-06-12 03:59:48 +02:00
committed by GitHub
parent 1a65020d81
commit dcbdc1d2e5
12 changed files with 33 additions and 46 deletions

2
.gitignore vendored
View File

@@ -20,6 +20,7 @@ ch07/01_main-chapter-code/loss-plot.pdf
# Checkpoint files
appendix-A/01_main-chapter-code/model.pth
appendix-E/01_main-chapter-code/gpt2
ch05/01_main-chapter-code/gpt2/
@@ -33,6 +34,7 @@ ch06/02_bonus_additional-experiments/gpt2
ch06/03_bonus_imdb-classification/gpt2
ch07/01_main-chapter-code/gpt2-medium355M-sft.pth
ch07/01_main-chapter-code/gpt2/
# Datasets
appendix-E/01_main-chapter-code/sms_spam_collection.zip