Commit Graph

21 Commits

Author SHA1 Message Date
Daniel Kleine
79210eb393 fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
Daniel Kleine
9a81230968 ch07 fixes (#204)
* updated .gitginore for ch07

* fixed extract_response()
2024-06-10 17:31:13 -05:00
rasbt
f86a929665 revert unnecessary changes 2024-05-27 07:37:06 -05:00
rasbt
b2ad4fb0d6 add comment 2024-05-27 07:18:07 -05:00
Daniel Kleine
7b397fcd46 updated .gitignore 2024-05-19 16:07:20 +00:00
Daniel Kleine
c78ceafe51 updated .gitignore with appendix artifacts 2024-05-15 06:30:24 +00:00
Daniel Kleine
d2fe7287a2 updated .gitignore with 06/02 und /03 artifacts 2024-05-14 12:16:24 +00:00
rasbt
37c33d6fee add chapter 6 unit test 2024-05-12 18:51:28 -05:00
rasbt
98c0723b3d update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
0448162fdc show downloads 2024-05-06 07:40:09 -05:00
rasbt
15d6f29cf8 ch06 csv 2024-05-06 07:16:30 -05:00
rasbt
c6528ede9e ch06 dataset 2024-05-06 06:55:56 -05:00
Sebastian Raschka
0f03c20483 Data loader intuition with numbers (#132)
* data loader intuition with numbers

* fix link

* fix tests
2024-04-27 07:56:41 -05:00
Sebastian Raschka
bae4b0fb08 Make datesets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Daniel Kleine
7d0b9b78b0 Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore

* removed prettier extension, added github alerts markdown extension

* specified download instructions and fixed code markdown

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-05 06:53:01 -05:00
rasbt
ac2bdb02bd make figures for appendix d 2024-03-31 21:22:49 -05:00
rasbt
35c6e12730 ignore ch05 tmp files 2024-03-23 06:52:08 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00
Sebastian Raschka
48253c4f88 Ch05 (#75)
* add chapter 5 main code
2024-03-17 21:07:19 -05:00
rasbt
55aa84ac5c remove OS temp files 2023-12-09 17:17:47 -06:00
rasbt
d66b23588d first sync 2023-07-23 13:18:13 -05:00