Commit Graph

38 Commits

Author SHA1 Message Date
Sebastian Raschka
25ea71e713 Alternative weight loading via .safetensors (#507) 2025-01-29 08:15:29 -06:00
Daniel Kleine
60acb94894 BPE: fixed typo (#492)
* fixed typo

* use rel path if exists

* mod gitignore and use existing vocab files

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-20 20:49:53 -06:00
Daniel Kleine
81eed9afe2 updated RoPE statement (#423)
* updated RoPE statement

* updated .gitignore

* Update ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-10-30 08:00:08 -05:00
Daniel Kleine
d38083c401 Updated Llama 2 to 3 paths (#413)
* llama 2 and 3 path fixes

* updated llama 3, 3.1 and 3.2 paths

* updated .gitignore

* Typo fix

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-10-24 07:40:08 -05:00
Sebastian Raschka
8a448a4410 Llama 3 (#384)
* Implement Llama 3.2

* Add Llama 3.2 files

* exclude IMDB link because stanford website seems down
2024-10-05 07:52:15 -05:00
Sebastian Raschka
b993c2b25b Improve rope settings for llama3 (#380) 2024-10-03 08:29:54 -05:00
rasbt
6bc3de165c move access token to config.json 2024-09-23 08:56:16 -05:00
Sebastian Raschka
0467c8289b GPT to Llama (#368)
* GPT to Llama

* fix urls
2024-09-23 07:34:06 -05:00
Sebastian Raschka
76e9a9ec02 Add user interface to ch06 and ch07 (#366)
* Add user interface to ch06 and ch07

* pep8

* fix url
2024-09-21 20:33:00 -05:00
Daniel Kleine
eefe4bf12b Chainlit bonus material fixes (#361)
* fix cmd

* moved idx to device

* improved code with clone().detach()

* fixed path

* fix: added extra line for pep8

* updated .gitignore

* Update ch05/06_user_interface/app_orig.py

* Update ch05/06_user_interface/app_own.py

* Apply suggestions from code review

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-09-18 08:08:50 -07:00
Sebastian Raschka
ea9b4e83a4 Add ChatGPT-like user interface (#360)
* Add ChatGPT-like user interface

* fixes
2024-09-17 08:26:44 -05:00
Eric Thomson
da5236ee72 Adds .vscode folder to .gitignore (#314)
* added .vscode folder to .gitignore

* Update .gitignore

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-12 07:49:11 -05:00
Daniel Kleine
8318d1f002 minor DPO fixes (#298)
* fixed issues, updated .gitignore

* added closing paren

* fixed CEL spelling

* fixed more minor issues

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-05 08:40:46 -05:00
Daniel Kleine
3ac363d005 updated .gitignore for ch07/01 artefacts (#242)
* fixed markdown

* removed redundant imports

* updated .gitignore for ch07/01 artefacts
2024-06-22 18:12:01 -05:00
Sebastian Raschka
ec5baa1f33 Add CI tests for chapter 7 (#239) 2024-06-22 08:57:18 -05:00
Sebastian Raschka
b90c7ad2d6 Exercise solutions (#237) 2024-06-22 08:30:45 -05:00
Sebastian Raschka
6c0dc2362b Add standalone finetuning and evaluation scripts for chapter 7 (#234)
* add finetuning and eval scripts

* update link

* update links

* fix link
2024-06-21 05:23:24 -05:00
Daniel Kleine
dcbdc1d2e5 fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
Daniel Kleine
da9f64215a ch07 fixes (#204)
* updated .gitignore for ch07

* fixed extract_response()
2024-06-10 17:31:13 -05:00
rasbt
42af52fef4 revert unnecessary changes 2024-05-27 07:37:06 -05:00
rasbt
dd7ba32b56 add comment 2024-05-27 07:18:07 -05:00
Daniel Kleine
e7914182c6 updated .gitignore 2024-05-19 16:07:20 +00:00
Daniel Kleine
fabdefe959 updated .gitignore with appendix artifacts 2024-05-15 06:30:24 +00:00
Daniel Kleine
88ee7793d4 updated .gitignore with 06/02 and /03 artifacts 2024-05-14 12:16:24 +00:00
rasbt
21172a6a7e add chapter 6 unit test 2024-05-12 18:51:28 -05:00
rasbt
2e47a6e61c update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
16e276f8df show downloads 2024-05-06 07:40:09 -05:00
rasbt
258dcad5ee ch06 csv 2024-05-06 07:16:30 -05:00
rasbt
83d5cea795 ch06 dataset 2024-05-06 06:55:56 -05:00
Sebastian Raschka
fc3d70f72f Data loader intuition with numbers (#132)
* data loader intuition with numbers

* fix link

* fix tests
2024-04-27 07:56:41 -05:00
Sebastian Raschka
dd51d4ad83 Make datasets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Daniel Kleine
44c0494406 Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore

* removed prettier extension, added github alerts markdown extension

* specified download instructions and fixed code markdown

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-05 06:53:01 -05:00
rasbt
ac2bdb02bd make figures for appendix d 2024-03-31 21:22:49 -05:00
rasbt
35c6e12730 ignore ch05 tmp files 2024-03-23 06:52:08 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00
Sebastian Raschka
48253c4f88 Ch05 (#75)
* add chapter 5 main code
2024-03-17 21:07:19 -05:00
rasbt
55aa84ac5c remove OS temp files 2023-12-09 17:17:47 -06:00
rasbt
d66b23588d first sync 2023-07-23 13:18:13 -05:00