Sebastian Raschka
|
25ea71e713
|
Alternative weight loading via .safetensors (#507)
|
2025-01-29 08:15:29 -06:00 |
|
Daniel Kleine
|
60acb94894
|
BPE: fixed typo (#492)
* fixed typo
* use rel path if exists
* mod gitignore and use existing vocab files
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2025-01-20 20:49:53 -06:00 |
|
Daniel Kleine
|
81eed9afe2
|
updated RoPE statement (#423)
* updated RoPE statement
* updated .gitignore
* Update ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-10-30 08:00:08 -05:00 |
|
Daniel Kleine
|
d38083c401
|
Updated Llama 2 to 3 paths (#413)
* llama 2 and 3 path fixes
* updated llama 3, 3.1 and 3.2 paths
* updated .gitignore
* Typo fix
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-10-24 07:40:08 -05:00 |
|
Sebastian Raschka
|
8a448a4410
|
Llama 3 (#384)
* Implement Llama 3.2
* Add Llama 3.2 files
* exclude IMDB link because stanford website seems down
|
2024-10-05 07:52:15 -05:00 |
|
Sebastian Raschka
|
b993c2b25b
|
Improve rope settings for llama3 (#380)
|
2024-10-03 08:29:54 -05:00 |
|
rasbt
|
6bc3de165c
|
move access token to config.json
|
2024-09-23 08:56:16 -05:00 |
|
Sebastian Raschka
|
0467c8289b
|
GPT to Llama (#368)
* GPT to Llama
* fix urls
|
2024-09-23 07:34:06 -05:00 |
|
Sebastian Raschka
|
76e9a9ec02
|
Add user interface to ch06 and ch07 (#366)
* Add user interface to ch06 and ch07
* pep8
* fix url
|
2024-09-21 20:33:00 -05:00 |
|
Daniel Kleine
|
eefe4bf12b
|
Chainlit bonus material fixes (#361)
* fix cmd
* moved idx to device
* improved code with clone().detach()
* fixed path
* fix: added extra line for pep8
* updated .gitignore
* Update ch05/06_user_interface/app_orig.py
* Update ch05/06_user_interface/app_own.py
* Apply suggestions from code review
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-09-18 08:08:50 -07:00 |
|
Sebastian Raschka
|
ea9b4e83a4
|
Add ChatGPT-like user interface (#360)
* Add ChatGPT-like user interface
* fixes
|
2024-09-17 08:26:44 -05:00 |
|
Eric Thomson
|
da5236ee72
|
Adds .vscode folder to .gitignore (#314)
* added .vscode folder to .gitignore
* Update .gitignore
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-08-12 07:49:11 -05:00 |
|
Daniel Kleine
|
8318d1f002
|
minor DPO fixes (#298)
* fixed issues, updated .gitignore
* added closing paren
* fixed CEL spelling
* fixed more minor issues
* Update ch07/01_main-chapter-code/ch07.ipynb
* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb
* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb
* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-08-05 08:40:46 -05:00 |
|
Daniel Kleine
|
3ac363d005
|
updated .gitignore for ch07/01 artefacts (#242)
* fixed markdown
* removed redundant imports
* updated .gitignore for ch07/01 artefacts
|
2024-06-22 18:12:01 -05:00 |
|
Sebastian Raschka
|
ec5baa1f33
|
Add CI tests for chapter 7 (#239)
|
2024-06-22 08:57:18 -05:00 |
|
Sebastian Raschka
|
b90c7ad2d6
|
Exercise solutions (#237)
|
2024-06-22 08:30:45 -05:00 |
|
Sebastian Raschka
|
6c0dc2362b
|
Add standalone finetuning and evaluation scripts for chapter 7 (#234)
* add finetuning and eval scripts
* update link
* update links
* fix link
|
2024-06-21 05:23:24 -05:00 |
|
Daniel Kleine
|
dcbdc1d2e5
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
Daniel Kleine
|
da9f64215a
|
ch07 fixes (#204)
* updated .gitignore for ch07
* fixed extract_response()
|
2024-06-10 17:31:13 -05:00 |
|
rasbt
|
42af52fef4
|
revert unnecessary changes
|
2024-05-27 07:37:06 -05:00 |
|
rasbt
|
dd7ba32b56
|
add comment
|
2024-05-27 07:18:07 -05:00 |
|
Daniel Kleine
|
e7914182c6
|
updated .gitignore
|
2024-05-19 16:07:20 +00:00 |
|
Daniel Kleine
|
fabdefe959
|
updated .gitignore with appendix artifacts
|
2024-05-15 06:30:24 +00:00 |
|
Daniel Kleine
|
88ee7793d4
|
updated .gitignore with 06/02 and /03 artifacts
|
2024-05-14 12:16:24 +00:00 |
|
rasbt
|
21172a6a7e
|
add chapter 6 unit test
|
2024-05-12 18:51:28 -05:00 |
|
rasbt
|
2e47a6e61c
|
update dataset naming
|
2024-05-12 09:22:42 -05:00 |
|
rasbt
|
16e276f8df
|
show downloads
|
2024-05-06 07:40:09 -05:00 |
|
rasbt
|
258dcad5ee
|
ch06 csv
|
2024-05-06 07:16:30 -05:00 |
|
rasbt
|
83d5cea795
|
ch06 dataset
|
2024-05-06 06:55:56 -05:00 |
|
Sebastian Raschka
|
fc3d70f72f
|
Data loader intuition with numbers (#132)
* data loader intuition with numbers
* fix link
* fix tests
|
2024-04-27 07:56:41 -05:00 |
|
Sebastian Raschka
|
dd51d4ad83
|
Make datasets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
Daniel Kleine
|
44c0494406
|
Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore
* removed prettier extension, added github alerts markdown extension
* specified download instructions and fixed code markdown
* Update ch05/03_bonus_pretraining_on_gutenberg/README.md
* Update ch05/03_bonus_pretraining_on_gutenberg/README.md
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-05 06:53:01 -05:00 |
|
rasbt
|
ac2bdb02bd
|
make figures for appendix d
|
2024-03-31 21:22:49 -05:00 |
|
rasbt
|
35c6e12730
|
ignore ch05 tmp files
|
2024-03-23 06:52:08 -05:00 |
|
Sebastian Raschka
|
4582995ced
|
Add alternative weight loading strategy as backup (#82)
|
2024-03-20 08:43:18 -05:00 |
|
Sebastian Raschka
|
48253c4f88
|
Ch05 (#75)
* add chapter 5 main code
|
2024-03-17 21:07:19 -05:00 |
|
rasbt
|
55aa84ac5c
|
remove OS temp files
|
2023-12-09 17:17:47 -06:00 |
|
rasbt
|
d66b23588d
|
first sync
|
2023-07-23 13:18:13 -05:00 |
|