rasbt
|
8fd29ed079
|
Gemma 3 270M from scratch
|
2025-08-16 19:49:38 -05:00 |
|
Sebastian Raschka
|
71ef67be46
|
Qwen3 Coder Flash & MoE from Scratch (#760)
* Qwen3 Coder Flash & MoE from Scratch
* update
* refinements
* updates
* update
* update
* update
|
2025-08-01 19:13:17 -05:00 |
|
Sebastian Raschka
|
db3659241d
|
Add link to official video course (#741)
|
2025-07-13 10:35:12 -05:00 |
|
Sebastian Raschka
|
90c824506c
|
Simplify KV cache usage (#728)
* Simplify KV cache usage
* Swap mark text with ghostwriter
|
2025-07-08 12:56:55 -05:00 |
|
Sebastian Raschka
|
ce24cd428a
|
Update README.md
|
2025-07-06 17:58:33 -05:00 |
|
Sebastian Raschka
|
ab57562066
|
Add prerequisite section (#723)
|
2025-07-06 12:45:42 -05:00 |
|
Sebastian Raschka
|
cd03d5008a
|
Add link to free exercise PDF (#706)
|
2025-06-24 08:24:02 -05:00 |
|
Sebastian Raschka
|
3d4bce6d57
|
Qwen3 From Scratch (#678)
* Qwen3 From Scratch
* rev other file
* upd
* upd
* upd
* url fixes
|
2025-06-19 18:44:38 -05:00 |
|
Sebastian Raschka
|
9aed6f5a76
|
Add KV cache (#671)
|
2025-06-15 09:58:08 -05:00 |
|
Sebastian Raschka
|
e55e3e88e1
|
Alt weight loading code via PyTorch (#585)
* Alt weight loading code via PyTorch
* commit additional files
|
2025-03-27 20:10:23 -05:00 |
|
Sebastian Raschka
|
fa5760a8de
|
GitHub markdown updates (#545)
* GitHub markdown updates
* Apply suggestions from code review
* Apply suggestions from code review
|
2025-02-23 12:25:44 -06:00 |
|
Sebastian Raschka
|
18dd8ab633
|
Badge url updates
|
2025-02-17 12:07:47 -06:00 |
|
rasbt
|
acb1018917
|
update badges
|
2025-02-17 12:02:06 -06:00 |
|
rasbt
|
47030fd8c1
|
update badges
|
2025-02-17 12:00:46 -06:00 |
|
rasbt
|
1d1ac0b9a2
|
update readme badges
|
2025-02-17 11:49:41 -06:00 |
|
Sebastian Raschka
|
4d5f37abd4
|
Update README.md
|
2025-02-16 13:37:32 -06:00 |
|
Sebastian Raschka
|
6370898ce6
|
PyTorch tips for better training performance (#525)
* PyTorch tips for better training performance
* formatting
* pep 8
|
2025-02-12 16:10:34 -06:00 |
|
Sebastian Raschka
|
dcaac28b92
|
Bonus material: extending tokenizers (#496)
* Bonus material: extending tokenizers
* small wording update
|
2025-01-22 09:26:54 -06:00 |
|
Sebastian Raschka
|
b17d097742
|
Implementingthe BPE Tokenizer from Scratch (#487)
|
2025-01-17 12:22:00 -06:00 |
|
Sebastian Raschka
|
6d1bb2cfb4
|
Add chapter names
|
2024-11-08 08:39:34 -06:00 |
|
Sebastian Raschka
|
62e44b2415
|
Update README.md
|
2024-10-29 20:20:48 -05:00 |
|
Sebastian Raschka
|
3d54af20f5
|
Memory efficient weight loading (#401)
* memory efficient weight loading
* remove unused code
|
2024-10-14 10:30:25 -05:00 |
|
Sebastian Raschka
|
58c3bb3d9d
|
Llama 3 (#384)
* Implement Llama 3.2
* Add Llama 3.2 files
* exclude IMDB link because stanford website seems down
|
2024-10-05 07:52:15 -05:00 |
|
Sebastian Raschka
|
c38b003aa9
|
GPT to Llama (#368)
* GPT to Llama
* fix urls
|
2024-09-23 07:34:06 -05:00 |
|
Sebastian Raschka
|
7a9a17608d
|
Add user interface to ch06 and ch07 (#366)
* Add user interface to ch06 and ch07
* pep8
* fix url
|
2024-09-21 20:33:00 -05:00 |
|
Sebastian Raschka
|
1bc560fb13
|
Add chatpgpt-like user interface (#360)
* Add chatpgpt-like user interface
* fixes
|
2024-09-17 08:26:44 -05:00 |
|
Sebastian Raschka
|
f6f9825098
|
reflection-tuning dataset generation (#349)
|
2024-09-10 21:42:12 -05:00 |
|
Daniel Kleine
|
9674e62166
|
nbviewer links / typo (#346)
* fixed typo
* removed remaining nbviewer links
* Update mha-implementations.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-09-07 07:27:28 +02:00 |
|
Sebastian Raschka
|
e8b29c886a
|
Revert nbviewer links
|
2024-09-05 08:09:33 +02:00 |
|
Sebastian Raschka
|
de9ec16570
|
use nbviewer links (#339)
|
2024-08-29 09:09:10 +02:00 |
|
Sebastian Raschka
|
12655594d5
|
Update README.md
|
2024-08-24 07:22:18 -05:00 |
|
Sebastian Raschka
|
de277ebfe6
|
Update README.md
|
2024-08-08 07:50:45 -05:00 |
|
Sebastian Raschka
|
4601219c32
|
Update README.md
|
2024-08-08 07:47:31 -05:00 |
|
Sebastian Raschka
|
b1a420c8ca
|
Update README.md
|
2024-08-06 08:02:01 -05:00 |
|
Sebastian Raschka
|
b39234fc25
|
Update README.md
|
2024-08-05 17:47:06 -05:00 |
|
Sebastian Raschka
|
e4ed2c91cc
|
Update README.md
|
2024-08-04 16:06:38 -05:00 |
|
Sebastian Raschka
|
09dc080cf3
|
Direct Preference Optimization from scratch (#294)
|
2024-08-04 08:57:36 -05:00 |
|
Sebastian Raschka
|
3ea0798d44
|
Update README.md
|
2024-08-01 18:17:42 -05:00 |
|
Sebastian Raschka
|
75a24c4897
|
Update README.md
|
2024-07-28 09:28:11 -05:00 |
|
Sebastian Raschka
|
011ee83ea7
|
Update README.md
|
2024-07-28 08:21:38 -05:00 |
|
Sebastian Raschka
|
1e873d4cbc
|
Understanding PyTorch Buffers (#288)
|
2024-07-26 08:45:36 -05:00 |
|
Sebastian Raschka
|
7356020b44
|
Update README.md
|
2024-07-21 10:42:28 -05:00 |
|
Sebastian Raschka
|
b80b14cf09
|
Update README.md
|
2024-07-21 10:41:53 -05:00 |
|
Sebastian Raschka
|
06ed31f347
|
Updated test badges
|
2024-07-18 20:26:51 -05:00 |
|
rasbt
|
2ab7a9aa9d
|
clarify libraries
|
2024-07-14 08:53:16 -05:00 |
|
Sebastian Raschka
|
3549c2b0d4
|
Add download instructions
|
2024-07-13 06:31:14 -05:00 |
|
Sebastian Raschka
|
28041d7e4c
|
Update README.md
|
2024-07-06 08:26:50 -05:00 |
|
Sebastian Raschka
|
3e47407646
|
Update README.md
|
2024-07-03 20:28:04 -05:00 |
|
Sebastian Raschka
|
17a590e411
|
Update README.md
|
2024-07-03 20:27:54 -05:00 |
|
Sebastian Raschka
|
fe23241ba0
|
Update README.md
|
2024-07-03 05:56:46 -05:00 |
|