Commit Graph

311 Commits

Author SHA1 Message Date
rasbt
3a632323df make code more general for larger models 2024-05-05 10:18:46 -05:00
Sebastian Raschka
2e9d5acb5e cosmetics 2024-05-05 08:15:46 -05:00
rasbt
e9bdbf0725 add text-to-token-id fn 2024-05-05 08:05:20 -05:00
Sebastian Raschka
d3201f5aad Add figures for ch06 (#141) 2024-05-05 07:10:04 -05:00
rasbt
b8324061d0 update link 2024-05-04 08:08:58 -05:00
rasbt
dddf87296e table-update 2024-05-04 07:58:18 -05:00
rasbt
d60dcc6724 add description 2024-05-04 07:34:29 -05:00
Sebastian Raschka
da61d5b76a Ch06 draft (#138)
* Ch06 first draft

* add utility files
2024-05-03 08:37:58 -05:00
rasbt
c735c21e87 fix swiglu acronym 2024-05-01 20:26:17 -05:00
rasbt
aec169dc12 link formatting 2024-04-30 06:26:23 -05:00
rasbt
d249960bdc Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-04-30 06:25:37 -05:00
Sebastian Raschka
82d6bd47a4 use training set len (#137) 2024-04-29 21:56:05 -05:00
rasbt
0ac19a1e50 use training set len 2024-04-29 21:50:07 -05:00
Sebastian Raschka
97ed38116a Rename drop_resid to drop_shortcut (#136) 2024-04-28 14:31:27 -05:00
Sebastian Raschka
70cd174091 add roberta option (#135) 2024-04-28 13:57:36 -05:00
Sebastian Raschka
ca47c5e4b2 Formatting improvements (#134)
* formatting improvements

* .yml triggers
2024-04-28 12:05:32 -05:00
Sebastian Raschka
9a5d4d8ac9 Try windows runners (#133)
* try windows runners

* update triggers

* trigger with code file update

* add new status badges
2024-04-28 07:39:23 -05:00
Sebastian Raschka
e1d094b655 Update README.md 2024-04-27 07:59:42 -05:00
Sebastian Raschka
fc3d70f72f Data loader intuition with numbers (#132)
* data loader intuition with numbers

* fix link

* fix tests
2024-04-27 07:56:41 -05:00
Sebastian Raschka
4adb96d7ee Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection

* remove redundant buffer
2024-04-26 17:13:08 -05:00
Sebastian Raschka
59b4fd3e25 IMDB experiments (#128)
* IMDB experiments

* style fixes

* Update README.md
2024-04-25 07:20:53 -05:00
rasbt
258aff3e9a style checks 2024-04-24 07:48:51 -05:00
rasbt
46d09b30d9 add usage 2024-04-24 07:27:04 -05:00
rasbt
5ef438aa3b add more experiments 2024-04-24 07:23:11 -05:00
rasbt
642f819910 update requirements 2024-04-24 06:38:02 -05:00
rasbt
3b4484029d rename folder 2024-04-23 21:02:57 -05:00
rasbt
c7cdedf981 update figures in bonus notebook 2024-04-23 21:01:27 -05:00
Sebastian Raschka
16964a6486 Chapter 6 ablation studies (#127)
* Chapter 6 ablation studies

* add table

* formatting

* formatting

* formatting
2024-04-23 09:51:52 -05:00
Sebastian Raschka
0bd2608a6c update stride wording 2024-04-22 20:40:48 -05:00
rasbt
90d239b4f7 fix merge conflict 2024-04-22 07:05:40 -05:00
rasbt
72be9f4e8e update numbering 2024-04-22 07:00:20 -05:00
rasbt
868955f6a5 file header 2024-04-22 06:53:38 -05:00
Sebastian Raschka
44b3815960 remove requests dependency (#125) 2024-04-21 14:15:05 -05:00
rasbt
d202cabdee update figures 2024-04-20 11:42:03 -05:00
Sebastian Raschka
c70ddff558 Return nan if val loader is empty (#124) 2024-04-20 08:02:30 -05:00
Sebastian Raschka
7740d556a0 Use dim=-1 for consistency (#122) 2024-04-18 05:56:23 -05:00
Sebastian Raschka
e0ce5ca459 Calculate warmup steps as a fraction (#121) 2024-04-17 20:30:42 -05:00
Sebastian Raschka
8d53e8d8cd extend setup instructions (#120) 2024-04-15 21:05:03 -05:00
Sebastian Raschka
b59eacb01f Update README.md 2024-04-14 12:42:02 -05:00
rasbt
0729afa835 shorten badge names 2024-04-14 12:41:23 -05:00
Sebastian Raschka
a9c1b94d09 Check hyperlink badge 2024-04-14 12:38:55 -05:00
Sebastian Raschka
155ac03f61 use torch no grad for loss (#119) 2024-04-14 08:13:07 -05:00
Sebastian Raschka
a3a5574758 Update README.md 2024-04-13 15:04:08 -05:00
Sebastian Raschka
dd51d4ad83 Make datesets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Sebastian Raschka
9f3f231ac7 use correct lr 2024-04-12 19:55:07 -04:00
Sebastian Raschka
55ebabf95c Automated link checking (#117)
* Automated link checking

* Fix links in Jupyter Nbs
2024-04-12 19:08:34 -04:00
Sebastian Raschka
33b27368a3 improve check-links.yml 2024-04-11 17:23:15 -04:00
Sebastian Raschka
5ca4384eb7 unit test indicator placement 2024-04-10 22:15:07 -04:00
Sebastian Raschka
ae3020bc12 setup instruction note 2024-04-10 22:13:22 -04:00
Sebastian Raschka
e757091301 Organized setup instructions (#115)
* Organized setup instructions

* update tets

* link checker action

* raise error upon broken link

* fix links

* fix links

* delete duplicated paragraph
2024-04-10 22:09:46 -04:00