mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
Commit Graph
- f2b034df66 Merge pull request #17 from rasbt/package-install (Sebastian Raschka, 2024-01-21 20:16:59 -06:00)
- fdfa39eb71 additional package installation info (rasbt, 2024-01-21 20:16:19 -06:00)
- 8860e16e05 <|endoftext|> token in dataset v1 (rasbt, 2024-01-21 12:03:04 -06:00)
- b1923a3075 Fix links (Sebastian Raschka, 2024-01-19 21:00:20 -06:00)
- 92896d817c add toggle for qkv_bias (rasbt, 2024-01-17 07:50:57 -06:00)
- 0074c98968 add download utilities for vocab and encoder files (rasbt, 2024-01-15 17:07:55 -06:00)
- dfe2c3b46f use blocksize in positional embedding (rasbt, 2024-01-15 08:15:33 -06:00)
- 9e85f13ba9 readability improvements (rasbt, 2024-01-15 07:36:19 -06:00)
- a7b4880179 small readability updates (rasbt, 2024-01-14 11:58:42 -06:00)
- c79499572f update chapter title (rasbt, 2024-01-13 14:51:39 -06:00)
- 6a1af00313 Update README.md (Sebastian Raschka, 2024-01-13 14:50:25 -06:00)
- 32267e3253 Merge pull request #14 from rasbt/update (Sebastian Raschka, 2024-01-13 14:49:56 -06:00)
- c400f77f26 update exercise solutions (rasbt, 2024-01-13 14:49:02 -06:00)
- 4105844d34 update cover (rasbt, 2024-01-10 17:55:28 -06:00)
- fce03b2f63 Merge pull request #13 from rasbt/update (Sebastian Raschka, 2024-01-10 08:02:02 -06:00)
- f279134492 small cosmetic fixes and improvements (rasbt, 2024-01-10 08:01:19 -06:00)
- c93f434f52 improve table readability (rasbt, 2024-01-07 15:18:03 -06:00)
- 690a1a62b0 add comments to ToC and fix link (rasbt, 2024-01-07 15:13:53 -06:00)
- e113075a16 show normalization explicitely (rasbt, 2024-01-06 19:24:01 -05:00)
- ea4b6c4e5f add package versions to the top of the notebook (rasbt, 2024-01-01 19:41:18 +01:00)
- c17a8c2d8d Update LICENSE.txt (Sebastian Raschka, 2024-01-01 16:21:05 +01:00)
- 5602366381 Merge pull request #12 from rasbt/embedding-size (Sebastian Raschka, 2023-12-28 19:06:13 +01:00)
- 4f161bd549 use block size variable in positional embedding layer (rasbt, 2023-12-28 19:05:06 +01:00)
- 10aa40ba6a Merge pull request #10 from Shuyib/main (Sebastian Raschka, 2023-12-27 17:03:34 +01:00)
- c11b5ca211 Merge pull request #1 from Shuyib/Shuyib-patch-1 (Megabyte, 2023-12-27 16:32:18 +03:00)
- 0beb32b45a Update requirements.txt (Megabyte, 2023-12-27 16:30:31 +03:00)
- 0f83b87c04 Merge pull request #9 from xiaotian0328/patch-1 (Sebastian Raschka, 2023-12-27 08:28:52 +01:00)
- b8901da362 Update ch03.ipynb (Xiaotian Ma, 2023-12-26 22:41:54 -06:00)
- c518adb0b7 Update ch03.ipynb (Xiaotian Ma, 2023-12-26 22:05:21 -06:00)
- 727615423d add repo info (rasbt, 2023-12-21 08:54:15 +01:00)
- 7f0595f5f6 Merge pull request #7 from pitmonticone/main (Sebastian Raschka, 2023-12-17 09:09:31 -06:00)
- 40698e63b3 Update embeddings-and-linear-layers.ipynb (Pietro Monticone, 2023-12-17 16:01:46 +01:00)
- 1bebe21b91 Update ch02.ipynb (Pietro Monticone, 2023-12-17 16:01:43 +01:00)
- f5c0fda59a update overview figure (Sebastian Raschka, 2023-12-16 12:43:33 -06:00)
- 5824d87a51 Update README.md (Sebastian Raschka, 2023-12-13 06:12:50 -06:00)
- db800cd67d Merge pull request #6 from rasbt/cover-and-book-info (Sebastian Raschka, 2023-12-12 07:27:44 -06:00)
- d29bb5a01e Add cover and book info (rasbt, 2023-12-12 07:27:03 -06:00)
- 220df4ffb3 Delete ch02/03_bonus_embedding-vs-matmul/.DS_Store (Sebastian Raschka, 2023-12-10 08:18:25 -06:00)
- a16585049e Delete ch02/.DS_Store (Sebastian Raschka, 2023-12-10 08:18:11 -06:00)
- 28f43bd118 Delete ch03/.DS_Store (Sebastian Raschka, 2023-12-10 08:17:56 -06:00)
- 7e6cea02e2 Delete ch03/01_main-chapter-code/.DS_Store (Sebastian Raschka, 2023-12-10 08:17:47 -06:00)
- f908a6ea59 remove temp files (rasbt, 2023-12-09 17:21:06 -06:00)
- d82a5d6c08 remove temp files (rasbt, 2023-12-09 17:20:40 -06:00)
- 7cb3f59fff Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch (rasbt, 2023-12-09 17:18:43 -06:00)
- 55aa84ac5c remove OS temp files (rasbt, 2023-12-09 17:17:47 -06:00)
- aa5851914f Merge pull request #4 from rasbt/ch03 (Sebastian Raschka, 2023-12-09 17:14:56 -06:00)
- 31980a6ef1 add ch03 and TOC (rasbt, 2023-12-09 17:13:56 -06:00)
- d015b2ad08 link summary and exercises (rasbt, 2023-12-08 06:30:20 -06:00)
- 752b035632 Update LICENSE.txt (Sebastian Raschka, 2023-12-05 18:45:47 -06:00)
- c8825b7c22 add exercise solutions (rasbt, 2023-10-27 06:19:40 -05:00)
- e827b42e1e add readme files (rasbt, 2023-10-25 18:46:40 -05:00)
- f26aa70ebd fix size of positional embedding layer (rasbt, 2023-10-23 20:20:12 -05:00)
- ab1261d9b1 cleanup and minimal notebook (Sebastian R, 2023-10-15 17:15:20 -05:00)
- a19305fcda Merge pull request #2 from rasbt/ch02-code (Sebastian Raschka, 2023-10-02 06:48:47 -05:00)
- e6a4f7a4d3 minor cosmetic fixes (rasbt, 2023-10-02 06:48:16 -05:00)
- e90f2d0e2b Merge pull request #1 from rasbt/ch02-code (Sebastian Raschka, 2023-09-28 07:10:00 -05:00)
- d1d29d0555 ch02 code (rasbt, 2023-09-28 07:08:50 -05:00)
- 5723c3802b restruture old ch02 into appendix A (rasbt, 2023-09-22 07:01:08 -05:00)
- d66b23588d first sync (rasbt, 2023-07-23 13:18:13 -05:00)