Files
LLMs-from-scratch/ch05
casinca 152a087a37 removing unused RoPE parameters (#590)
* removing unused RoPE parameters

* remove redundant context_length in GQA

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-03-31 17:10:39 -05:00
..
2025-03-23 19:28:49 -05:00
2025-03-23 19:35:12 -05:00

Chapter 5: Pretraining on Unlabeled Data

 

Main Chapter Code

 

Bonus Materials



Link to the video