LLMs-from-scratch/ch05
Sebastian Raschka c4cde1c21b Reduce Llama 3 RoPE memory requirements (#658)
* Llama3 from scratch improvements

* Fix Llama 3 expensive RoPE memory issue

* updates

* update package

* benchmark

* remove unused rescale_theta
2025-06-12 11:08:02 -05:00

Chapter 5: Pretraining on Unlabeled Data


Main Chapter Code


Bonus Materials



Link to the video