mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
Alt weight loading code via PyTorch (#585)
* Alt weight loading code via PyTorch * commit additional files
This commit is contained in:
committed by
GitHub
parent
ffd4035144
commit
3f93d73d6d
@@ -113,7 +113,7 @@ Several folders contain optional materials as a bonus for interested readers:
|
||||
- **Chapter 4: Implementing a GPT model from scratch**
|
||||
- [FLOPS Analysis](ch04/02_performance-analysis/flops-analysis.ipynb)
|
||||
- **Chapter 5: Pretraining on unlabeled data:**
|
||||
- [Alternative Weight Loading from Hugging Face Model Hub using Transformers](ch05/02_alternative_weight_loading/weight-loading-hf-transformers.ipynb)
|
||||
- [Alternative Weight Loading Methods](ch05/02_alternative_weight_loading/)
|
||||
- [Pretraining GPT on the Project Gutenberg Dataset](ch05/03_bonus_pretraining_on_gutenberg)
|
||||
- [Adding Bells and Whistles to the Training Loop](ch05/04_learning_rate_schedulers)
|
||||
- [Optimizing Hyperparameters for Pretraining](ch05/05_bonus_hparam_tuning)
|
||||
|
||||
Reference in New Issue
Block a user