mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
Chapter 5 with alternative LLMs (Qwen3, Llama 3) (#943)
* ch05-alt-llms * fix double import * style fix * Fix urls * Fix urls * Fix urls
This commit is contained in:
committed by
GitHub
parent
9df9e69cd2
commit
90e0f3cc15
@@ -187,9 +187,10 @@ Several folders contain optional materials as a bonus for interested readers:
|
||||
- [Qwen3 Dense and Mixture-of-Experts (MoE) From Scratch](ch05/11_qwen3/)
|
||||
- [Gemma 3 From Scratch](ch05/12_gemma3/)
|
||||
- [Olmo 3 From Scratch](ch05/13_olmo3/)
|
||||
- [Chapter 5 with other LLMs as Drop-In Replacement (e.g., Llama 3, Qwen 3)](ch05/14_ch05_with_other_llms/)
|
||||
- **Chapter 6: Finetuning for classification**
|
||||
- [Additional experiments finetuning different layers and using larger models](ch06/02_bonus_additional-experiments)
|
||||
- [Finetuning different models on 50k IMDb movie review dataset](ch06/03_bonus_imdb-classification)
|
||||
- [Additional Experiments Finetuning Different Layers and Using Larger Models](ch06/02_bonus_additional-experiments)
|
||||
- [Finetuning Different Models on the 50k IMDb Movie Review Dataset](ch06/03_bonus_imdb-classification)
|
||||
- [Building a User Interface to Interact With the GPT-based Spam Classifier](ch06/04_user_interface)
|
||||
- **Chapter 7: Finetuning to follow instructions**
|
||||
- [Dataset Utilities for Finding Near Duplicates and Creating Passive Voice Entries](ch07/02_dataset-utilities)
|
||||
|
||||
@@ -1192,7 +1192,7 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.10.16"
|
||||
"version": "3.13.5"
|
||||
},
|
||||
"widgets": {
|
||||
"application/vnd.jupyter.widget-state+json": {
|
||||
|
||||
@@ -1179,7 +1179,7 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.12.3"
|
||||
"version": "3.13.5"
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
|
||||
8
ch05/14_ch05_with_other_llms/README.md
Normal file
8
ch05/14_ch05_with_other_llms/README.md
Normal file
@@ -0,0 +1,8 @@
|
||||
# Chapter 5 With Other LLMs
|
||||
|
||||
This folder contains code notebooks that swap in other LLMs (for example, Qwen3 and Llama 3) for GPT-2 in Chapter 5.
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
1470
ch05/14_ch05_with_other_llms/ch05-llama32.ipynb
Normal file
1470
ch05/14_ch05_with_other_llms/ch05-llama32.ipynb
Normal file
File diff suppressed because one or more lines are too long
1467
ch05/14_ch05_with_other_llms/ch05-qwen3.ipynb
Normal file
1467
ch05/14_ch05_with_other_llms/ch05-qwen3.ipynb
Normal file
File diff suppressed because one or more lines are too long
Reference in New Issue
Block a user