mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
Add KV cache (#671)
This commit is contained in:
committed by
GitHub
parent
78bbcb3643
commit
2af686d70b
@@ -9,6 +9,7 @@
|
||||
## Bonus Materials
|
||||
|
||||
- [02_performance-analysis](02_performance-analysis) contains optional code analyzing the performance of the GPT model(s) implemented in the main chapter
|
||||
- [03_kv-cache](03_kv-cache) implements a KV cache to speed up the text generation during inference
|
||||
- [ch05/07_gpt_to_llama](../ch05/07_gpt_to_llama) contains a step-by-step guide for converting a GPT architecture implementation to Llama 3.2 and loads pretrained weights from Meta AI (it might be interesting to look at alternative architectures after completing chapter 4, but you can also save that for after reading chapter 5)
|
||||
|
||||
|
||||
|
||||
Reference in New Issue
Block a user