mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
Multi-Head Latent Attention (#876)
* Multi-Head Latent Attention * update
This commit is contained in:
committed by
GitHub
parent
bf27ad1485
commit
9b9586688d
2
.gitignore
vendored
2
.gitignore
vendored
@@ -12,7 +12,7 @@ appendix-D/01_main-chapter-code/3.pdf
|
||||
appendix-E/01_main-chapter-code/loss-plot.pdf
|
||||
|
||||
ch04/04_gqa/kv_bytes_vs_context_length.pdf
|
||||
ch04/04_gqa/savings_vs_n_kv_groups.pdf
|
||||
ch05/05_mla/kv_bytes_vs_context_length.pdf
|
||||
|
||||
ch05/01_main-chapter-code/loss-plot.pdf
|
||||
ch05/01_main-chapter-code/temperature-plot.pdf
|
||||
|
||||
Reference in New Issue
Block a user