Multi-Head Latent Attention (#876)

* Multi-Head Latent Attention

* update
This commit is contained in:
Sebastian Raschka
2025-10-11 20:08:30 -05:00
committed by GitHub
parent bf27ad1485
commit 9b9586688d
15 changed files with 1164 additions and 233 deletions

2
.gitignore vendored
View File

@@ -12,7 +12,7 @@ appendix-D/01_main-chapter-code/3.pdf
appendix-E/01_main-chapter-code/loss-plot.pdf
ch04/04_gqa/kv_bytes_vs_context_length.pdf
ch04/04_gqa/savings_vs_n_kv_groups.pdf
ch05/05_mla/kv_bytes_vs_context_length.pdf
ch05/01_main-chapter-code/loss-plot.pdf
ch05/01_main-chapter-code/temperature-plot.pdf