books/LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2026-04-10 12:33:42 +00:00

taihaozesong f1fa9df15c Fix mha wrapper implementations in ch03 bonus

2024-03-13 18:02:26 +08:00

ch03.py: Fix mha wrapper implementations in ch03 bonus (2024-03-13 18:02:26 +08:00)

mha-implementations.ipynb: Fix mha wrapper implementations in ch03 bonus (2024-03-13 18:02:26 +08:00)

README.md: mha variants (2024-03-06 08:30:32 -06:00)

README.md

More Efficient Multi-Head Attention Implementations

mha-implementations.ipynb contains and compares different implementations of multi-head attention.
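
To give a rough idea of what comparing implementations can look like, here is a minimal sketch of two causal multi-head attention variants that should produce the same result: one loops over the heads, the other splits the heads with a single reshape. This is not the notebook's code. It assumes NumPy rather than PyTorch, omits dropout and the output projection, and all function names (`mha_per_head`, `mha_batched`, `causal_mask`) are hypothetical and chosen for illustration only.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def causal_mask(num_tokens):
    # True above the diagonal: positions a token must not attend to
    return np.triu(np.ones((num_tokens, num_tokens), dtype=bool), k=1)


def mha_per_head(x, W_q, W_k, W_v, num_heads):
    """Variant 1 (illustrative): loop over heads, one weight slice per head."""
    num_tokens, d_out = x.shape[0], W_q.shape[1]
    head_dim = d_out // num_heads
    mask = causal_mask(num_tokens)
    outputs = []
    for h in range(num_heads):
        cols = slice(h * head_dim, (h + 1) * head_dim)
        q, k, v = x @ W_q[:, cols], x @ W_k[:, cols], x @ W_v[:, cols]
        scores = q @ k.T / np.sqrt(head_dim)
        scores = np.where(mask, -np.inf, scores)
        outputs.append(softmax(scores) @ v)
    # Concatenate the per-head context vectors along the feature axis
    return np.concatenate(outputs, axis=-1)


def mha_batched(x, W_q, W_k, W_v, num_heads):
    """Variant 2 (illustrative): project once, then split heads by reshaping."""
    num_tokens, d_out = x.shape[0], W_q.shape[1]
    head_dim = d_out // num_heads

    def split(t):
        # (tokens, d_out) -> (heads, tokens, head_dim)
        return t.reshape(num_tokens, num_heads, head_dim).transpose(1, 0, 2)

    q, k, v = split(x @ W_q), split(x @ W_k), split(x @ W_v)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(head_dim)
    scores = np.where(causal_mask(num_tokens), -np.inf, scores)
    context = softmax(scores) @ v  # (heads, tokens, head_dim)
    # (heads, tokens, head_dim) -> (tokens, d_out)
    return context.transpose(1, 0, 2).reshape(num_tokens, d_out)


# Toy sizes, assumed for this sketch only
rng = np.random.default_rng(0)
num_tokens, d_in, d_out, num_heads = 6, 8, 8, 2
x = rng.normal(size=(num_tokens, d_in))
W_q, W_k, W_v = (rng.normal(size=(d_in, d_out)) for _ in range(3))

out_loop = mha_per_head(x, W_q, W_k, W_v, num_heads)
out_batched = mha_batched(x, W_q, W_k, W_v, num_heads)
print(np.allclose(out_loop, out_batched))
```

Checking that the variants agree numerically before timing them is a sensible first step in any such comparison, since a faster implementation is only useful if it computes the same attention output.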