mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2026-04-10 12:33:42 +00:00

More Efficient Multi-Head Attention Implementations

The mha-implementations.ipynb notebook contains and compares different implementations of multi-head attention.
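As a rough illustration of what such a comparison can look like, the sketch below checks that two causal attention variants produce the same output and times each one. It is not taken from the notebook: the function names (`manual_causal_attention`, `fused_causal_attention`, `time_fn`) and the tensor sizes are assumptions made for this example. It covers only the attention computation itself, without the query/key/value projections of a full multi-head attention module, and assumes PyTorch 2.0 or newer for `scaled_dot_product_attention`.

```python
import time

import torch
import torch.nn.functional as F


def manual_causal_attention(q, k, v):
    # q, k, v have shape (batch, num_heads, num_tokens, head_dim)
    num_tokens, head_dim = q.shape[-2], q.shape[-1]
    scores = q @ k.transpose(-2, -1) / head_dim**0.5
    # Mask out positions above the diagonal so tokens cannot attend to the future
    mask = torch.triu(
        torch.ones(num_tokens, num_tokens, dtype=torch.bool, device=q.device),
        diagonal=1,
    )
    scores = scores.masked_fill(mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v


def fused_causal_attention(q, k, v):
    # PyTorch's built-in scaled dot-product attention with causal masking
    return F.scaled_dot_product_attention(q, k, v, is_causal=True)


def time_fn(fn, *args, repeats=10):
    # Hypothetical timing helper: average wall-clock seconds per call on CPU.
    # On a GPU, torch.cuda.synchronize() would be needed before reading the clock.
    fn(*args)  # warm-up call, excluded from the measurement
    start = time.perf_counter()
    for _ in range(repeats):
        fn(*args)
    return (time.perf_counter() - start) / repeats


torch.manual_seed(123)
# Illustrative sizes: batch=2, heads=4, tokens=64, head_dim=16
q, k, v = (torch.randn(2, 4, 64, 16) for _ in range(3))

out_manual = manual_causal_attention(q, k, v)
out_fused = fused_causal_attention(q, k, v)
print("Outputs match:", torch.allclose(out_manual, out_fused, atol=1e-4))

manual_time = time_fn(manual_causal_attention, q, k, v)
fused_time = time_fn(fused_causal_attention, q, k, v)
print(f"manual: {manual_time * 1e3:.3f} ms per call")
print(f"fused:  {fused_time * 1e3:.3f} ms per call")
```

Timings from a toy setup like this depend heavily on hardware, tensor sizes, and PyTorch version, so they are not a substitute for the notebook's benchmarks.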

The figures below summarize the performance benchmarks (lower is better).