books/LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2026-04-10 12:33:42 +00:00

Files

History

Sebastian Raschka 8c1f9ccf54 Improve MHA einsum (#775 )

2025-08-19 10:38:15 -05:00

..

mha-implementations.ipynb

Improve MHA einsum (#775 )

2025-08-19 10:38:15 -05:00

README.md

Einsum multi-head attention (#345 )

2024-09-05 18:24:33 +02:00

README.md

More Efficient Multi-Head Attention Implementations

mha-implementations.ipynb contains and compares different implementations of multi-head attention

Summary

The figures below summarize the performance benchmarks (lower is better).

Forward pass only

Forward and backward pass

Forward and backward pass after compilation