talentJay-ux
d7f178d28b
Sliding window KV Cache bug fix ( #925 )
...
1. Fix bug because of KV cache and GPT's ptr pointer doesn't get reset when window_size > context_length
2. Fix bug because of KV cache and GPT's ptr pointer doesn't get reset
3. Fix KV Cache import issue for gpt_with_kv_cache_optimized
2025-12-15 18:47:01 -06:00
Martin Ma
6522be94be
Fix bug in masking when kv cache is used. ( #697 )
...
* Fix bug in masking when kv cache is used.
* add tests
* dd tests
* upd
* add kv cache test to gh workflow
* explicit mask slicing
* upd
---------
Co-authored-by: rasbt <mail@sebastianraschka.com >
2025-06-23 13:12:56 -05:00
Sebastian Raschka
fdc3e1b701
Add GPT-2 KV cache to pkg ( #687 )
2025-06-21 12:29:04 -05:00
Sebastian Raschka
ece59ba587
Optimize KV cache ( #673 )
...
* Optimize KV cache
* style
* interpretable generate
* interpretable generate
* update readme
2025-06-16 16:00:50 -05:00
Sebastian Raschka
ba0370abd1
Optimized KV cache ( #672 )
...
* Optimized KV cache
* typo fix
2025-06-15 14:26:16 -05:00