talentJay-ux
d7f178d28b
Sliding window KV Cache bug fix ( #925 )
...
1. Fix bug because of KV cache and GPT's ptr pointer doesn't get reset when window_size > context_length
2. Fix bug because of KV cache and GPT's ptr pointer doesn't get reset
3. Fix KV Cache import issue for gpt_with_kv_cache_optimized
2025-12-15 18:47:01 -06:00
Sebastian Raschka
a11965fbd9
Remove persistent flag from cache buffers ( #916 )
2025-11-24 20:10:02 -06:00
Sebastian Raschka
2f53bf5fe5
Link the other KV cache sections ( #708 )
2025-06-24 16:52:29 -05:00
Sebastian Raschka
81eda38d3b
Improve KV cache code for torch.compile ( #705 )
...
* Improve KV cache code for torch.compile
* cleanup
* cleanup
2025-06-23 18:08:49 -05:00
Martin Ma
6522be94be
Fix bug in masking when kv cache is used. ( #697 )
...
* Fix bug in masking when kv cache is used.
* add tests
* dd tests
* upd
* add kv cache test to gh workflow
* explicit mask slicing
* upd
---------
Co-authored-by: rasbt <mail@sebastianraschka.com >
2025-06-23 13:12:56 -05:00
Shamik
f5bc863752
Update README.md ( #702 )
...
Typo in kv cache readme
2025-06-23 07:21:51 -05:00
Sebastian Raschka
fdc3e1b701
Add GPT-2 KV cache to pkg ( #687 )
2025-06-21 12:29:04 -05:00
Sebastian Raschka
ece59ba587
Optimize KV cache ( #673 )
...
* Optimize KV cache
* style
* interpretable generate
* interpretable generate
* update readme
2025-06-16 16:00:50 -05:00
Sebastian Raschka
ba0370abd1
Optimized KV cache ( #672 )
...
* Optimized KV cache
* typo fix
2025-06-15 14:26:16 -05:00
Sebastian Raschka
2af686d70b
Add KV cache ( #671 )
2025-06-15 09:58:08 -05:00