Sebastian Raschka
e155d1b02c
Update unit tests for CI ( #952 )
...
* Update CI
* Revert submodule pointer update
* Update
* update
* update
2026-01-27 17:44:55 -06:00
Pádraic Slattery
59d9262047
chore: Update outdated GitHub Actions versions ( #951 )
2026-01-19 12:22:29 -06:00
Sebastian Raschka
bc6f335526
Olmo 3 from scratch ( #914 )
...
* Olmo 3 from scratch
* update
* update
* update
2025-11-22 22:42:18 -06:00
Aviral Garg
27d52d6378
Fix MHAEinsum weight dimension bug when d_in != d_out ( #857 ) ( #893 )
...
* Fix MHAEinsum weight dimension bug when d_in != d_out (#857 )
Previously MHAEinsum initialized weight matrices with shape (d_out, d_in) and used inappropriate einsum notation, causing failures for non-square input-output dimensions. This commit corrects weight initialization to shape (d_in, d_out), updates einsum notation to 'bnd,do->bno', and adds three unit tests to verify parity across different d_in and d_out settings. All tests pass successfully.
* use pytest
* Update .gitignore
---------
Co-authored-by: rasbt <mail@sebastianraschka.com >
2025-10-31 21:45:31 -05:00
Sebastian Raschka
7bd263144e
Switch from urllib to requests to improve reliability ( #867 )
...
* Switch from urllib to requests to improve reliability
* Keep ruff linter-specific
* update
* update
* update
2025-10-07 15:22:59 -05:00
Sebastian Raschka
458f2d9b67
Test dependencies with Python 3.13 ( #843 )
...
* Custom python 3.13 entry in pyproject.toml
* amend
* update
* update
* update
* Update pyproject.toml
* Update pyproject.toml
* Update pyproject.toml
* update
2025-09-27 08:38:07 -05:00
Sebastian Raschka
f571b5e493
Add Gemma3 KV cache variant ( #776 )
...
* Add Gemma3 KV cache variant
* update
2025-08-19 12:37:49 -05:00
Sebastian Raschka
80d4732456
add HF equivalency tests for standalone nbs ( #774 )
...
* add HF equivalency tests for standalone nbs
* update
* update
* update
* update
2025-08-18 18:58:46 -05:00
Sebastian Raschka
a6b883c9f9
Gemma 3 270M From Scratch ( #771 )
...
* Gemma 3 270M From Scratch
* fix path
* update readme
2025-08-17 08:23:05 -05:00
Matthew Hernandez
6f12edb0cc
Fix issue: 731 by resolving semantic error ( #738 )
...
* fix issue 731
* update test path
---------
Co-authored-by: rasbt <mail@sebastianraschka.com >
2025-07-10 16:30:15 -05:00
Martin Ma
6522be94be
Fix bug in masking when kv cache is used. ( #697 )
...
* Fix bug in masking when kv cache is used.
* add tests
* dd tests
* upd
* add kv cache test to gh workflow
* explicit mask slicing
* upd
---------
Co-authored-by: rasbt <mail@sebastianraschka.com >
2025-06-23 13:12:56 -05:00
Sebastian Raschka
aedad7efc3
Add Llama 3.2 to pkg ( #591 )
...
* Add Llama 3.2 to pkg
* remove redundant attributes
* update tests
* updates
* updates
* updates
* fix link
* fix link
2025-03-31 18:59:47 -05:00
Sebastian Raschka
c21bfe4a23
Add PyPI package ( #576 )
...
* Add PyPI package
* fixes
* fixes
2025-03-23 19:28:49 -05:00
Sebastian Raschka
f63f04d8d5
Fix BPE bonus materials ( #561 )
...
* Fix BPE bonus materials
* fix bpe implementation
* update
* Add 'Hello, world. Is this-- a test?' test case
* update link to test file
* update path handling
* update path handling
* fix pytest paths
2025-03-08 17:21:30 -06:00
Sebastian Raschka
bca82a9443
Simplify the setup in CI ( #542 )
...
* Simplify the setup in CI
* windows --no-binary option
* windows uv workflow
* add path
* some fixes
* some fixes
* some fixes
* cleanup
* cleanup
* cleanup
* cleanup
* cleanup
* more windows issues
* more windows issues
* more windows issues
* more windows issues
* more windows issues
* more windows issues
* more windows issues
* more windows issues
* revert
2025-02-20 11:56:52 -06:00
Tim Hopper
1039bf9b80
Don't explicitly install Python. Use uv sync instead of uv add. ( #540 )
...
* Don't explicitly install Python. Use uv sync instead of uv add.
* updates
* update
* updated 3.11 workflows
---------
Co-authored-by: rasbt <mail@sebastianraschka.com >
2025-02-19 10:42:13 -06:00
rasbt
24f78865df
update badges
2025-02-17 12:00:46 -06:00