Bonus material: extending tokenizers (#496)

* Bonus material: extending tokenizers

* small wording update
This commit is contained in:
Sebastian Raschka
2025-01-22 09:26:54 -06:00
committed by GitHub
parent dce46038da
commit a22d612be6
7 changed files with 1224 additions and 2 deletions

View File

@@ -0,0 +1,3 @@
# Byte Pair Encoding (BPE) Tokenizer From Scratch
- [bpe-from-scratch.ipynb](bpe-from-scratch.ipynb) contains optional (bonus) code that explains and shows how the BPE tokenizer works under the hood.