Commit Graph

  • 0b998dff97 track tokens seen in chapter5, track examples seen in chapter6 (#319) TITC 2024-08-13 20:09:05 +08:00
  • 38390b2a8d track tokens seen in chapter5, track examples seen in chapter6 (#319) TITC 2024-08-13 20:09:05 +08:00
  • 6533ce63c1 fix code cell ordering rasbt 2024-08-12 19:04:05 -05:00
  • 5f0c55ddee fix code cell ordering rasbt 2024-08-12 19:04:05 -05:00
  • 75abb61f13 Small typo fix (#313) Jeroen Van Goey 2024-08-12 14:54:12 +02:00
  • 76e6910a1a Small typo fix (#313) Jeroen Van Goey 2024-08-12 14:54:12 +02:00
  • b3d550bfd5 Adds .vscode folder to .gitignore (#314) Eric Thomson 2024-08-12 08:49:11 -04:00
  • da5236ee72 Adds .vscode folder to .gitignore (#314) Eric Thomson 2024-08-12 08:49:11 -04:00
  • f58c2b969b update attention benchmarks (#307) Sebastian Raschka 2024-08-10 09:44:11 -05:00
  • 3f6652d87e update attention benchmarks (#307) Sebastian Raschka 2024-08-10 09:44:11 -05:00
  • 8d79fb13b0 Update README.md Sebastian Raschka 2024-08-10 07:54:51 -05:00
  • 7feb8cad86 Update README.md Sebastian Raschka 2024-08-10 07:54:51 -05:00
  • c91999b9f4 fixed bash command (#305) Daniel Kleine 2024-08-10 04:29:04 +02:00
  • 13dbc548f8 fixed bash command (#305) Daniel Kleine 2024-08-10 04:29:04 +02:00
  • 3067ed83dc remove all non-English texts and notice (#304) TITC 2024-08-10 06:09:14 +08:00
  • 09a3a73f2d remove all non-English texts and notice (#304) TITC 2024-08-10 06:09:14 +08:00
  • de277ebfe6 Update README.md Sebastian Raschka 2024-08-08 07:50:45 -05:00
  • f1c3d451fe Update README.md Sebastian Raschka 2024-08-08 07:50:45 -05:00
  • 4601219c32 Update README.md Sebastian Raschka 2024-08-08 07:47:31 -05:00
  • 81e9cea3d3 Update README.md Sebastian Raschka 2024-08-08 07:47:31 -05:00
  • 895512ebee Revert accidental edit Sebastian Raschka 2024-08-06 19:54:34 -05:00
  • c5eaae11b1 Revert accidental edit Sebastian Raschka 2024-08-06 19:54:34 -05:00
  • 7a5771932b note about logistic sigmoid rasbt 2024-08-06 19:48:30 -05:00
  • 06151a809e note about logistic sigmoid rasbt 2024-08-06 19:48:30 -05:00
  • 9ef132bb64 note about logistic sigmoid rasbt 2024-08-06 19:48:06 -05:00
  • 26df0c474c note about logistic sigmoid rasbt 2024-08-06 19:48:06 -05:00
  • 2245f8d9c1 extend equation description rasbt 2024-08-06 19:46:50 -05:00
  • e810f9f004 extend equation description rasbt 2024-08-06 19:46:50 -05:00
  • a65e06ff99 add more explanations rasbt 2024-08-06 19:45:11 -05:00
  • c8090f30ef add more explanations rasbt 2024-08-06 19:45:11 -05:00
  • b1a420c8ca Update README.md Sebastian Raschka 2024-08-06 08:02:01 -05:00
  • 98d24a1607 Update README.md Sebastian Raschka 2024-08-06 08:02:01 -05:00
  • 7374d617b4 total training iters may equal to warmup_iters (#301) TITC 2024-08-06 20:10:05 +08:00
  • d16527ddf2 total training iters may equal to warmup_iters (#301) TITC 2024-08-06 20:10:05 +08:00
  • 192bdc3501 improve gradient accumulation (#300) Sebastian Raschka 2024-08-05 18:27:20 -05:00
  • 70e5714556 improve gradient accumulation (#300) Sebastian Raschka 2024-08-05 18:27:20 -05:00
  • 089901db26 small figure update rasbt 2024-08-05 17:57:16 -05:00
  • 36fbc7aa74 small figure update rasbt 2024-08-05 17:57:16 -05:00
  • b39234fc25 Update README.md Sebastian Raschka 2024-08-05 17:47:06 -05:00
  • 50332cf75b Update README.md Sebastian Raschka 2024-08-05 17:47:06 -05:00
  • dcdf04e3bd minor DPO fixes (#298) Daniel Kleine 2024-08-05 15:40:46 +02:00
  • 8318d1f002 minor DPO fixes (#298) Daniel Kleine 2024-08-05 15:40:46 +02:00
  • 6030071e3f update model path rasbt 2024-08-05 07:36:08 -05:00
  • 36b9d5e0eb update model path rasbt 2024-08-05 07:36:08 -05:00
  • 22681878a8 Update ch05.ipynb (#297) SSebo 2024-08-05 20:12:27 +08:00
  • 7643c6c0c4 Update ch05.ipynb (#297) SSebo 2024-08-05 20:12:27 +08:00
  • e4ed2c91cc Update README.md Sebastian Raschka 2024-08-04 16:06:38 -05:00
  • 16e83434b5 Update README.md Sebastian Raschka 2024-08-04 16:06:38 -05:00
  • f302f5e8d5 improve latex rendering in dpo notebook rasbt 2024-08-04 09:19:54 -05:00
  • 60aada801b improve latex rendering in dpo notebook rasbt 2024-08-04 09:19:54 -05:00
  • 2c6cdb497f Update matplotlib tests on Windows (#295) Sebastian Raschka 2024-08-04 09:18:19 -05:00
  • e130ca293c Update matplotlib tests on Windows (#295) Sebastian Raschka 2024-08-04 09:18:19 -05:00
  • 09dc080cf3 Direct Preference Optimization from scratch (#294) Sebastian Raschka 2024-08-04 08:57:36 -05:00
  • 52435804eb Direct Preference Optimization from scratch (#294) Sebastian Raschka 2024-08-04 08:57:36 -05:00
  • 3ea0798d44 Update README.md Sebastian Raschka 2024-08-01 18:17:42 -05:00
  • ff7a6db212 Update README.md Sebastian Raschka 2024-08-01 18:17:42 -05:00
  • eabeab04d1 restructure into local and cloud setup rasbt 2024-07-31 06:59:04 -05:00
  • b5fc1a6061 restructure into local and cloud setup rasbt 2024-07-31 06:59:04 -05:00
  • b4b2ac668c Add video tutorial Sebastian Raschka 2024-07-30 06:57:46 -05:00
  • 1b100179c0 Add video tutorial Sebastian Raschka 2024-07-30 06:57:46 -05:00
  • 6c2d34b03d Update README.md Sebastian Raschka 2024-07-30 06:55:41 -05:00
  • f5a003744e Update README.md Sebastian Raschka 2024-07-30 06:55:41 -05:00
  • 358717870b add state_dict example rasbt 2024-07-28 14:15:32 -05:00
  • 0dad0a3c04 add state_dict example rasbt 2024-07-28 14:15:32 -05:00
  • ce33e706ba Fix 8-billion-parameter spelling rasbt 2024-07-28 10:48:56 -05:00
  • a7869ad2bf Fix 8-billion-parameter spelling rasbt 2024-07-28 10:48:56 -05:00
  • 60752e3b3a fixed typos and formatting (#291) Daniel Kleine 2024-07-28 17:04:33 +02:00
  • 9a3b04f92f fixed typos and formatting (#291) Daniel Kleine 2024-07-28 17:04:33 +02:00
  • 75a24c4897 Update README.md Sebastian Raschka 2024-07-28 09:28:11 -05:00
  • 9bf5d67d61 Update README.md Sebastian Raschka 2024-07-28 09:28:11 -05:00
  • 011ee83ea7 Update README.md Sebastian Raschka 2024-07-28 08:21:38 -05:00
  • 4f7f5bd443 Update README.md Sebastian Raschka 2024-07-28 08:21:38 -05:00
  • 1b86a61a2e buffer tutorial Sebastian Raschka 2024-07-27 17:06:16 -05:00
  • f4fc0ededd buffer tutorial Sebastian Raschka 2024-07-27 17:06:16 -05:00
  • f66c089f0b Test with PyTorch 2.0 and 2.4 (#290) Sebastian Raschka 2024-07-27 15:09:02 -05:00
  • 263eee8921 Test with PyTorch 2.0 and 2.4 (#290) Sebastian Raschka 2024-07-27 15:09:02 -05:00
  • 7bf0d13ee8 Update README.md Sebastian Raschka 2024-07-27 13:39:23 -05:00
  • 9a2a269530 Update README.md Sebastian Raschka 2024-07-27 13:39:23 -05:00
  • 7eb3f941a4 update figure caption rasbt 2024-07-27 13:30:18 -05:00
  • 9eb2e2a5f6 update figure caption rasbt 2024-07-27 13:30:18 -05:00
  • 4a0a8cd9e4 formatting rasbt 2024-07-27 09:51:24 -05:00
  • c87e4364b7 formatting rasbt 2024-07-27 09:51:24 -05:00
  • 2b7bd46a93 Generate preference dataset with Llama 3.1 70B (#289) Sebastian Raschka 2024-07-27 09:44:04 -05:00
  • 99af403b9f Generate preference dataset with Llama 3.1 70B (#289) Sebastian Raschka 2024-07-27 09:44:04 -05:00
  • 3da0f18da7 update rasbt 2024-07-27 07:12:42 -05:00
  • 7f1e071fff update rasbt 2024-07-27 07:12:42 -05:00
  • 1e873d4cbc Understanding PyTorch Buffers (#288) Sebastian Raschka 2024-07-26 08:45:36 -05:00
  • deea13e5c2 Understanding PyTorch Buffers (#288) Sebastian Raschka 2024-07-26 08:45:36 -05:00
  • 6dd8666d9c Test code in pytorch 2.4 (#285) Sebastian Raschka 2024-07-24 21:53:41 -05:00
  • 08040f024c Test code in pytorch 2.4 (#285) Sebastian Raschka 2024-07-24 21:53:41 -05:00
  • bce3a708f9 47,678-->48,725 (#281) TITC 2024-07-23 10:24:57 +08:00
  • 6cbe6520a2 47,678-->48,725 (#281) TITC 2024-07-23 10:24:57 +08:00
  • 5d8a0c22ab use utf-8 encoding in json rasbt 2024-07-22 07:01:54 -05:00
  • 46fcde15d8 use utf-8 encoding in json rasbt 2024-07-22 07:01:54 -05:00
  • 3f6f2af3a3 Simplify embedding vs linear layer code (#278) Sebastian Raschka 2024-07-21 10:21:10 -07:00
  • fa56c80402 Simplify embedding vs linear layer code (#278) Sebastian Raschka 2024-07-21 10:21:10 -07:00
  • 7356020b44 Update README.md Sebastian Raschka 2024-07-21 10:42:28 -05:00
  • bbe09e9799 Update README.md Sebastian Raschka 2024-07-21 10:42:28 -05:00
  • b80b14cf09 Update README.md Sebastian Raschka 2024-07-21 10:41:53 -05:00
  • 0d28079a8a Update README.md Sebastian Raschka 2024-07-21 10:41:53 -05:00