mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-10 12:33:42 +00:00
minor DPO fixes (#298)
* fixed issues, updated .gitignore * added closing paren * fixed CEL spelling * fixed more minor issues * Update ch07/01_main-chapter-code/ch07.ipynb * Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb * Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb * Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb --------- Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
This commit is contained in:
2
.gitignore
vendored
2
.gitignore
vendored
@@ -85,6 +85,8 @@ ch07/01_main-chapter-code/instruction-data-with-response-alpaca52k.json
|
||||
ch07/01_main-chapter-code/instruction-data-with-response-lora.json
|
||||
ch07/01_main-chapter-code/instruction-data-with-response-phi3-prompt.json
|
||||
ch07/02_dataset-utilities/instruction-examples-modified.json
|
||||
ch07/04_preference-tuning-with-dpo/gpt2-medium355M-sft.pth
|
||||
ch07/04_preference-tuning-with-dpo/loss-plot.pdf
|
||||
|
||||
# Temporary OS-related files
|
||||
.DS_Store
|
||||
|
||||
Reference in New Issue
Block a user