fix reward margins plot label in dpo nb

This commit is contained in:
rasbt
2025-01-12 14:04:05 -06:00
parent 4bfbcd069d
commit b524afe3da

File diff suppressed because one or more lines are too long