Fix reward margins plot label in DPO notebook

rasbt
2025-01-12 14:04:05 -06:00
parent 992f3068d1
commit bed5f89378

File diff suppressed because one or more lines are too long