|
8e2d5516ca
|
Add accuracy reward (#4270)
Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com>
|
2025-10-15 18:01:07 -06:00 |
|
|
45ee98b05e
|
Replace unittest with pytest (#4188)
|
2025-10-06 11:14:54 +02:00 |
|
|
f5b1ed24a0
|
⏳ Replaced unittest.TestCase with TrlTestCase that handles tmp dir (#3863)
|
2025-08-12 12:37:19 -07:00 |
|
|
b4c418110c
|
💇 Add soft overlong punishment reward function and update documentation (#3804)
|
2025-08-12 10:58:41 -07:00 |
|
|
54d4f6b13a
|
🎁 Reward submodule (#3430)
|
2025-05-15 19:10:22 -07:00 |
|