trl/docs/source
behroozazarkhalili 56180a5e26 Remove BestOfNSampler class
This removes the BestOfNSampler class as it's considered out of scope for the TRL library. The class provided a "Best of N" sampling strategy for PPO fine-tuning, but this functionality is not a core part of TRL's reinforcement learning training capabilities.

Changes:
- Removed implementation: trl/extras/best_of_n_sampler.py
- Removed tests: tests/test_best_of_n_sampler.py
- Removed documentation: docs/source/best_of_n.md
- Removed example notebook: examples/notebooks/best_of_n.ipynb
- Updated imports: trl/__init__.py, trl/extras/__init__.py
- Updated documentation references: docs/source/_toctree.yml, docs/source/example_overview.md, examples/notebooks/README.md
2025-10-11 06:52:30 -07:00
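
For readers unfamiliar with the strategy the removed class provided, the following is a minimal, library-free sketch of the general "Best of N" idea: draw N candidate completions for a prompt, score each one, and keep the highest-scoring candidates. It is not the removed `BestOfNSampler` implementation and does not reproduce its API. The names `best_of_n`, `generate`, `score`, and `make_toy_generator` are hypothetical and exist only for illustration.

```python
import itertools
from typing import Callable, Iterable, List, Tuple


def best_of_n(
    prompt: str,
    generate: Callable[[str], str],
    score: Callable[[str, str], float],
    n: int = 4,
    top_k: int = 1,
) -> List[Tuple[str, float]]:
    """Sample n completions for a prompt and return the top_k by score.

    `generate` and `score` are hypothetical callables standing in for a
    language model and a reward model; they are assumptions, not TRL APIs.
    """
    if n < 1 or not 1 <= top_k <= n:
        raise ValueError("require n >= 1 and 1 <= top_k <= n")
    # Draw n independent candidates for the same prompt.
    candidates = [generate(prompt) for _ in range(n)]
    # Pair each candidate with its score.
    scored = [(c, score(prompt, c)) for c in candidates]
    # Highest score first; sorting is stable, so ties keep sampling order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


def make_toy_generator(completions: Iterable[str]) -> Callable[[str], str]:
    """Return a deterministic stand-in generator that cycles through completions."""
    cycle = itertools.cycle(completions)

    def generate(prompt: str) -> str:
        return next(cycle)

    return generate


if __name__ == "__main__":
    toy_generate = make_toy_generator(["a", "abc", "ab"])
    # Toy scorer: longer completions score higher.
    toy_score = lambda prompt, completion: float(len(completion))
    print(best_of_n("Hello", toy_generate, toy_score, n=3, top_k=2))
```

In practice the generator would be a sampling call on a language model and the scorer a reward model; both are kept as plain callables here so the selection logic stays visible.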