|
9955ee7eaa
|
🐳 Docker update + Simplify Jobs doc (#3931)
Co-authored-by: sergiopaniego <sergiopaniegoblanco@gmail.com>
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
|
2025-09-13 18:35:55 -06:00 |
|
|
07f9ad982d
|
💡 Fix type hint to make_parser function in multiple scripts (#4050)
|
2025-09-11 11:36:05 -06:00 |
|
|
0c69fd2867
|
👷 Added Kernels on the Hub x TRL guide (#3969)
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
|
2025-09-04 15:37:02 +02:00 |
|
|
0c91515b58
|
🧭 HF jobs x TRL guide (#3890)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com>
|
2025-08-26 21:44:29 -07:00 |
|
|
48d7ecc67b
|
🗑️ Deprecate setup_chat_format (#3929)
|
2025-08-20 14:06:23 -07:00 |
|
|
8793a46760
|
🧾 Use logger.warning instead of warnings.warn (#3923)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
|
2025-08-20 09:20:09 -07:00 |
|
|
72d4d82b8c
|
🎚️ Add dataset mixer (#3791)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com>
|
2025-08-11 20:14:50 -07:00 |
|
|
eee9ec94ef
|
Update missing uv dep (#3772)
|
2025-07-25 08:00:03 -07:00 |
|
|
a043fd74a3
|
Add uv scripts headers (#3767)
|
2025-07-25 07:48:40 -07:00 |
|
|
dffd1acb94
|
👋 Remove --bf16 flag from training scripts (#3724)
Co-authored-by: Shirin Yamani <75791599+shirinyamani@users.noreply.github.com>
|
2025-07-11 18:20:15 -07:00 |
|
|
e4b586a389
|
👔 Apply doc-builder style (#3615)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
|
2025-06-19 12:02:51 +02:00 |
|
|
ed9b78a5f7
|
🗳️ Remove logging_steps parameter for simpler setup (#3612)
|
2025-06-18 13:52:21 +02:00 |
|
|
9df19e8a75
|
📜 Fix license and copyrights (#3264)
|
2025-04-08 15:22:58 -07:00 |
|
|
1d23ecc36f
|
©️ Update copyrights year (#2547)
* happy new year
* fix wandb import sort
|
2025-01-07 14:53:09 +01:00 |
|
|
ca850be0a2
|
🕹️ CLI refactor (#2380)
* Refactor main function in dpo.py
* Update setup.py and add cli.py
* Add examples to package data
* style
* Refactor setup.py file
* Add new file t.py
* Move dpo to package
* Update MANIFEST.in and setup.py, refactor trl/cli.py
* Add __init__.py to trl/scripts directory
* Add license header to __init__.py
* File moved instruction
* Add Apache License and update file path
* Move dpo.py to new location
* Refactor CLI and DPO script
* Refactor import structure in scripts package
* env
* rm config from chat arg
* rm old cli
* chat init
* test cli [skip ci]
* Add `dataset_config_name` to `ScriptArguments` (#2440)
* add missing arg
* Add test cases for 'trl sft' and 'trl dpo' commands
* Add sft.py script and update cli.py to include sft command
* Move sft script
* chat
* style [ci skip]
* kto
* rm example config
* first step on doc
* see #2442
* see #2443
* fix chat windows
* ©️ Copyrights update (#2454)
* First changes
* Other files
* Finally
* rm comment
* fix nashmd
* Fix example
* Fix example [ci skip]
* 💬 Fix chat for windows (#2443)
* fix chat for windows
* add some tests back
* Revert "add some tests back"
This reverts commit 350aef52f53f8cf34fccd7ad0f78a3dd63867e06.
* 🆔 Add `dataset_config` to `ScriptArguments` (#2440)
* dataset_config_name
* Update trl/utils.py [ci skip]
* sort import
* typo [ci skip]
* Trigger CI
* Rename `dataset_config_name` to `dataset_config`
* 🏎 Fix deepspeed preparation of `ref_model` in `OnlineDPOTrainer` (#2417)
* Remove unused deepspeed code
* add model prep back
* add deepspeed even if it doesn't work
* rm old code
* Fix config name
* Remove `make dev` in favor of `pip install -e .[dev]`
* Update script paths and remove old symlink related things
* Fix chat script path [ci skip]
* style
|
2024-12-13 17:52:23 +01:00 |
|