🕹️ CLI refactor (#2380)

mirror of https://github.com/huggingface/trl.git synced 2025-10-21 11:33:51 +08:00

* Refactor main function in dpo.py

* Update setup.py and add cli.py

* Add examples to package data

* style

* Refactor setup.py file

* Add new file t.py

* Move dpo to package

* Update MANIFEST.in and setup.py, refactor trl/cli.py

* Add __init__.py to trl/scripts directory

* Add license header to __init__.py

* File moved instruction

* Add Apache License and update file path

* Move dpo.py to new location

* Refactor CLI and DPO script

* Refactor import structure in scripts package

* env

* rm config from chat arg

* rm old cli

* chat init

* test cli [skip ci]

* Add `datast_config_name` to `ScriptArguments` (#2440)

* add missing arg

* Add test cases for 'trl sft' and 'trl dpo' commands

* Add sft.py script and update cli.py to include sft command

* Move sft script

* chat

* style [ci skip]

* kto

* rm example config

* first step on doc

* see #2442

* see #2443

* fix chat windows

* ©️ Copyrights update (#2454)

* First changes

* Other files

* Finally

* rm comment

* fix nashmd

* Fix example

* Fix example [ci skip]

* 💬 Fix chat for windows (#2443)

* fix chat for windows

* add some tests back

* Revert "add some tests back"

This reverts commit 350aef52f53f8cf34fccd7ad0f78a3dd63867e06.

* 🆔 Add `datast_config` to `ScriptArguments` (#2440)

* datast_config_name

* Update trl/utils.py [ci skip]

* sort import

* typo [ci skip]

* Trigger CI

* Rename `dataset_config_name` to `dataset_config`

* 🏎 Fix deepspeed preparation of `ref_model` in `OnlineDPOTrainer` (#2417)

* Remove unused deepspeed code

* add model prep back

* add deepspeed even if it doesn't work

* rm old code

* Fix config name

* Remove `make dev` in favor of `pip install -e .[dev]`

* Update script paths and remove old symlink related things

* Fix chat script path [ci skip]

* style

This commit is contained in:

Quentin Gallouédec

2024-12-13 17:52:23 +01:00

committed by

GitHub

parent 179ba53671

commit ca850be0a2

36 changed files with 1105 additions and 921 deletions

									
										4

examples/scripts/rloo/rloo.py
									
												View File
												
				@ -71,9 +71,7 @@ if __name__ == "__main__":

				    # Model & Tokenizer

				    ################

				    tokenizer = AutoTokenizer.from_pretrained(

				        model_args.model_name_or_path,

				        padding_side="left",

				        trust_remote_code=model_args.trust_remote_code,

				        model_args.model_name_or_path, padding_side="left", trust_remote_code=model_args.trust_remote_code

				    )

				    tokenizer.add_special_tokens({"pad_token": "[PAD]"})

				    if tokenizer.chat_template is None:

🕹️ CLI refactor (#2380)

4 examples/scripts/rloo/rloo.py Unescape Escape View File

4

examples/scripts/rloo/rloo.py

View File