mirror of
https://github.com/huggingface/trl.git
synced 2025-10-20 18:43:52 +08:00
* first piece of doc * improve readibility * some data utils and doc * simplify prompt-only * format * fix path data utils * fix example format * simplify * tests * prompt-completion * update antropic hh * update dataset script * implicit prompt * additional content * `maybe_reformat_dpo_to_kto` -> `unpair_preference_dataset` * Preference dataset with implicit prompt * unpair preference dataset tests * documentation * ... * doc * changes applied to dpo example * better doc and better log error * a bit more doc * improve doc * converting * some subsections * converting section * further refinements * tldr * tldr preference * rename * lm-human-preferences-sentiment * `imdb` to `stanfordnlp/imdb` * Add script for LM human preferences descriptiveness * Remove sentiment_descriptiveness.py script * style * example judge tlrd with new dataset * Syle * Dataset conversion for TRL compatibility * further refinements * trainers in doc * top level for functions * stanfordnlp/imdb * downgrade transformers * temp reduction of tests * next commit * next commit * additional content * proper tick format * precise the assistant start token * improve * lower case * Update titles in _toctree.yml and data_utils.mdx * revert make change * correct dataset ids * expand a bit dataset formats * skip gated repo tests * data utilities in API * Update docs/source/dataset_formats.mdx Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Update docs/source/dataset_formats.mdx Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Update docs/source/dataset_formats.mdx Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Update docs/source/dataset_formats.mdx Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * tiny internal testing for chat template testing * precise type/format * exlude sft trainer in doc * Update trl/trainer/utils.py * XPO in the doc --------- Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
16 lines
267 B
Plaintext
16 lines
267 B
Plaintext
## Data Utilities
|
|
|
|
[[autodoc]] is_conversational
|
|
|
|
[[autodoc]] apply_chat_template
|
|
|
|
[[autodoc]] maybe_apply_chat_template
|
|
|
|
[[autodoc]] extract_prompt
|
|
|
|
[[autodoc]] maybe_extract_prompt
|
|
|
|
[[autodoc]] unpair_preference_dataset
|
|
|
|
[[autodoc]] maybe_unpair_preference_dataset
|