mirror of
https://github.com/huggingface/trl.git
synced 2025-10-20 18:43:52 +08:00
* v1 slow tests * nit * add qlora tests for DPO * add decorator * release memory + log reports * report to none to avoid seg fault issues * update setup * fix * add exampel testing * fix nit * change temp filename * add workflow file * fix comment * add slack push script * more tests for DPO * add dpo example tests * another makefile command * fix * add paths + clean up * nit * Update slow-tests.yml * trigger tests * up * up * more fixes * fix * final fixes * minor fixes * oops * add more text * fix * more * trigger CI * up * fix * remove * run the tests on 2 GPUs only * final fix SFT * revert config files + address comments * fix * add Phi * final fixes * final fix