[Feature request] Add ORPO finetuning #1

s-kostyaev · 2024-04-15T13:21:33Z

Hi @armbues, thank you for this great project.

Please add ORPO finetuning to do SFT and DPO in one step https://arxiv.org/abs/2403.07691

armbues · 2024-04-15T13:25:42Z

Great idea! I will add this to the roadmap.

armbues · 2024-04-25T10:51:45Z

The official implementation of the "ORPO Trainer" can be found here.

armbues added the enhancement New feature or request label Apr 15, 2024

armbues self-assigned this Apr 15, 2024

Provide feedback