Hi @armbues, thank you for this great project.
Please add ORPO fine-tuning, which combines SFT and preference alignment (normally a separate DPO stage) into a single training step: https://arxiv.org/abs/2403.07691
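For context, the ORPO objective in the linked paper is the usual SFT loss on the chosen response plus a weighted odds-ratio penalty that pushes the chosen response's odds above the rejected one's. Here is a rough sketch in plain Python, assuming the length-normalized (average per-token) log-probabilities of both responses are already computed. The function names and the default weight `lam` are illustrative assumptions, not taken from this project or any library.

```python
import math

def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    avg_logp is the average per-token log-probability of a response,
    so it must be strictly negative.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))

def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    """Sketch of the ORPO loss for one (chosen, rejected) pair.

    lam is the weight on the odds-ratio term; 0.1 is an assumed default.
    """
    # SFT term: negative log-likelihood of the chosen response.
    sft_term = -chosen_avg_logp

    # Log odds ratio between the chosen and rejected responses.
    log_ratio = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)

    # -log(sigmoid(log_ratio)), written as a numerically stable softplus(-log_ratio).
    z = -log_ratio
    odds_ratio_term = max(z, 0.0) + math.log1p(math.exp(-abs(z)))

    return sft_term + lam * odds_ratio_term
```

Because both terms come from the same forward pass of the same model, no frozen reference model is needed, which is what makes the single-step setup attractive.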
Great idea! I will add this to the roadmap.
The official implementation of the "ORPO Trainer" can be found here.