Quality based routing #2213
JustinGuese
started this conversation in
Ideas
Replies: 1 comment
-
An even simpler implementation would be setting the model to "auto", and then gpt4free automatically tries all from best to worst, according to e.g. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
How about we would offer a "quality based routing", like instead of setting the model, one would set something like
and in the backend it will map to try the models in that order, e.g.
Why?
Because I'd love to try gpt-4o first, but it's not always available or I am reaching a rate limit. If it's not available I want to try 4, 4-turbo and so on.
Same for if I just want a quick result, where I'd be willing to have the same done for 3.5-turbo and so on....
Currently one would have to implement the logic in their own app, like try 4o, if it doesn't work, try 4 and so on. Would save me a lot of work ;)
What do you think?
Beta Was this translation helpful? Give feedback.
All reactions