I’m building an AI agent that can perform multiple steps in one go and use many tools. Evals are a standard way to test one-shot or CoT-based generations, but the docs offer no clear best practices for testing such agents, especially when using the `streamText` and `streamObject` functions to make the actual requests to LLMs.

I would appreciate some guidance here.