-
Notifications
You must be signed in to change notification settings - Fork 137
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
prompt for agent evaluation tasks #32
Comments
Hi, we have updated the online inference logic here: https://github.com/bytedance/UI-TARS/blob/feat/oswd_infer/infer/osworld.py Meantime, we are supporting UI-TARS model in the original OSWorld repo, please stay tune. |
能提一个请求么,上架火山引擎,你们这个视觉模型真的很好,但国内真的好难用到….
发自我的iPhone
…------------------ Original ------------------
From: Shihao Liang ***@***.***>
Date: Sat,Feb 8,2025 10:30 AM
To: bytedance/UI-TARS ***@***.***>
Cc: Subscribed ***@***.***>
Subject: Re: [bytedance/UI-TARS] prompt for agent evaluation tasks (Issue #32)
Hi, we have updated the online inference logic here: https://github.com/bytedance/UI-TARS/blob/feat/oswd_infer/infer/osworld.py
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
火山暂时不支持,readme的部署文档有魔搭+阿里云一键部署的方案 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Thanks for the nice work!
I am wondering whether the prompts shared in the readme is also used for online / offline agent evaluations reported in the paper.
Specifically, I tried the prompt template for the computer. On the one hand, the grounding is very accurate. On the other hand, however, the model can easily stuck in some states, for example, keep issuing actions like click non-interactive elements like a string. Like the prompt suggested, I indeed add the history of the actions performed as part of the user instruction. This is not helpful in resolving the issue.
Provided the SOTA performances on the online / offline agent benchmarks, I would like to learn what else I could potentially miss.
Thanks.
More details are provided below.
The text was updated successfully, but these errors were encountered: