You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for your great work in enhancing LLMs' reasoning ability.
I tried to reproduce the result you posted in the technical report, but I observed a gap between my result and the result on paper, could you please open your evaluation code and some settings of your inference prompts.
The text was updated successfully, but these errors were encountered:
Hello! We appreciate your interest.
You can find our inference prompts at this link, and our evaluation codes are available here. It's important to note that in the code for STILL-3-1.5B-Preview, we utilized a sampling decoding setup with a sampling temperature of 0.6 and a top-p sampling probability of 0.95. Each question was sampled 64 times, and the average score was computed. To reproduce our results in STILL-2, however, we recommend using a greedy search.
Thanks for your great work in enhancing LLMs' reasoning ability.
I tried to reproduce the result you posted in the technical report, but I observed a gap between my result and the result on paper, could you please open your evaluation code and some settings of your inference prompts.
The text was updated successfully, but these errors were encountered: