I tried madlad400, but there is a problem with the output if it is float16 #980
Comments
Indeed, T5 models typically don't work well in fp16. They probably need some kind of activation clipping or rescaling to fix this.
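To make the clipping idea concrete, here is a minimal NumPy sketch of why large activations break in fp16 and how clamping before the cast avoids it. It is only an illustration: the helper `clip_to_fp16_range`, its `margin` parameter, and the sample activation values are made up for this example and are not taken from the madlad400 or T5 code.

```python
import numpy as np

# Largest finite value representable in float16.
FP16_MAX = float(np.finfo(np.float16).max)  # 65504.0


def clip_to_fp16_range(x, margin=1000.0):
    """Clamp values so a later cast to float16 cannot overflow to inf.

    `margin` is an arbitrary safety gap chosen for this sketch.
    """
    limit = FP16_MAX - margin
    return np.clip(x, -limit, limit)


# Hypothetical float32 activations, two of them outside the fp16 range.
acts = np.array([1.5, -70000.0, 123456.0, 300.0], dtype=np.float32)

# Casting directly overflows the large entries to -inf / +inf.
with np.errstate(over="ignore"):
    naive = acts.astype(np.float16)

# Clamping first keeps every entry finite after the cast.
clipped = clip_to_fp16_range(acts).astype(np.float16)

print(np.isinf(naive))
print(np.isfinite(clipped).all())
```

Once a single activation becomes inf, later layers tend to turn it into NaN, which is one plausible way to end up with garbage output. Clipping trades that failure for a small distortion of the largest values.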
Thanks.
My machine is low on memory, so it swaps and runs slowly, haha. The file size is still large, so it would be nice if the model could also be used in int8.
Thanks.
Do you mind uploading your madlad-400 mlx model to HF?
Please don't worry about it.
No, I mean I want to use it, and that would be easier if it's on HF.
Hi.
I tried madlad400, but there is a problem with the output when it runs in float16.
Thanks.