Architecture Requests for Mamba #1030
Comments
We have Mamba in MLX LM already, and there is a PR for Mamba 2 (#1009). As for the others, it would be helpful if you could point to Hugging Face repos for each model type. We can consider adding them on an ongoing basis.
Mamba: https://huggingface.co/tiiuae/falcon-mamba-7b
Zamba2 7b was just released. One of the best models of its size, it outperforms Llama3.2 11b and Mistral 7b in almost every benchmark.
I would like support for the following architectures:
These architectures are becoming quite common now and are supported by most major LLM libraries.