llamagaurd3 identifies code as 'violent crime' #63

Open
visagansanthanam-unisys opened this issue Oct 23, 2024 · 3 comments

Comments

@visagansanthanam-unisys

I am trying to use Llama Guard 3 for a use case, and I see that the model flags any source code as unsafe under the violent crime category. Is this expected behavior?

image

@EricMichaelSmith
Contributor

Hi @visagansanthanam-unisys, no, this is not expected behavior. Can you give us other examples of this?

@visagansanthanam-unisys
Author

@EricMichaelSmith, here are some more examples:
image
image
However, the 8B model (llamaguard3:latest) seems to be working fine:
image

@kplawiak
Member

kplawiak commented Nov 1, 2024

Hi @visagansanthanam-unisys, the two models (Llama Guard 3 1B and 8B) differ in their training data and underlying base models. Specifically, the 1B model was not trained on the code interpreter abuse category, which can limit how well it handles code input.
For more information on the training process and model limitations, please refer to the Llama Guard 3 1B model card (https://github.com/meta-llama/PurpleLlama/blob/main/Llama-Guard3/1B/MODEL_CARD.md) and the Llama Guard 3 8B model card (https://github.com/meta-llama/PurpleLlama/blob/main/Llama-Guard3/8B/MODEL_CARD.md).
Additionally, we recommend checking out the Llama Guard documentation (https://www.llama.com/docs/model-cards-and-prompt-formats/llama-guard-3/) for more examples (e.g., how to format the input before passing it to the model).
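
For anyone comparing the 1B and 8B outputs on the same code snippets, a minimal sketch of turning the raw verdict text into readable category names may help. It assumes the model replies with `safe`, or with `unsafe` followed by a line of comma-separated category codes. The code-to-name table is an assumption based on the 8B model card and should be checked against the card for the model you run. The helper name `parse_verdict` is hypothetical and not part of any Llama Guard tooling.

```python
# Hypothetical helper: not part of any Llama Guard library.
# Category names are assumed from the Llama Guard 3 8B model card;
# verify them against the model card for the model you actually run.
HAZARD_CATEGORIES = {
    "S1": "Violent Crimes",
    "S2": "Non-Violent Crimes",
    "S3": "Sex-Related Crimes",
    "S4": "Child Sexual Exploitation",
    "S5": "Defamation",
    "S6": "Specialized Advice",
    "S7": "Privacy",
    "S8": "Intellectual Property",
    "S9": "Indiscriminate Weapons",
    "S10": "Hate",
    "S11": "Suicide & Self-Harm",
    "S12": "Sexual Content",
    "S13": "Elections",
    "S14": "Code Interpreter Abuse",
}


def parse_verdict(raw: str) -> tuple[bool, list[str]]:
    """Return (is_safe, category_names) from the raw model reply."""
    # Drop blank lines and surrounding whitespace the model may emit.
    lines = [line.strip() for line in raw.splitlines() if line.strip()]
    if not lines:
        raise ValueError("empty model output")

    verdict = lines[0].lower()
    if verdict == "safe":
        return True, []
    if verdict != "unsafe":
        raise ValueError(f"unexpected verdict: {lines[0]!r}")

    # The second line, if present, holds comma-separated codes such as "S1,S14".
    codes = []
    if len(lines) > 1:
        codes = [code.strip() for code in lines[1].split(",") if code.strip()]
    names = [HAZARD_CATEGORIES.get(code, f"unknown ({code})") for code in codes]
    return False, names


if __name__ == "__main__":
    # Example: the kind of reply reported in this issue for a code snippet.
    print(parse_verdict("unsafe\nS1"))
```

Logging the parsed result for the same prompts against both models makes it easier to see whether the 1B model consistently maps code input to one category.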
