Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mlx generate text, by default halusonates more #947

Open
kishoretvk opened this issue Aug 21, 2024 · 3 comments
Open

Mlx generate text, by default halusonates more #947

kishoretvk opened this issue Aug 21, 2024 · 3 comments

Comments

@kishoretvk
Copy link

kishoretvk commented Aug 21, 2024

Issue Description
Problem: When using the mix generate text command with verbose set to false, and the following parameters:

Temperature: 0.1 or 0
Top p: 1
The LLM models seem to hallucinate more frequently.

Models Used:

Mistral 7B v3
IBM Granite
Mistral 7B v2
Use Case: Text-to-SQL

Method Used: mlx.generate_text

Steps to Reproduce
Set up the environment with the specified models.
Run the mix generate text command with verbose set to false.
Use a temperature of 0.1 or 0 and top p of 1.
Observe the output for hallucinations.
Expected Behavior
The models should generate accurate and coherent SQL queries without hallucinations.

Actual Behavior
The models produce outputs that are factually incorrect or nonsensical, indicating hallucinations.

Additional Information
Dataset: 20,000 examples split for training and validation.
Hardware: 128GB M3 Max

@awni
Copy link
Member

awni commented Aug 21, 2024

Could you share the commands you used and the precise model paths?

Hardware: 40 GB A100 GPU.

Also MLX is meant for Apple silicon.. is that a typo or were you running it on a linux machine?

@kishoretvk
Copy link
Author

Sorry my bad 128 gb ram
M3 max

@awni
Copy link
Member

awni commented Aug 21, 2024

How about the commands / model paths?

Also when you say "by default halusonates more" what are you comparing against?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants