Issue Description
Problem: When using the mlx generate text command with verbose set to false and the following parameters:
Temperature: 0.1 or 0
Top p: 1
The models seem to hallucinate more frequently with these settings.
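For context on what these two parameters usually control, here is a minimal, library-independent sketch of temperature and top-p (nucleus) sampling. It is not the MLX implementation; the function name `sample_token` and the toy logits are hypothetical and only illustrate the general technique. Under this common definition, a temperature of 0 is greedy decoding and a top p of 1 keeps every token, so the settings above make generation deterministic or close to it.

```python
import math
import random

def sample_token(logits, temperature, top_p, rng=random):
    """Pick a token index from raw logits using temperature and top-p sampling.

    Hypothetical helper for illustration only; not taken from MLX.
    """
    # Temperature 0 is treated as greedy decoding: always take the highest logit.
    if temperature == 0:
        return max(range(len(logits)), key=lambda i: logits[i])

    # Softmax over temperature-scaled logits (subtract the max for stability).
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Keep the smallest set of most-likely tokens whose cumulative
    # probability reaches top_p. With top_p = 1 every token is kept.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Sample from the kept tokens in proportion to their probabilities.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]


# Toy logits (assumed values): token 0 is the model's top choice.
toy_logits = [2.0, 1.0, 0.1]
greedy_choice = sample_token(toy_logits, temperature=0, top_p=1.0)
```

If the library follows this definition, wrong output at temperature 0 is the model's highest-probability continuation rather than sampling noise, so the prompt format and the fine-tuned weights may be worth checking alongside the sampling settings.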
Models Used:
Mistral 7B v3
IBM Granite
Mistral 7B v2
Use Case: Text-to-SQL
Method Used: mlx.generate_text
Steps to Reproduce
Set up the environment with the specified models.
Run the mlx generate text command with verbose set to false.
Use a temperature of 0.1 or 0 and top p of 1.
Observe the output for hallucinations.
Expected Behavior
The models should generate accurate and coherent SQL queries without hallucinations.
Actual Behavior
The models produce outputs that are factually incorrect or nonsensical, indicating hallucinations.
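One way to make "hallucination" measurable for Text-to-SQL is to check whether each generated query compiles against the target schema. The sketch below uses Python's standard `sqlite3` module and assumes the schema can be expressed as SQLite DDL; the function name `check_sql_against_schema` and the `users` table are hypothetical and are not part of the reported setup. It only catches references to tables or columns that do not exist and syntax errors, not queries that are valid but answer the wrong question.

```python
import sqlite3

def check_sql_against_schema(schema_ddl, query):
    """Return (True, None) if SQLite can plan the query against the schema,
    otherwise (False, error message). Hypothetical helper for illustration."""
    conn = sqlite3.connect(":memory:")
    try:
        # Build empty tables from the DDL; no data is needed.
        conn.executescript(schema_ddl)
        # EXPLAIN QUERY PLAN compiles the query without running it,
        # so unknown tables and columns raise an error here.
        conn.execute("EXPLAIN QUERY PLAN " + query)
        return True, None
    except (sqlite3.Error, sqlite3.Warning) as exc:
        return False, str(exc)
    finally:
        conn.close()


# Assumed example schema and model outputs.
schema = "CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT);"
valid = check_sql_against_schema(schema, "SELECT name FROM users WHERE id = 1")
invented_column = check_sql_against_schema(schema, "SELECT email FROM users")
```

Running this over the validation split would give a count of schema-invalid outputs per model and per sampling setting, which makes comparisons between runs more concrete.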
Additional Information
Dataset: 20,000 examples split for training and validation.
Hardware: 128GB M3 Max