Skip to content

Commit

Permalink
Update bench_engines.py
Browse files Browse the repository at this point in the history
  • Loading branch information
zhudotexe committed Oct 15, 2024
1 parent ae8f90c commit 5ae9a29
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion bench_engines.py
Original file line number Diff line number Diff line change
Expand Up @@ -44,7 +44,7 @@ def get_engine(model_id: str, context_size: int = None):
"tensor_parallel_size": 8,
"tokenizer_mode": "auto",
# for more stability
"gpu_memory_utilization": 0.7,
"gpu_memory_utilization": 0.8,
"enable_prefix_caching": True,
},
sampling_params=SamplingParams(temperature=0, max_tokens=2048),
Expand Down

0 comments on commit 5ae9a29

Please sign in to comment.