-
Notifications
You must be signed in to change notification settings - Fork 426
Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] 请问function calling是否只适配本项目的后端servcer
awaiting response
#2747
opened Nov 13, 2024 by
positive666
[Bug] Cannot install torch-npu==2.3.1, torch==2.3.1 and torchvision==0.18.1 because these package versions have conflicting dependencies.
#2745
opened Nov 13, 2024 by
jiabao-wang
3 tasks
[Bug] Accuracy of W8A8 is big different from that of the original model
#2730
opened Nov 9, 2024 by
HelloCard
3 tasks done
[Bug] How to improve the first frame response speed TTFT
awaiting response
#2728
opened Nov 8, 2024 by
zhouyuustc
3 tasks done
[Bug] Deployment of Llama3.1-70b getting struck
#2724
opened Nov 7, 2024 by
pulkitmehtaworkmetacube
3 tasks done
lmdeploy - ERROR - __init__.py:17 - ModuleNotFoundError: No module named 'dlinfer'
awaiting response
#2722
opened Nov 7, 2024 by
jiabao-wang
3 tasks
[Bug] 并发场景下,发起大的输入token请求时会导致流式响应出现问题
awaiting response
#2709
opened Nov 5, 2024 by
zhouyuustc
3 tasks done
[Bug] InternVL2-1B performance of lmdeploy is much worse compared to the original Hugging Face PyTorch model.
#2705
opened Nov 4, 2024 by
henry16lin
3 tasks done
[Bug] pytorch backend 's precision points loss 1.0-2.5 points between main code and v0.6.1 on some models.
#2679
opened Oct 29, 2024 by
zhulinJulia24
3 tasks
[Bug] no user input makes api server throw exception with MLLM
#2658
opened Oct 25, 2024 by
gaord
3 tasks
[Bug] use new 4bits quantizated models of internlm2, decoded word starts with a blank.
#2651
opened Oct 24, 2024 by
zhulinJulia24
3 tasks
[Bug] Use triton to deploy minicpm-v-2_6 GPU memory keeps increasing until it overflows
#2642
opened Oct 24, 2024 by
LinJianping
1 of 3 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.