Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

An error occurred when starting the API server in Docker and using the API locally #553

Open
lililolo0927 opened this issue Feb 20, 2025 · 1 comment

Comments

@lililolo0927
Copy link

An error occurred when starting the API server in Docker and using the API locally

The cmd I used to create container is
docker run --gpus all -d -v /mnt/data/models:/models -p 10002:10002 --name ktransformers -itd approachingai/ktransformers:0.2.1
and running well with port 10002 opened .
Image

Inside the container, I typed the following command to start server, referring to this link, https://kvcache-ai.github.io/ktransformers/en/install.html#restful-api-and-web-ui

ktransformers --gguf_path /models/gguf/DeepSeek-V3-Q4_K_M --model_path deepseek-ai/DeepSeek-V3 --model_name DeepSeek-V3

Then it shows as the following in the container, (started well)
Image

And if I try the example as https://github.com/kvcache-ai/ktransformers/blob/main/doc/en/api/server/api.md ,
on local server, the followings were returned,
Image

Image

And on the server,

Image

Image

Please share the ideas to fix this problems! Thanks

@ubergarm
Copy link
Contributor

  1. Only very recently did the API endpoint start working after PR382 ktransformers@ee24eb8 Before yesterday 2.1 didn't really work with API, local chat only.
  2. You're on python 3.10, I've only tested with 3.11 using uv (i don't use conda or poetry)
  3. Full guide including prebuilt binary .whl files here if u interested https://github.com/ubergarm/r1-ktransformers-guide

Its working pretty well now with the very tip of main ktransformers@25c5bdd which is what the prebuilt whl i provide above is.

Holler again if you're getting the same errors after updating! Or do git rev-parse --short HEAD inside the container perhaps to see what version you are on.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants