How to run Phi3 on NPU with ORT+DML via OnnxRuntimeGenai library #1180

Gusha-nye · 2025-01-13T07:51:31Z

Currently, in the example given in the official microsoft documentation(https://learn.microsoft.com/zh-cn/windows/ai/models/get-started-models-genai), it is possible to run phi3 models on GPUs using DML acceleration under the ORT framework with the OnnxRuntimeAI library. However, I now want to deploy it to reason on NPU, but I don't seem to see any parameter in the GeneratorParams class or Generator class of the OnnxRuntimeGenAI library where I can set the hardware (CPU, GPU, NPU) reasoning platform. Is it currently possible to implement ORT+DML based running Phi3 on NPU? If it is possible, please tell me how to set it up? If there is a demo can you send it for reference? (C#, python, C++ are all acceptable)

microsoft-github-policy-service bot added the ep:DML label Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to run Phi3 on NPU with ORT+DML via OnnxRuntimeGenai library #1180

How to run Phi3 on NPU with ORT+DML via OnnxRuntimeGenai library #1180

Gusha-nye commented Jan 13, 2025

How to run Phi3 on NPU with ORT+DML via OnnxRuntimeGenai library #1180

How to run Phi3 on NPU with ORT+DML via OnnxRuntimeGenai library #1180

Comments

Gusha-nye commented Jan 13, 2025