
image_transforms preprocess quite slow when run large image with qwen2vl #34272

Open
zhjunqin opened this issue Oct 21, 2024 · 9 comments · May be fixed by #35733

Comments

@zhjunqin

zhjunqin commented Oct 21, 2024

System Info

  • transformers version: 4.45.2
  • Platform: Linux-5.4.0-132-generic-x86_64-with-glibc2.31
  • Python version: 3.12.7
  • Huggingface_hub version: 0.25.1
  • Safetensors version: 0.4.5
  • Accelerate version: 1.0.0
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.4.0+cu121 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using distributed or parallel set-up in script?:
  • Using GPU in script?:
  • GPU type: NVIDIA GeForce RTX 3090

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

The functions `rescale` and `normalize` in image_transforms are quite slow when preprocessing large images.
https://github.com/huggingface/transformers/blob/main/src/transformers/image_transforms.py

Here is a benchmark:

[benchmark table attached as a screenshot in the original issue]

please refer to vllm-project/vllm#9238
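For context, the cost pattern reported here can be reproduced with a standalone sketch. The `rescale` and `normalize` functions below are hypothetical stand-ins for the numpy code paths in transformers' image_transforms, not the library's actual code; the point is that chaining them allocates several full-size float64 temporaries, whereas precomputing per-channel scale/shift factors collapses the pipeline into one float32 multiply-add per pixel:

```python
import time
import numpy as np

# Hypothetical stand-ins for the numpy paths of rescale/normalize
# in transformers' image_transforms (illustrative, not library code).
def rescale(img, scale=1 / 255):
    # uint8 * python float upcasts to float64 and allocates a new array
    return img * scale

def normalize(img, mean, std):
    # two more full-size temporaries: (img - mean), then the division
    return (img - mean) / std

img = np.random.randint(0, 256, (2160, 3840, 3), dtype=np.uint8)
mean = np.array([0.485, 0.456, 0.406])
std = np.array([0.229, 0.224, 0.225])

t0 = time.perf_counter()
out = normalize(rescale(img), mean, std)
t_naive = time.perf_counter() - t0

# Fused single pass in float32: fold rescale and normalize into
# per-channel scale/shift, so the pipeline is one multiply-add.
scale32 = (1.0 / (255.0 * std)).astype(np.float32)
shift32 = (-mean / std).astype(np.float32)
t0 = time.perf_counter()
fused = img.astype(np.float32) * scale32 + shift32
t_fused = time.perf_counter() - t0

print(f"naive: {t_naive:.3f}s  fused: {t_fused:.3f}s")
```

The fused version computes the same result (to float32 precision) while touching the image once instead of three times, which is essentially what torch/torchvision-based fast processors exploit on top of GPU execution.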

Expected behavior

How can this preprocessing be made faster?

@zhjunqin zhjunqin added the bug label Oct 21, 2024
@zucchini-nlp
Member

Hey @zhjunqin !

This may be related to #28847, where we enabled image processing with torchvision, but that is only supported in the ViT model. Also, @yonigozlan is working on optimizing image processing time in #33810, so he might be your point of contact :)

@yonigozlan
Member

Hey @zhjunqin !
Thanks a lot for raising this issue. Indeed I'm currently working on adding fast image processors to Transformers, and I'll try to address the Qwen one shortly. I'll ping this issue once a PR is opened!


This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@SinanAkkoyun

@yonigozlan Hey :) Did you find time to address the qwen preprocessor?

@yonigozlan
Member

Not yet, but it is still planned :). I will ping here when it's done


This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@Gladiator07
Contributor

Hi @yonigozlan, is qwen's preprocessing refactoring still planned? We are using offline inference with Qwen2-VL 7B for document extraction tasks on approximately 70 million images, and the preprocessing time is a major slowdown for us. If it is not planned immediately, is there any workaround to speed up or skip the preprocessing? I am already sending images resized with the smart_resize function, but they somehow still get sent to huggingface for resizing again. Any pointers would help a lot...
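For readers following along, the smart_resize pre-resizing step mentioned above can be sketched roughly as below. This is a re-implementation for illustration only; the canonical function lives in transformers' qwen2_vl image processing module, and the default pixel budgets here (`factor=28`, `min_pixels`, `max_pixels`) are assumed values, not guaranteed to match any particular release:

```python
import math

# Sketch of Qwen2-VL-style smart_resize: snap (height, width) to
# multiples of `factor` while keeping the total pixel count inside
# [min_pixels, max_pixels] and roughly preserving aspect ratio.
def smart_resize(height, width, factor=28,
                 min_pixels=56 * 56, max_pixels=14 * 14 * 4 * 1280):
    h_bar = round(height / factor) * factor
    w_bar = round(width / factor) * factor
    if h_bar * w_bar > max_pixels:
        # too large: shrink both sides by the same ratio, round down
        beta = math.sqrt((height * width) / max_pixels)
        h_bar = math.floor(height / beta / factor) * factor
        w_bar = math.floor(width / beta / factor) * factor
    elif h_bar * w_bar < min_pixels:
        # too small: grow both sides by the same ratio, round up
        beta = math.sqrt(min_pixels / (height * width))
        h_bar = math.ceil(height * beta / factor) * factor
        w_bar = math.ceil(width * beta / factor) * factor
    return h_bar, w_bar

print(smart_resize(3000, 4000))
```

If images are pre-resized offline to exactly these dimensions, the processor's own resize should in principle become a no-op, though as the comment above notes, whether the processor actually skips the redundant resize depends on the transformers version in use.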

@yonigozlan
Member

yonigozlan commented Jan 16, 2025

Hi @Gladiator07. Sorry for the delay on this. I was waiting for this big refactoring PR on fast image processors #35069 to be merged to continue adding new fast image processors.
But as this is taking longer than I thought, and since there is a lot of demand for qwen2vl, I'll try to open a PR for a fast qwen2vl image processor by the end of the week. I'll ping here when it's opened.
Once it's out you'll be able to check out my branch to use it.
Hope that sounds good!

@yonigozlan
Member

PR is open here #35733 !

6 participants