Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GH-3156: Enable vectored IO by default. #3155

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

ahmarsuhail
Copy link

@ahmarsuhail ahmarsuhail commented Feb 14, 2025

What changes are included in this PR?

We currently had two variables to enable vectored-io,

HADOOP_VECTORED_IO_DEFAULT and HADOOP_VECTORED_IO_ENABLED_DEFAULT,

#3128 updated HADOOP_VECTORED_IO_DEFAULT, which was not being used, so vectoredIO was still turned off by default.

Are these changes tested?

Test with in proc Spark app, and ensured that vectored-io is used without the need to set "parquet.hadoop.vectored.io.enabled", "true"

Are there any user-facing changes?

@ahmarsuhail ahmarsuhail changed the title Enable vectored IO by default. GH-3156: Enable vectored IO by default. Feb 14, 2025
@ahmarsuhail
Copy link
Author

FYI @dongjoon-hyun

Copy link
Member

@wgtmac wgtmac left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

Copy link
Contributor

@mukund-thakur mukund-thakur left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we not remove this HADOOP_VECTORED_IO_ENABLED_DEFAULT from code then? This can still lead to confusion.

@mukund-thakur
Copy link
Contributor

Should we not remove this HADOOP_VECTORED_IO_ENABLED_DEFAULT from code then? This can still lead to confusion.

never mind I see you have removed it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants