-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
iceberg scan with deletes slow anti hash join #55928
Comments
DEBUG BaseDistributedDataScan: Total Scan Metrics: 3 seconds scan because we pass all data files through hashjoin
|
looks like iceberg scan has to do it internally |
it is an issue of eq delete range covering multiple data files, not sure if SR could do it better wo some bloom filter stuff |
lets say this is a performance improvement request now |
why iceberg anti join hash probe is so slow here is olap example
for iceberg plan it is seconds
attaching query profile for iceberg |
before compactions
we scan 51 mil without delete files, 73 mil with delete files
after compaction
we scan 124 mil with delete files
it is slower than before compaction
please note that there are only a handful of delete row records now
@stephen-shelby
The text was updated successfully, but these errors were encountered: