
[SUPPORT] Does Hudi re-create record level index during an upsert operation? #12783

Open
dataproblems opened this issue Feb 5, 2025 · 4 comments
Labels
index, metadata, metadata table, priority:critical (production down; pipelines stalled; need help asap)

Comments

@dataproblems

Describe the problem you faced

I created a Hudi table with the record level index and performed an upsert on it. The first time I performed the upsert, it read the record index files, figured out which files needed an update, and wrote the updated files to S3. The second time I performed an upsert on the same table, I saw the record_index folder being deleted and recreated under the metadata folder. My Hudi table is quite large, and re-creating the entire record level index is too expensive to support during an upsert operation.

To Reproduce

Steps to reproduce the behavior:

  1. Create a Hudi table using insert mode with the record level index enabled (see the spark-shell sketch after this list)
  2. Perform an upsert
  3. Perform another upsert
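
A minimal spark-shell (Scala) sketch of the three steps above, using the options from "Additional context" below. The base path, field names, and the initialDf / upsertDf1 / upsertDf2 DataFrames are placeholders, not taken from the original report:

    import org.apache.spark.sql.SaveMode

    val basePath = "s3://bucket/path/to/table" // placeholder

    // Common Hudi options, mirroring the config shared in "Additional context".
    val hudiOpts = Map(
      "hoodie.table.name"                           -> "my_table",
      "hoodie.datasource.write.recordkey.field"     -> "record_key",
      "hoodie.datasource.write.partitionpath.field" -> "partition_field",
      "hoodie.datasource.write.precombine.field"    -> "timestamp_field",
      "hoodie.datasource.write.table.type"          -> "COPY_ON_WRITE",
      "hoodie.metadata.enable"                      -> "true",
      "hoodie.metadata.record.index.enable"         -> "true",
      "hoodie.index.type"                           -> "RECORD_INDEX"
    )

    // 1. Create the table with an insert, which builds the record level index.
    initialDf.write.format("hudi")
      .options(hudiOpts + ("hoodie.datasource.write.operation" -> "insert"))
      .mode(SaveMode.Overwrite)
      .save(basePath)

    // 2. First upsert: the record index is updated incrementally, as expected.
    upsertDf1.write.format("hudi")
      .options(hudiOpts + ("hoodie.datasource.write.operation" -> "upsert"))
      .mode(SaveMode.Append)
      .save(basePath)

    // 3. Second upsert: this is where .hoodie/metadata/record_index was observed
    //    being deleted and rebuilt instead of being updated incrementally.
    upsertDf2.write.format("hudi")
      .options(hudiOpts + ("hoodie.datasource.write.operation" -> "upsert"))
      .mode(SaveMode.Append)
      .save(basePath)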

Expected behavior

I expect any number of upserts after the initial creation of the record level index to simply update the index as required, not re-create the whole index.

Environment Description

  • Hudi version : 0.15.0

  • Spark version : 3.4.1

  • Hive version :

  • Hadoop version : 3.3.6

  • Storage (HDFS/S3/GCS..) : S3

  • Running on Docker? (yes/no) : no

Additional context

Config I used for the upsert operation:

    hoodie.embed.timeline.server -> false, 
    hoodie.parquet.small.file.limit -> 1073741824, 
    hoodie.metadata.record.index.enable -> true, 
    hoodie.datasource.write.precombine.field -> $timestampField, 
    hoodie.datasource.write.payload.class -> org.apache.hudi.common.model.OverwriteWithLatestAvroPayload,   
    hoodie.metadata.index.column.stats.enable -> true, 
    hoodie.parquet.max.file.size -> 2147483648, 
    hoodie.metadata.enable -> true, 
    hoodie.index.type -> RECORD_INDEX, 
    hoodie.datasource.write.operation -> upsert, 
    hoodie.parquet.compression.codec -> snappy, 
    hoodie.datasource.write.recordkey.field -> $recordKeyField, 
    hoodie.table.name -> $tableName, 
    hoodie.datasource.write.table.type -> COPY_ON_WRITE, 
    hoodie.datasource.write.hive_style_partitioning -> true, 
    hoodie.write.markers.type -> DIRECT, 
    hoodie.populate.meta.fields -> true, 
    hoodie.datasource.write.keygenerator.class -> org.apache.hudi.keygen.SimpleKeyGenerator, 
    hoodie.upsert.shuffle.parallelism -> 10000, 
    hoodie.datasource.write.partitionpath.field -> $partitionField

I do not see any errors, but it does not make sense that Hudi would clear away my index and recreate it.

@danny0405
Contributor

The MDT (metadata table) is updated incrementally on each upsert; if the MDT got re-initialized, the reason should be some data consistency issue between the MDT and the data table.
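
A minimal spark-shell sketch (assuming the standard metadata table layout under <basePath>/.hoodie/metadata; the path is a placeholder) for confirming whether the record index file groups and the metadata table timeline were rebuilt rather than updated incrementally:

    import org.apache.hadoop.fs.{FileSystem, Path}
    import java.net.URI

    val basePath = "s3://bucket/path/to/table"   // placeholder
    val mdtPath  = s"$basePath/.hoodie/metadata" // standard metadata table location

    val fs = FileSystem.get(new URI(basePath), spark.sparkContext.hadoopConfiguration)

    // File groups backing the record index; a full re-initialization replaces these,
    // so their modification times would jump to the second upsert.
    fs.listStatus(new Path(s"$mdtPath/record_index"))
      .foreach(s => println(s"${s.getPath.getName}  ${s.getModificationTime}"))

    // The metadata table's own timeline; if it effectively starts at the second
    // upsert's commit time, the MDT was re-bootstrapped rather than updated.
    fs.listStatus(new Path(s"$mdtPath/.hoodie"))
      .map(_.getPath.getName).sorted.foreach(println)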

@dataproblems
Author

Is there a log line I could search for to determine what might have caused it? Since there is only one writer writing to this Hudi table, is there a way to know what caused the inconsistency?
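
A minimal spark-shell sketch (assuming hoodie.properties carries the hoodie.table.metadata.partitions and hoodie.table.metadata.partitions.inflight keys used by recent releases; the path is a placeholder) for checking which metadata partitions the data table currently marks as complete vs. inflight; a record_index entry dropping out of the completed list would line up with a re-initialization:

    import org.apache.hadoop.fs.{FileSystem, Path}
    import java.net.URI
    import java.util.Properties

    val basePath = "s3://bucket/path/to/table" // placeholder
    val fs = FileSystem.get(new URI(basePath), spark.sparkContext.hadoopConfiguration)

    // hoodie.properties records which metadata table partitions are complete/inflight.
    val props = new Properties()
    val in = fs.open(new Path(s"$basePath/.hoodie/hoodie.properties"))
    try props.load(in) finally in.close()

    println("completed MDT partitions: " + props.getProperty("hoodie.table.metadata.partitions"))
    println("inflight MDT partitions:  " + props.getProperty("hoodie.table.metadata.partitions.inflight"))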

@ad1happy2go
Collaborator

@dataproblems Are you saying the record_index directory itself is getting deleted and recreated? If yes, that is definitely not expected.

Can you share your Hudi configurations? Can you also share the driver logs?

ad1happy2go added the index, metadata, metadata table, and priority:critical (production down; pipelines stalled; need help asap) labels on Feb 7, 2025
@dataproblems
Author

@ad1happy2go the Hudi configuration I used is the same one listed under "Additional context" in the issue description above.

I may not be able to share the driver logs, since this happened on our production table. However, is there a specific error message I can search for in the logs? I can confirm whether something like that exists.
