-
Notifications
You must be signed in to change notification settings - Fork 76
feat: implement delete file index #435
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
a23aa2b to
67d3b69
Compare
9345afc to
0b8f579
Compare
0b8f579 to
5cd132b
Compare
| "upper_bounds", "record_count"}; | ||
| const std::vector<std::string> kStatsColumns = { | ||
| "value_counts", "null_value_counts", "nan_value_counts", "lower_bounds", | ||
| "upper_bounds", "column_sizes", "record_count"}; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The newly added column_sizes seems not aligned with java impl, is that a bug in java or our implementation is different?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I think this is an oversight in the Java impl if I understand it correctly.
5cd132b to
01b4c8e
Compare
Implemented the DeleteFileIndex and Builder to manage and efficiently filter
delete files (equality deletes, position deletes, and deletion vectors)
based on sequence numbers and partitions.
Key changes:
DeleteFileIndexandDeleteFileIndex::Builderinsrc/iceberg/delete_file_index.{h,cc}.ContentFileUtilfor helper functions related to content files and DVs.ManifestReaderto support dropping stats viaTryDropStats().src/iceberg/test/delete_file_index_test.cc.