rdblue commented on issue #359: Spec: Add file and position delete files URL: https://github.com/apache/incubator-iceberg/issues/359#issuecomment-606747108 I don't understand what #885 has to do with the spec. That updates `DataFile`. The spec is [this document](http://iceberg.apache.org/spec) that details the Iceberg at-rest format. Maybe you used the wrong branch? For position, I think we should use a long after all. That's what `DataFile` uses to encode the number of rows, and I've seen Parquet files with more than 2 billion rows before. I agree that we should sort by both path and position, but this will require support from the processing engine because the file contents could be large.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
