deniskuzZ commented on PR #13329:
URL: https://github.com/apache/iceberg/pull/13329#issuecomment-2983541535

   > > off-topic: partition stats file format is highly coupled with table
   > > write.format. If CU is using ORC, he automatically loses the partition stats.
   > 
   > Yes, jack-ye brought this up in the initial proposal document. The spec
   > originally hardcoded that the file must use the table's default format.
   > We later updated it so that it can be any format
   > (https://iceberg.apache.org/spec/#partition-statistics-file).
   > 
   > So we could add a table property to configure a stats format different
   > from the data format. But I recommend supporting InternalData for ORC
   > instead of changing the partition stats format here; I can help with
   > that if needed (I did it for Parquet and Avro). That way, ORC can be
   > used for writing table metadata in v4.
   
   @ajantha-bhat, it would be awesome if we added an InternalData reader/writer
   for ORC. However, I am not sure how big that effort would be, since we would
   need to implement support for a few missing features such as default values,
   timestamp(9), and variant.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

