rdsr opened a new pull request #843: [WIP] InputFormat support for Iceberg
URL: https://github.com/apache/incubator-iceberg/pull/843

The current code is pretty rough. I'm sending it to get early feedback on the approach.

Most of the scaffolding is there. There's support for parameterized in-memory records. All read value functions for data formats like Avro, Parquet and ORC are supported through the `ReadSupport` interface. The same interface also allows adding identity partition columns to the input row.

I supported parameterized input records because much of the use of the InputFormat API is for supporting MR-based engines like Pig and Hive.

I need feedback on the `ReadSupport` API to see whether folks are OK with it. Detailed code feedback can be omitted for now. Once people are OK with the high-level approach, I will clean up the code and add tests. Feedback is welcome on testing the input format as well.

cc @rdblue @teabot @jerryshao @massdosage

This patch is based on the work of @guilload
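
For readers following the `ReadSupport` discussion, here is a minimal sketch of how an interface of this kind could be shaped. It is not the code in this PR: the method names `fromFileRecord` and `withIdentityPartitionColumns`, the `MapReadSupport` class, the `readRow` helper, and the use of `Map<String, Object>` as a stand-in for decoded file records are all assumptions made for illustration. The idea it shows is that the type parameter `T` is the engine's in-memory record type, and the same interface is responsible for appending identity partition values to each row.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class ReadSupportSketch {

    /** Hypothetical stand-in for the PR's ReadSupport; not the actual Iceberg API. */
    interface ReadSupport<T> {
        /** Converts a record decoded from a data file into the engine's in-memory type T. */
        T fromFileRecord(Map<String, Object> fileRecord);

        /** Returns a row that also carries the file's identity partition values. */
        T withIdentityPartitionColumns(T row, Map<String, Object> partitionValues);
    }

    /** Example implementation whose in-memory record type is a plain Map (an assumption). */
    static class MapReadSupport implements ReadSupport<Map<String, Object>> {
        @Override
        public Map<String, Object> fromFileRecord(Map<String, Object> fileRecord) {
            // Copy so that the caller's record is never mutated.
            return new LinkedHashMap<>(fileRecord);
        }

        @Override
        public Map<String, Object> withIdentityPartitionColumns(
                Map<String, Object> row, Map<String, Object> partitionValues) {
            Map<String, Object> result = new LinkedHashMap<>(row);
            // Identity partition values are not stored in the data file,
            // so they are appended to the row after it has been read.
            result.putAll(partitionValues);
            return result;
        }
    }

    /** Generic read path: works for any in-memory record type T. */
    static <T> T readRow(
            ReadSupport<T> support,
            Map<String, Object> fileRecord,
            Map<String, Object> partitionValues) {
        T row = support.fromFileRecord(fileRecord);
        return support.withIdentityPartitionColumns(row, partitionValues);
    }

    public static void main(String[] args) {
        Map<String, Object> fileRecord = new LinkedHashMap<>();
        fileRecord.put("id", 1L);
        fileRecord.put("name", "a");

        Map<String, Object> partitionValues = new LinkedHashMap<>();
        partitionValues.put("date", "2020-03-01");

        System.out.println(readRow(new MapReadSupport(), fileRecord, partitionValues));
    }
}
```

Under these assumptions, an engine such as Pig or Hive would supply its own implementation with `T` bound to its native record type, while the generic read path stays unchanged.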
