rdsr opened a new pull request #843: [WIP] InputFormat support for Iceberg
URL: https://github.com/apache/incubator-iceberg/pull/843
 
 
   The current code is pretty rough. I'm sending to get early feedback on the 
approach.
   Most of the scaffolding is  there.  There's support to allow for 
parameterized in memory records. All read value functions for data formats like 
Avro, Parquet and ORC are supported through the `ReadSupport` interface. The 
same interface also allows to add identity partition columns to the input row. 
   I supported parameterized input records because much of the use of the 
InputFormat api is for supporting  MR based engines like Pig and Hive. 
   
   I need feedback on the `ReadSupport` API and see whether folks are OK with 
it.  Detailed code feedback can be omitted for now. Once people are ok the high 
level approach I will clean up the code and add tests. Feedback welcome on 
testing the input format as well. 
   
   cc @rdblue @teabot @jerryshao @massdosage
   
   This patch is based on the work of @guilload

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to