oleg-zinovev opened a new pull request #1836: DRILL-7156: empty parquet files support URL: https://github.com/apache/drill/pull/1836 PR for Drill empty parquet files read and write support. Known limitations: 1) Not working for hive parquet for now 2) Ignores all schemas except last while writing empty parquet file 3) Not support empty schemas (e.g. create table `..` as select * from `empty.json, e.g. {}`) Short changes description: 1) Parquet footer metadata added 2) Parquet writer checks that at least 1 row has been written. If not - creates a empty parquet file with footer. 3) EmptyParquetRowGroupScan and EmptyParquetScanBatchCreator added Questions: 1) TestParquetWriterEmptyFiles#testMultipleWriters now creates several empty files, but not fails, since reading of empty parquet is supported. Should I rewrite comment or remove the test?
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
