oleg-zinovev opened a new pull request #1836: DRILL-7156: empty parquet files 
support
URL: https://github.com/apache/drill/pull/1836
 
 
   This PR adds read and write support for empty Parquet files in Drill.
   
   Known limitations:
   1) Does not work for Hive Parquet tables for now
   2) When writing an empty Parquet file, all schemas except the last one are 
ignored
   3) Empty schemas are not supported (e.g. create table `..` as select * from 
`empty.json`, where empty.json contains only {})
   
   Short changes description:
   1) Parquet footer metadata added
   2) The Parquet writer checks that at least 1 row has been written. If not, 
it creates an empty Parquet file with a footer.
   3) EmptyParquetRowGroupScan and EmptyParquetScanBatchCreator added
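
The writer-side check in item 2 can be illustrated with a minimal sketch. This is not the code from the PR: EmptyFileWriterSketch, its methods, and the string-based "file" are hypothetical stand-ins chosen so the example runs without Drill or Parquet on the classpath. It only shows the idea that the footer, which carries the schema, is emitted on close even when no rows were written.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a Parquet-style writer; NOT Drill's actual classes.
// The "file" is modeled as a list of strings instead of real Parquet bytes.
final class EmptyFileWriterSketch {
    private final List<String> schema;                   // column names (assumed: the last schema seen)
    private final List<String> rows = new ArrayList<>(); // rows buffered so far

    EmptyFileWriterSketch(List<String> schema) {
        this.schema = new ArrayList<>(schema);
    }

    void write(String row) {
        rows.add(row);
    }

    // Returns the logical contents of the file produced on close.
    List<String> close() {
        List<String> file = new ArrayList<>();
        for (String row : rows) {
            file.add("ROW " + row);
        }
        // The footer is appended even when no rows were written, so a reader
        // can still recover the schema from an otherwise empty file.
        file.add("FOOTER schema=" + String.join(",", schema) + " rows=" + rows.size());
        return file;
    }
}
```

Under these assumptions, closing a writer that never received a row yields a file containing only the footer entry, which is the case the new empty-file scan would have to handle on the read side.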
   
   Questions:
   1) TestParquetWriterEmptyFiles#testMultipleWriters now creates several empty 
files but no longer fails, since reading empty Parquet files is supported. 
Should I rewrite the comment or remove the test?
   

