Tongjie Chen created PARQUET-100:
------------------------------------

             Summary: provide an option in parquet-pig to avoid reading footers 
in client side
                 Key: PARQUET-100
                 URL: https://issues.apache.org/jira/browse/PARQUET-100
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-mr
    Affects Versions: parquet-mr_1.6.0
            Reporter: Tongjie Chen


Parquet Pig reads footer in client side, to calculate splits and retrieve 
schema etc.

In HCatalog environment, if there are large number of files generated by Hive, 
Parquet-Pig will spend significant chunk of time processing those footers in 
client side (before job is submitted to cluster).  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to