Tianshuo Deng created PARQUET-197:
-------------------------------------

             Summary: parquet-cascading and the mapred API do not create 
metadata files
                 Key: PARQUET-197
                 URL: https://issues.apache.org/jira/browse/PARQUET-197
             Project: Parquet
          Issue Type: Bug
            Reporter: Tianshuo Deng
            Assignee: Tianshuo Deng


Repro: Run a Scalding job that writes Parquet files to a folder. Neither a 
_metadata nor a _common_metadata file is created.
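
To confirm the repro, the output folder can be checked for the two summary 
files. The following is only a minimal sketch: the helper name 
missing_summary_files is hypothetical, and it assumes the output folder is on 
the local filesystem (an HDFS path would need an HDFS client instead).

```python
import os

# Names of the Parquet summary files this report says are missing.
SUMMARY_FILES = ("_metadata", "_common_metadata")


def missing_summary_files(output_dir):
    """Return the summary file names that are absent from output_dir.

    Hypothetical helper for illustration; assumes a local directory.
    """
    return [
        name
        for name in SUMMARY_FILES
        if not os.path.isfile(os.path.join(output_dir, name))
    ]
```

For a job affected by this issue, the call would be expected to return both 
file names.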

Impact: Potential performance problem when Parquet metadata is read on the 
client side, as is the case for Spark SQL. Without the summary files, the 
client has to read the footer of each Parquet file.

Cause: The metadata-writing logic is in the mapreduce API but not in the 
mapred API of Parquet.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
