[ 
https://issues.apache.org/jira/browse/PARQUET-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15123632#comment-15123632
 ] 

Wes McKinney commented on PARQUET-479:
--------------------------------------

I definitely don't want to bloat the git repo. If we go that route, we would 
either host the test data files outside of the main git repo or have a data 
generation script that creates them from scratch locally. parquet-mr probably 
never had to face this issue because it was the proverbial chicken. 
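
To make the "generate locally" option concrete, here is a rough sketch in 
Python (chosen only for brevity). Everything in it is an assumption for 
illustration: the function name generate_int32_column is hypothetical, and the 
file it writes is a raw int32 dump standing in for a real Parquet file, since 
a real script would call whatever Parquet writer is available. The point is 
only that a fixed seed gives the same bytes on every run, so nothing has to be 
checked in.

```python
import random
import struct
import tempfile
from pathlib import Path


def generate_int32_column(path, num_values, seed=0):
    """Write num_values pseudo-random little-endian int32 values to path.

    Hypothetical helper: a stand-in for a real Parquet data generator.
    """
    rng = random.Random(seed)  # fixed seed, so reruns produce the same values
    values = [rng.randint(-2**31, 2**31 - 1) for _ in range(num_values)]
    path.write_bytes(struct.pack(f"<{num_values}i", *values))
    return values


# Generate the same "test file" twice in a scratch directory and compare.
with tempfile.TemporaryDirectory() as tmp:
    first = Path(tmp) / "a.bin"
    second = Path(tmp) / "b.bin"
    values_a = generate_int32_column(first, 1000, seed=42)
    values_b = generate_int32_column(second, 1000, seed=42)
    same_bytes = first.read_bytes() == second.read_bytes()
    file_size = first.stat().st_size
```

Because the scratch directory is removed when the block exits, the generated 
data never touches the repo.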

My preference would be to focus on testing round-tripping data from the ground 
up, but for that I also need to be able to write Parquet files =) It might be 
useful to have some "smoke tests" that use external pre-generated data files, 
but that doesn't feel like a scalable solution (e.g. a bug fix may require 
generating exactly the right file to reproduce the bug).
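
As a sketch of what I mean by round-tripping from the ground up (Python for 
brevity, not a proposal for the actual parquet-cpp API): encode some values, 
decode the bytes, and check that the result matches the input. The function 
names below are hypothetical. The encoding is a toy version of PLAIN for 
INT32, i.e. 4-byte little-endian values back to back, with no pages, headers 
or file metadata.

```python
import struct


def encode_plain_int32(values):
    """Encode a list of ints as consecutive 4-byte little-endian values."""
    return struct.pack(f"<{len(values)}i", *values)


def decode_plain_int32(data):
    """Decode bytes produced by encode_plain_int32 back into a list."""
    if len(data) % 4 != 0:
        raise ValueError("buffer length is not a multiple of 4 bytes")
    return list(struct.unpack(f"<{len(data) // 4}i", data))


def check_round_trip(values):
    """Return True if the values survive an encode/decode cycle unchanged."""
    return decode_plain_int32(encode_plain_int32(values)) == values


# Edge cases are built in code, so no pre-generated file is needed.
cases = [
    [],                     # empty column
    [0],                    # single value
    [-1, 1],                # sign handling
    [-2**31, 2**31 - 1],    # int32 limits
    list(range(1000)),      # longer run
]
results = [check_round_trip(case) for case in cases]
```

A bug fix can then add a case to the list rather than require a new binary 
file, which is the scalability argument in the paragraph above.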

> Improve/expand functional unit tests
> ------------------------------------
>
>                 Key: PARQUET-479
>                 URL: https://issues.apache.org/jira/browse/PARQUET-479
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-cpp
>    Affects Versions: cpp-0.1
>            Reporter: Aliaksei Sandryhaila
>            Assignee: Aliaksei Sandryhaila
>             Fix For: cpp-0.1
>
>
> We need to add a testing framework for unit tests, and run it as a part of 
> each Travis CI build.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
