[
https://issues.apache.org/jira/browse/PARQUET-481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aliaksei Sandryhaila updated PARQUET-481:
-----------------------------------------
Description:
reader-test currently tests with a single parquet file and only verifies that
we can read it, not the correctness of the output.
Proposed changes:
- Expand it to work with multiple files
- Move tests for Scanner to scanner-test.cc
- Add method ParquetFileReader::JsonPrint() that prints a file contents in a
json format, so we can consistently compare the output with the ground truth
stored in parquet-cpp/data. This method will also be more handy than DebugPrint
when we start working with nested columns.
was:
reader-test currently tests with a single parquet file and only verifies that
we can read it, not the correctness of the output.
Proposed changes:
- Move reader-test.cc to a separate directory parquet-cpp/tests (in the future,
all unit tests will be located there)
- Expand it to work with multiple files
- Add method ParquetFileReader::JsonPrint() that prints a file contents in a
json format, so we can consistently compare the output with the ground truth
stored in parquet-cpp/data. This method will also be more handy than DebugPrint
when we start working with nested columns.
> Refactor and expand reader-test
> -------------------------------
>
> Key: PARQUET-481
> URL: https://issues.apache.org/jira/browse/PARQUET-481
> Project: Parquet
> Issue Type: Sub-task
> Components: parquet-cpp
> Affects Versions: cpp-0.1
> Reporter: Aliaksei Sandryhaila
> Assignee: Aliaksei Sandryhaila
> Fix For: cpp-0.1
>
>
> reader-test currently tests with a single parquet file and only verifies that
> we can read it, not the correctness of the output.
> Proposed changes:
> - Expand it to work with multiple files
> - Move tests for Scanner to scanner-test.cc
> - Add method ParquetFileReader::JsonPrint() that prints a file contents in a
> json format, so we can consistently compare the output with the ground truth
> stored in parquet-cpp/data. This method will also be more handy than
> DebugPrint when we start working with nested columns.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)