[ 
https://issues.apache.org/jira/browse/PARQUET-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17608558#comment-17608558
 ] 

ASF GitHub Bot commented on PARQUET-2195:
-----------------------------------------

wgtmac opened a new pull request, #998:
URL: https://github.com/apache/parquet-mr/pull/998

   This PR enhances parquet-cli by adding a scan command to go through all 
records without printing them. This is useful when users need to verify if the 
parquet file is corrupted.
   
   No additional unit tests are added. Test it manually with local parquet 
files.




> Add scan command to parquet-cli
> -------------------------------
>
>                 Key: PARQUET-2195
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2195
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-cli
>            Reporter: Gang Wu
>            Priority: Major
>
> parquet-cli has *cat* and *head* commands to print the records but it does 
> not have the capability to *scan* (w/o printing) all records to check if the 
> file is corrupted.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to