[ 
https://issues.apache.org/jira/browse/ORC-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17428194#comment-17428194
 ] 

Yiqun Zhang commented on ORC-1028:
----------------------------------

I think this kind of binary-level corruption, which can occur anywhere, is 
almost impossible to repair.

ORC Tools does not currently have a command to quickly identify file 
corruption. The header and footer of a file may be easier to detect quickly, 
but a stripe corruption may require a full scan to identify, even if some bit 
changes do not affect the read.

Ultimately it's the file system that causes the problem, the file format can't 
help.

> Orc file damage detection
> -------------------------
>
>                 Key: ORC-1028
>                 URL: https://issues.apache.org/jira/browse/ORC-1028
>             Project: ORC
>          Issue Type: New Feature
>          Components: Java
>            Reporter: 任建亭
>            Priority: Major
>
> On our cluster, we found a lot of corrupted ORC files. How do I quickly 
> detect if an ORC file is corrupted? Is there a tool available to repair 
> damaged ORC files if they are corrupted



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to