[
https://issues.apache.org/jira/browse/HDFS-17535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ruiliang updated HDFS-17535:
----------------------------
Description:
我了解到 EC 确实存在文件损坏的重错误
https://issues.apache.org/jira/browse/HDFS-15759
1:我已确认 EC 损坏文件,此损坏文件可以恢复吗?
有重要数据导致我们生产数据丢失问题?有办法恢复吗?
检查 EC 块组:blk_-9223372036361352768
状态:错误,消息:EC 计算结果不匹配。:ip 为 10.12.66.116 块为:-9223372036361352765
2:[https://github.com/apache/orc/issues/1939]我想知道如果你选择了你当前的代码(GitHub pull
request #2869),我可以跳过与HDFS-14768,HDFS-15186, 和HDFS-15240?
hdfs 版本 3.1.0
谢谢
Latest findings: It is a machine network problem, the cpu si(soft interrupt) is
too high, nn loses dn heartbeat, nn sends to dn to recover and reconstruct.
Because the Weaver-Scope service of k8s is installed on the server, conntrack
interruption times out seriously, affecting all network usage.
was:
我了解到 EC 确实存在文件损坏的重大错误
https://issues.apache.org/jira/browse/HDFS-15759
1:我已确认 EC 损坏文件,此损坏文件可以恢复吗?
有重要数据导致我们生产数据丢失问题?有办法恢复吗?
检查 EC 块组:blk_-9223372036361352768
状态:错误,消息:EC 计算结果不匹配。:ip 为 10.12.66.116 块为:-9223372036361352765
2:[https://github.com/apache/orc/issues/1939]我想知道如果你选择了你当前的代码(GitHub pull
request #2869),我可以跳过与HDFS-14768,HDFS-15186, 和HDFS-15240?
hdfs 版本 3.1.0
谢谢
Latest findings: It is a machine network problem, the cpu si(soft interrupt) is
too high, nn loses dn heartbeat, nn sends to dn to recover and reconstruct.
Because the Weaver-Scope service of k8s is installed on the server, conntrack
interruption times out seriously, affecting all network usage.
> I have confirmed the EC corrupt file, can this corrupt file be restored?
> ------------------------------------------------------------------------
>
> Key: HDFS-17535
> URL: https://issues.apache.org/jira/browse/HDFS-17535
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: ec, hdfs
> Affects Versions: 3.1.0
> Reporter: ruiliang
> Priority: Blocker
>
> 我了解到 EC 确实存在文件损坏的重错误
> https://issues.apache.org/jira/browse/HDFS-15759
> 1:我已确认 EC 损坏文件,此损坏文件可以恢复吗?
> 有重要数据导致我们生产数据丢失问题?有办法恢复吗?
> 检查 EC 块组:blk_-9223372036361352768
> 状态:错误,消息:EC 计算结果不匹配。:ip 为 10.12.66.116 块为:-9223372036361352765
> 2:[https://github.com/apache/orc/issues/1939]我想知道如果你选择了你当前的代码(GitHub pull
> request #2869),我可以跳过与HDFS-14768,HDFS-15186, 和HDFS-15240?
> hdfs 版本 3.1.0
> 谢谢
>
> Latest findings: It is a machine network problem, the cpu si(soft interrupt)
> is too high, nn loses dn heartbeat, nn sends to dn to recover and reconstruct.
> Because the Weaver-Scope service of k8s is installed on the server, conntrack
> interruption times out seriously, affecting all network usage.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]