[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file
[ https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Íñigo Goiri updated HDFS-14043: --- Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 2.9.3 3.2.1 3.3.0 3.1.2 3.0.4 2.10.0 Status: Resolved (was: Patch Available) Thanks [~lukmajercak] for the fix and [~surmountian] for the review. Committed to trunk, branch-3.2, branch-3.1, branch-3.0, branch-2, and branch-2.9. > Tolerate corrupted seen_txid file > - > > Key: HDFS-14043 > URL: https://issues.apache.org/jira/browse/HDFS-14043 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs, namenode >Affects Versions: 2.9.2, 3.1.2, 2.9.3 >Reporter: Lukas Majercak >Assignee: Lukas Majercak >Priority: Major > Fix For: 2.10.0, 3.0.4, 3.1.2, 3.3.0, 3.2.1, 2.9.3 > > Attachments: HDFS-14043.001.patch, HDFS-14043.002.patch, > HDFS-14043.003.patch > > > We already tolerate IOExceptions when reading seen_txid file from namenode's > dirs. So we take the maximum txid of all the *readable* namenode dirs. We > should extend this to when the file is corrupted. Currently, > PersistentLongFile.readFile throws NumberFormatException in this case and the > whole NN crashes. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file
[ https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lukas Majercak updated HDFS-14043: -- Attachment: HDFS-14043.003.patch > Tolerate corrupted seen_txid file > - > > Key: HDFS-14043 > URL: https://issues.apache.org/jira/browse/HDFS-14043 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs, namenode >Affects Versions: 2.9.2, 3.1.2, 2.9.3 >Reporter: Lukas Majercak >Assignee: Lukas Majercak >Priority: Major > Attachments: HDFS-14043.001.patch, HDFS-14043.002.patch, > HDFS-14043.003.patch > > > We already tolerate IOExceptions when reading seen_txid file from namenode's > dirs. So we take the maximum txid of all the *readable* namenode dirs. We > should extend this to when the file is corrupted. Currently, > PersistentLongFile.readFile throws NumberFormatException in this case and the > whole NN crashes. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file
[ https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lukas Majercak updated HDFS-14043: -- Attachment: HDFS-14043.002.patch > Tolerate corrupted seen_txid file > - > > Key: HDFS-14043 > URL: https://issues.apache.org/jira/browse/HDFS-14043 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs, namenode >Affects Versions: 2.9.2, 3.1.2, 2.9.3 >Reporter: Lukas Majercak >Assignee: Lukas Majercak >Priority: Major > Attachments: HDFS-14043.001.patch, HDFS-14043.002.patch > > > We already tolerate IOExceptions when reading seen_txid file from namenode's > dirs. So we take the maximum txid of all the *readable* namenode dirs. We > should extend this to when the file is corrupted. Currently, > PersistentLongFile.readFile throws NumberFormatException in this case and the > whole NN crashes. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file
[ https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Íñigo Goiri updated HDFS-14043: --- Status: Patch Available (was: Open) > Tolerate corrupted seen_txid file > - > > Key: HDFS-14043 > URL: https://issues.apache.org/jira/browse/HDFS-14043 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs, namenode >Affects Versions: 2.9.2, 3.1.2, 2.9.3 >Reporter: Lukas Majercak >Assignee: Lukas Majercak >Priority: Major > Attachments: HDFS-14043.001.patch > > > We already tolerate IOExceptions when reading seen_txid file from namenode's > dirs. So we take the maximum txid of all the *readable* namenode dirs. We > should extend this to when the file is corrupted. Currently, > PersistentLongFile.readFile throws NumberFormatException in this case and the > whole NN crashes. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file
[ https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lukas Majercak updated HDFS-14043: -- Description: We already tolerate IOExceptions when reading seen_txid file from namenode's dirs. So we take the maximum txid of all the *readable* namenode dirs. We should extend this to when the file is corrupted. Currently, PersistentLongFile.readFile throws NumberFormatException in this case and the whole NN crashes. > Tolerate corrupted seen_txid file > - > > Key: HDFS-14043 > URL: https://issues.apache.org/jira/browse/HDFS-14043 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs, namenode >Affects Versions: 2.9.2, 3.1.2, 2.9.3 >Reporter: Lukas Majercak >Assignee: Lukas Majercak >Priority: Major > Attachments: HDFS-14043.001.patch > > > We already tolerate IOExceptions when reading seen_txid file from namenode's > dirs. So we take the maximum txid of all the *readable* namenode dirs. We > should extend this to when the file is corrupted. Currently, > PersistentLongFile.readFile throws NumberFormatException in this case and the > whole NN crashes. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file
[ https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lukas Majercak updated HDFS-14043: -- Attachment: HDFS-14043.001.patch > Tolerate corrupted seen_txid file > - > > Key: HDFS-14043 > URL: https://issues.apache.org/jira/browse/HDFS-14043 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs, namenode >Affects Versions: 2.9.2, 3.1.2, 2.9.3 >Reporter: Lukas Majercak >Assignee: Lukas Majercak >Priority: Major > Attachments: HDFS-14043.001.patch > > -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org