[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file

2018-11-05 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Íñigo Goiri updated HDFS-14043:
---
   Resolution: Fixed
 Hadoop Flags: Reviewed
Fix Version/s: 2.9.3
   3.2.1
   3.3.0
   3.1.2
   3.0.4
   2.10.0
   Status: Resolved  (was: Patch Available)

Thanks [~lukmajercak] for the fix and [~surmountian] for the review.
Committed to trunk, branch-3.2, branch-3.1, branch-3.0, branch-2, and 
branch-2.9.

> Tolerate corrupted seen_txid file
> -
>
> Key: HDFS-14043
> URL: https://issues.apache.org/jira/browse/HDFS-14043
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: hdfs, namenode
>Affects Versions: 2.9.2, 3.1.2, 2.9.3
>Reporter: Lukas Majercak
>Assignee: Lukas Majercak
>Priority: Major
> Fix For: 2.10.0, 3.0.4, 3.1.2, 3.3.0, 3.2.1, 2.9.3
>
> Attachments: HDFS-14043.001.patch, HDFS-14043.002.patch, 
> HDFS-14043.003.patch
>
>
> We already tolerate IOExceptions when reading seen_txid file from namenode's 
> dirs. So we take the maximum txid of all the *readable* namenode dirs. We 
> should extend this to when the file is corrupted. Currently, 
> PersistentLongFile.readFile throws NumberFormatException in this case and the 
> whole NN crashes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org



[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file

2018-11-05 Thread Lukas Majercak (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lukas Majercak updated HDFS-14043:
--
Attachment: HDFS-14043.003.patch

> Tolerate corrupted seen_txid file
> -
>
> Key: HDFS-14043
> URL: https://issues.apache.org/jira/browse/HDFS-14043
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: hdfs, namenode
>Affects Versions: 2.9.2, 3.1.2, 2.9.3
>Reporter: Lukas Majercak
>Assignee: Lukas Majercak
>Priority: Major
> Attachments: HDFS-14043.001.patch, HDFS-14043.002.patch, 
> HDFS-14043.003.patch
>
>
> We already tolerate IOExceptions when reading seen_txid file from namenode's 
> dirs. So we take the maximum txid of all the *readable* namenode dirs. We 
> should extend this to when the file is corrupted. Currently, 
> PersistentLongFile.readFile throws NumberFormatException in this case and the 
> whole NN crashes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org



[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file

2018-11-05 Thread Lukas Majercak (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lukas Majercak updated HDFS-14043:
--
Attachment: HDFS-14043.002.patch

> Tolerate corrupted seen_txid file
> -
>
> Key: HDFS-14043
> URL: https://issues.apache.org/jira/browse/HDFS-14043
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: hdfs, namenode
>Affects Versions: 2.9.2, 3.1.2, 2.9.3
>Reporter: Lukas Majercak
>Assignee: Lukas Majercak
>Priority: Major
> Attachments: HDFS-14043.001.patch, HDFS-14043.002.patch
>
>
> We already tolerate IOExceptions when reading seen_txid file from namenode's 
> dirs. So we take the maximum txid of all the *readable* namenode dirs. We 
> should extend this to when the file is corrupted. Currently, 
> PersistentLongFile.readFile throws NumberFormatException in this case and the 
> whole NN crashes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org



[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file

2018-11-01 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Íñigo Goiri updated HDFS-14043:
---
Status: Patch Available  (was: Open)

> Tolerate corrupted seen_txid file
> -
>
> Key: HDFS-14043
> URL: https://issues.apache.org/jira/browse/HDFS-14043
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: hdfs, namenode
>Affects Versions: 2.9.2, 3.1.2, 2.9.3
>Reporter: Lukas Majercak
>Assignee: Lukas Majercak
>Priority: Major
> Attachments: HDFS-14043.001.patch
>
>
> We already tolerate IOExceptions when reading seen_txid file from namenode's 
> dirs. So we take the maximum txid of all the *readable* namenode dirs. We 
> should extend this to when the file is corrupted. Currently, 
> PersistentLongFile.readFile throws NumberFormatException in this case and the 
> whole NN crashes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org



[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file

2018-11-01 Thread Lukas Majercak (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lukas Majercak updated HDFS-14043:
--
Description: We already tolerate IOExceptions when reading seen_txid file 
from namenode's dirs. So we take the maximum txid of all the *readable* 
namenode dirs. We should extend this to when the file is corrupted. Currently, 
PersistentLongFile.readFile throws NumberFormatException in this case and the 
whole NN crashes.

> Tolerate corrupted seen_txid file
> -
>
> Key: HDFS-14043
> URL: https://issues.apache.org/jira/browse/HDFS-14043
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: hdfs, namenode
>Affects Versions: 2.9.2, 3.1.2, 2.9.3
>Reporter: Lukas Majercak
>Assignee: Lukas Majercak
>Priority: Major
> Attachments: HDFS-14043.001.patch
>
>
> We already tolerate IOExceptions when reading seen_txid file from namenode's 
> dirs. So we take the maximum txid of all the *readable* namenode dirs. We 
> should extend this to when the file is corrupted. Currently, 
> PersistentLongFile.readFile throws NumberFormatException in this case and the 
> whole NN crashes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org



[jira] [Updated] (HDFS-14043) Tolerate corrupted seen_txid file

2018-11-01 Thread Lukas Majercak (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-14043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lukas Majercak updated HDFS-14043:
--
Attachment: HDFS-14043.001.patch

> Tolerate corrupted seen_txid file
> -
>
> Key: HDFS-14043
> URL: https://issues.apache.org/jira/browse/HDFS-14043
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: hdfs, namenode
>Affects Versions: 2.9.2, 3.1.2, 2.9.3
>Reporter: Lukas Majercak
>Assignee: Lukas Majercak
>Priority: Major
> Attachments: HDFS-14043.001.patch
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org