[jira] Updated: (HDFS-1594) When the disk becomes full Namenode is getting shutdown and not able to recover

2011-01-25 Thread Devaraj K (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated HDFS-1594: Status: Open (was: Patch Available) When the disk becomes full Namenode is getting shutdown and not able

[jira] Created: (HDFS-1596) Move secondary namenode checkpoint configs from core-default.xml to hdfs-default.xml

2011-01-25 Thread Patrick Angeles (JIRA)
Move secondary namenode checkpoint configs from core-default.xml to hdfs-default.xml Key: HDFS-1596 URL: https://issues.apache.org/jira/browse/HDFS-1596 Project:

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Tsz Wo (Nicholas), SZE (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986554#action_12986554 ] Tsz Wo (Nicholas), SZE commented on HDFS-1595: -- {code} +int

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986560#action_12986560 ] Todd Lipcon commented on HDFS-1595: --- I'll be honest: I don't know the new Append code in

[jira] Commented: (HDFS-1295) Improve namenode restart times by short-circuiting the first block reports from datanodes

2011-01-25 Thread Matt Foley (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986564#action_12986564 ] Matt Foley commented on HDFS-1295: -- Hi Dhruba, I think this is a really important

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Tsz Wo (Nicholas), SZE (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986589#action_12986589 ] Tsz Wo (Nicholas), SZE commented on HDFS-1595: -- I'll be honest: I don't know

[jira] Updated: (HDFS-863) Potential deadlock in TestOverReplicatedBlocks

2011-01-25 Thread Ken Goodhope (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Goodhope updated HDFS-863: -- Attachment: HDFS-863.patch Agreed, and done. Reran tests with the following results [junit] Test

[jira] Updated: (HDFS-863) Potential deadlock in TestOverReplicatedBlocks

2011-01-25 Thread Ken Goodhope (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Goodhope updated HDFS-863: -- Attachment: (was: HDFS-863.patch) Potential deadlock in TestOverReplicatedBlocks

[jira] Updated: (HDFS-863) Potential deadlock in TestOverReplicatedBlocks

2011-01-25 Thread Ken Goodhope (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Goodhope updated HDFS-863: -- Attachment: HDFS-863.patch Potential deadlock in TestOverReplicatedBlocks

[jira] Commented: (HDFS-863) Potential deadlock in TestOverReplicatedBlocks

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986611#action_12986611 ] Todd Lipcon commented on HDFS-863: -- Hi Ken. Looks good except for one nit - there are some

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986612#action_12986612 ] Hairong Kuang commented on HDFS-1595: - In the case we found, F has a faulty network

[jira] Updated: (HDFS-863) Potential deadlock in TestOverReplicatedBlocks

2011-01-25 Thread Ken Goodhope (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Goodhope updated HDFS-863: -- Attachment: HDFS-863.patch Thought I caught all of those, but obviously not. Found a few more and

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Kan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986624#action_12986624 ] Kan Zhang commented on HDFS-1595: - Looks like we have 3 options when the pipeline is reduced

[jira] Updated: (HDFS-863) Potential deadlock in TestOverReplicatedBlocks

2011-01-25 Thread Ken Goodhope (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Goodhope updated HDFS-863: -- Attachment: HDFS-863.patch Finally got my auto format set up right and it found a couple more issues my

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986634#action_12986634 ] Todd Lipcon commented on HDFS-1595: --- Another option not mentioned above that we use in

[jira] Commented: (HDFS-1580) Add interface for generic Write Ahead Logging mechanisms

2011-01-25 Thread Jitendra Nath Pandey (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986709#action_12986709 ] Jitendra Nath Pandey commented on HDFS-1580: The interface also needs to have

[jira] Commented: (HDFS-1580) Add interface for generic Write Ahead Logging mechanisms

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986718#action_12986718 ] Todd Lipcon commented on HDFS-1580: --- Above sounds reasonable with respect to 1073. Is

[jira] Commented: (HDFS-1582) Remove auto-generated native build files

2011-01-25 Thread Eli Collins (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986743#action_12986743 ] Eli Collins commented on HDFS-1582: --- Patch looks good. What testing has been done to

[jira] Created: (HDFS-1597) Misplaced assertion in FSEditLog.logSync

2011-01-25 Thread Todd Lipcon (JIRA)
Misplaced assertion in FSEditLog.logSync Key: HDFS-1597 URL: https://issues.apache.org/jira/browse/HDFS-1597 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.22.0 Reporter:

[jira] Commented: (HDFS-1597) Misplaced assertion in FSEditLog.logSync

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986762#action_12986762 ] Todd Lipcon commented on HDFS-1597: --- The race is the following: ||Thread A||Thread B||

[jira] Updated: (HDFS-1597) Misplaced assertion in FSEditLog.logSync

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1597: -- Attachment: illustrate-test-failure.txt Here's a little hack I did that makes the test fail reliably

[jira] Commented: (HDFS-1597) Misplaced assertion in FSEditLog.logSync

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986770#action_12986770 ] Todd Lipcon commented on HDFS-1597: --- Actually in trunk there's a second bug that affects

[jira] Commented: (HDFS-1597) Misplaced assertion in FSEditLog.logSync

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986773#action_12986773 ] Todd Lipcon commented on HDFS-1597: --- bq. As of HDFS-119 syntxid is set in the finally

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Tsz Wo (Nicholas), SZE (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986774#action_12986774 ] Tsz Wo (Nicholas), SZE commented on HDFS-1595: -- Resurrecting the pipeline with

[jira] Updated: (HDFS-1469) TestBlockTokenWithDFS fails on trunk

2011-01-25 Thread Konstantin Boudnik (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Boudnik updated HDFS-1469: - Attachment: log.gz I have run a slightly modified test (converted to JUnit 4 with better

[jira] Commented: (HDFS-1469) TestBlockTokenWithDFS fails on trunk

2011-01-25 Thread Konstantin Boudnik (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986781#action_12986781 ] Konstantin Boudnik commented on HDFS-1469: -- Forgot to mention that the timeout

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986788#action_12986788 ] Koji Noguchi commented on HDFS-1595: bq. So this faulty node F has no problem receiving

[jira] Updated: (HDFS-1597) Batched edit log syncs can reset synctxid throw assertions

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1597: -- Description: The top of FSEditLog.logSync has the following assertion: {code} assert

[jira] Updated: (HDFS-1597) Batched edit log syncs can reset synctxid throw assertions

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1597: -- Attachment: hdfs-1597.txt Here's a patch containing a fix and also two new unit tests that verify the

[jira] Commented: (HDFS-1593) Allow a datanode to copy a block to a datanode on a foreign HDFS cluster.

2011-01-25 Thread Sanjay Radia (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986852#action_12986852 ] Sanjay Radia commented on HDFS-1593: In the case of the NN issuing a copy operation,

[jira] Commented: (HDFS-1595) DFSClient may incorrectly detect datanode failure

2011-01-25 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12986861#action_12986861 ] dhruba borthakur commented on HDFS-1595: It appears that Todd's proposal could work

[jira] Updated: (HDFS-1597) Batched edit log syncs can reset synctxid throw assertions

2011-01-25 Thread Todd Lipcon (JIRA)
[ https://issues.apache.org/jira/browse/HDFS-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1597: -- Status: Patch Available (was: Open) Batched edit log syncs can reset synctxid throw assertions