[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13128381#comment-13128381 ] Hudson commented on HBASE-4078: --- Integrated in HBase-TRUNK #2325 (See [https://builds.apache.org/job/HBase-TRUNK/2325/]) HBASE-4078 Validate store files after flush/compaction nspiegelberg : Files : * /hbase/trunk/CHANGES.txt * /hbase/trunk/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java * /hbase/trunk/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker Fix For: 0.92.0, 0.94.0 Attachments: 0001-Validate-store-files-after-compactions-flushes.patch, 0001-Validate-store-files.patch See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13127103#comment-13127103 ] Hudson commented on HBASE-4078: --- Integrated in HBase-0.92 #62 (See [https://builds.apache.org/job/HBase-0.92/62/]) HBASE-4078 Validate store files after flush/compaction nspiegelberg : Files : * /hbase/branches/0.92/CHANGES.txt * /hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java * /hbase/branches/0.92/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker Fix For: 0.92.0, 0.94.0 Attachments: 0001-Validate-store-files-after-compactions-flushes.patch, 0001-Validate-store-files.patch See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13093320#comment-13093320 ] Lars Hofhansl commented on HBASE-4078: -- When does the corruption actually happen? Does any of StoreFile.Writer.{append|appendMetadata|close}(...) silently fail, leaving a corrupt file? If any of these throws any exception we would skip moving the file anyway. If so, wouldn't it be better to fix that? Or is this a problem deeper in HDFS? Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker Attachments: 0001-Validate-store-files-after-compactions-flushes.patch, 0001-Validate-store-files.patch See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13093326#comment-13093326 ] Lars Hofhansl commented on HBASE-4078: -- Ah never mind me... HDFS flakiness is what this is all about. Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker Attachments: 0001-Validate-store-files-after-compactions-flushes.patch, 0001-Validate-store-files.patch See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081435#comment-13081435 ] stack commented on HBASE-4078: -- I added some comments over on reviewboard but then realized that the patch looks like hbase-4078. Is it same patch? Thanks. Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker Attachments: 0001-Validate-store-files.patch See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081439#comment-13081439 ] stack commented on HBASE-4078: -- Pardon my sillyness above where I am saying that the patch for this issue is the same as the patch for this issue. Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker Attachments: 0001-Validate-store-files.patch See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness
[ https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13064349#comment-13064349 ] stack commented on HBASE-4078: -- This looks like its similar to HBASE-3834 Silent Data Offlining During HDFS Flakiness --- Key: HBASE-4078 URL: https://issues.apache.org/jira/browse/HBASE-4078 Project: HBase Issue Type: Bug Components: io, regionserver Affects Versions: 0.89.20100924, 0.90.3, 0.92.0 Reporter: Nicolas Spiegelberg Assignee: Pritam Damania Priority: Blocker See HBASE-1436 . The bug fix for this JIRA is a temporary workaround for improperly moving partially-written files from TMP into the region directory when a FS error occurs. Unfortunately, the fix is to ignore all IO exceptions, which masks off-lining due to FS flakiness. We need to permanently fix the problem that created HBASE-1436 then at least have the option to not open a region during times of flakey FS. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira