[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-10-16 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13128381#comment-13128381
 ] 

Hudson commented on HBASE-4078:
---

Integrated in HBase-TRUNK #2325 (See 
[https://builds.apache.org/job/HBase-TRUNK/2325/])
HBASE-4078 Validate store files after flush/compaction

nspiegelberg : 
Files : 
* /hbase/trunk/CHANGES.txt
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java
* 
/hbase/trunk/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java


 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker
 Fix For: 0.92.0, 0.94.0

 Attachments: 
 0001-Validate-store-files-after-compactions-flushes.patch, 
 0001-Validate-store-files.patch


 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-10-13 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13127103#comment-13127103
 ] 

Hudson commented on HBASE-4078:
---

Integrated in HBase-0.92 #62 (See 
[https://builds.apache.org/job/HBase-0.92/62/])
HBASE-4078 Validate store files after flush/compaction

nspiegelberg : 
Files : 
* /hbase/branches/0.92/CHANGES.txt
* 
/hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java
* 
/hbase/branches/0.92/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java


 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker
 Fix For: 0.92.0, 0.94.0

 Attachments: 
 0001-Validate-store-files-after-compactions-flushes.patch, 
 0001-Validate-store-files.patch


 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-08-29 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13093320#comment-13093320
 ] 

Lars Hofhansl commented on HBASE-4078:
--

When does the corruption actually happen?

Does any of StoreFile.Writer.{append|appendMetadata|close}(...) silently fail, 
leaving a corrupt file? If any of these throws any exception we would skip 
moving the file anyway.
If so, wouldn't it be better to fix that?

Or is this a problem deeper in HDFS?


 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker
 Attachments: 
 0001-Validate-store-files-after-compactions-flushes.patch, 
 0001-Validate-store-files.patch


 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-08-29 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13093326#comment-13093326
 ] 

Lars Hofhansl commented on HBASE-4078:
--

Ah never mind me... HDFS flakiness is what this is all about.

 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker
 Attachments: 
 0001-Validate-store-files-after-compactions-flushes.patch, 
 0001-Validate-store-files.patch


 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-08-08 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081435#comment-13081435
 ] 

stack commented on HBASE-4078:
--

I added some comments over on reviewboard but then realized that the patch 
looks like hbase-4078.  Is it same patch?  Thanks.

 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker
 Attachments: 0001-Validate-store-files.patch


 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-08-08 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081439#comment-13081439
 ] 

stack commented on HBASE-4078:
--

Pardon my sillyness above where I am saying that the patch for this issue is 
the same as the patch for this issue.

 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker
 Attachments: 0001-Validate-store-files.patch


 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-4078) Silent Data Offlining During HDFS Flakiness

2011-07-12 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13064349#comment-13064349
 ] 

stack commented on HBASE-4078:
--

This looks like its similar to HBASE-3834

 Silent Data Offlining During HDFS Flakiness
 ---

 Key: HBASE-4078
 URL: https://issues.apache.org/jira/browse/HBASE-4078
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.89.20100924, 0.90.3, 0.92.0
Reporter: Nicolas Spiegelberg
Assignee: Pritam Damania
Priority: Blocker

 See HBASE-1436 .  The bug fix for this JIRA is a temporary workaround for 
 improperly moving partially-written files from TMP into the region directory 
 when a FS error occurs.  Unfortunately, the fix is to ignore all IO 
 exceptions, which masks off-lining due to FS flakiness.  We need to 
 permanently fix the problem that created HBASE-1436  then at least have the 
 option to not open a region during times of flakey FS.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira