[ 
https://issues.apache.org/jira/browse/HBASE-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16395358#comment-16395358
 ] 

Josh Elser commented on HBASE-19969:
------------------------------------

[~vrodionov], looks like you might have some flaky tests on Hadoop3.
{noformat}
[ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 104.317 
s <<< FAILURE! - in 
org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad
[ERROR] 
TestIncBackupDeleteTable[0](org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad)
  Time elapsed: 101.908 s  <<< ERROR!
java.io.IOException: java.io.IOException: Failed copy from 
hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/572cd372c217436b9dd4969ddde3701e_SeqId_205_,hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/d2c3e41730c6415aa98400ccff8b32e7_SeqId_205_
 to hdfs://localhost:41991/backupUT/backup_1520771793802
        at 
org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:120)
Caused by: java.io.IOException: Failed copy from 
hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/572cd372c217436b9dd4969ddde3701e_SeqId_205_,hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/d2c3e41730c6415aa98400ccff8b32e7_SeqId_205_
 to hdfs://localhost:41991/backupUT/backup_1520771793802
        at 
org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:120)
[ERROR] Tests run: 2, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 884.349 
s <<< FAILURE! - in 
org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad
[ERROR] 
TestIncBackupDeleteTable[0](org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad)
  Time elapsed: 33.157 s  <<< ERROR!
java.net.ConnectException: Call From 1f03fa77e36a/172.17.0.3 to localhost:41991 
failed on connection exception: java.net.ConnectException: Connection refused; 
For more details see:  http://wiki.apache.org/hadoop/ConnectionRefused
        at 
org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:93)
Caused by: java.net.ConnectException: Connection refused
        at 
org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:93{noformat}
Would be good to take a quick look to rule out test issues (the cnxn refused 
sounds like it might be just be the node itself). Logs are at 
https://builds.apache.org/job/HBase%20Nightly/job/master/259/artifact/output-jdk8-hadoop3/

> Improve fault tolerance in backup merge operation
> -------------------------------------------------
>
>                 Key: HBASE-19969
>                 URL: https://issues.apache.org/jira/browse/HBASE-19969
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>            Priority: Major
>             Fix For: 3.0.0
>
>         Attachments: 19969-v4.patch, HBASE-19969-v1.patch, 
> HBASE-19969-v2.patch, HBASE-19969-v3.patch
>
>
> Some file system operations are not fault tolerant during merge. We delete 
> backup data in a backup file system, then copy new data over to backup 
> destination. Deletes can be partial, copy can fail as well



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to