[ https://issues.apache.org/jira/browse/HBASE-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16395358#comment-16395358 ]
Josh Elser commented on HBASE-19969: ------------------------------------ [~vrodionov], looks like you might have some flaky tests on Hadoop3. {noformat} [ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 104.317 s <<< FAILURE! - in org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad [ERROR] TestIncBackupDeleteTable[0](org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad) Time elapsed: 101.908 s <<< ERROR! java.io.IOException: java.io.IOException: Failed copy from hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/572cd372c217436b9dd4969ddde3701e_SeqId_205_,hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/d2c3e41730c6415aa98400ccff8b32e7_SeqId_205_ to hdfs://localhost:41991/backupUT/backup_1520771793802 at org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:120) Caused by: java.io.IOException: Failed copy from hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/572cd372c217436b9dd4969ddde3701e_SeqId_205_,hdfs://localhost:41991/user/jenkins/test-data/2cc09bf1-7846-4fd8-9928-b23061612b23/data/ns1/test-1520771750106/cf746153b1afe40a737302073b2956b2/f/d2c3e41730c6415aa98400ccff8b32e7_SeqId_205_ to hdfs://localhost:41991/backupUT/backup_1520771793802 at org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:120) [ERROR] Tests run: 2, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 884.349 s <<< FAILURE! - in org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad [ERROR] TestIncBackupDeleteTable[0](org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad) Time elapsed: 33.157 s <<< ERROR! java.net.ConnectException: Call From 1f03fa77e36a/172.17.0.3 to localhost:41991 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: http://wiki.apache.org/hadoop/ConnectionRefused at org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:93) Caused by: java.net.ConnectException: Connection refused at org.apache.hadoop.hbase.backup.TestIncrementalBackupWithBulkLoad.TestIncBackupDeleteTable(TestIncrementalBackupWithBulkLoad.java:93{noformat} Would be good to take a quick look to rule out test issues (the cnxn refused sounds like it might be just be the node itself). Logs are at https://builds.apache.org/job/HBase%20Nightly/job/master/259/artifact/output-jdk8-hadoop3/ > Improve fault tolerance in backup merge operation > ------------------------------------------------- > > Key: HBASE-19969 > URL: https://issues.apache.org/jira/browse/HBASE-19969 > Project: HBase > Issue Type: Sub-task > Reporter: Vladimir Rodionov > Assignee: Vladimir Rodionov > Priority: Major > Fix For: 3.0.0 > > Attachments: 19969-v4.patch, HBASE-19969-v1.patch, > HBASE-19969-v2.patch, HBASE-19969-v3.patch > > > Some file system operations are not fault tolerant during merge. We delete > backup data in a backup file system, then copy new data over to backup > destination. Deletes can be partial, copy can fail as well -- This message was sent by Atlassian JIRA (v7.6.3#76005)