[ 
https://issues.apache.org/jira/browse/ACCUMULO-3838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553134#comment-14553134
 ] 

Josh Elser commented on ACCUMULO-3838:
--------------------------------------

Re-reading the master log, it is possible that the test case just didn't read 
the metadata table before the master deleted the status record from metadata 
(after it copied it to the replication table).

{noformat}
2015-05-20 07:33:03,043 [replication.StatusMaker] DEBUG: Creating replication status record for file:/accumulo/test/target/mini-tests/org.apache.accumulo.test.replication.ReplicationIT_replicationEntriesPrecludeWalDeletion/accumulo/wal/hostname+59851/5dfa3257-859c-4ecd-9b4f-4abe5bbd04ac on table 1 with [begin: 0 end: 0 infiniteEnd: true closed: true createdTime: 1432107178863].
2015-05-20 07:33:03,046 [replication.StatusMaker] INFO : Creating order record for file:/accumulo/test/target/mini-tests/org.apache.accumulo.test.replication.ReplicationIT_replicationEntriesPrecludeWalDeletion/accumulo/wal/hostname+59851/5dfa3257-859c-4ecd-9b4f-4abe5bbd04ac for 1 with [begin: 0 end: 0 infiniteEnd: true closed: true createdTime: 1432107178863]
2015-05-20 07:33:03,048 [replication.StatusMaker] DEBUG: Deleting ~replfile:/accumulo/test/target/mini-tests/org.apache.accumulo.test.replication.ReplicationIT_replicationEntriesPrecludeWalDeletion/accumulo/wal/hostname+59851/5dfa3257-859c-4ecd-9b4f-4abe5bbd04ac stat:1 [] 5 false from metadata table as it's no longer needed
{noformat}

If the Master saw the status record in accumulo.metadata, copied it to the 
replication table, and deleted it from metadata before the test's thread could 
read the status record from metadata, the test could fail as described above.
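
To make that interleaving concrete, here is a toy sketch of the suspected race. It is only a model: plain dicts stand in for accumulo.metadata and the replication table, the function names are invented for illustration and are not Accumulo APIs, and the WAL path is a shortened, hypothetical one. It shows that a reader which looks only at metadata after the Master's pass sees nothing, while a reader which checks both tables still finds the WAL.

```python
# Toy model of the suspected race. Dicts stand in for the two tables;
# none of these names are real Accumulo APIs (all are assumptions).

def master_status_maker_pass(metadata, replication):
    """Copy each status record to the replication table, then delete it from metadata."""
    for wal, status in list(metadata.items()):
        replication[wal] = status  # mirrors "Creating replication status record"
        del metadata[wal]          # mirrors "Deleting ~repl... from metadata table"


def wals_seen_metadata_only(metadata):
    """What a reader sees if it scans only the metadata table."""
    return set(metadata)


def wals_seen_both_tables(metadata, replication):
    """What a reader sees if it scans metadata and the replication table."""
    return set(metadata) | set(replication)


wal = "file:/accumulo/wal/hostname+59851/example-wal"  # hypothetical, shortened path
metadata = {wal: "closed"}
replication = {}

# The Master wins the race: its pass completes before the test thread reads metadata.
master_status_maker_pass(metadata, replication)

missed = wals_seen_metadata_only(metadata)             # the record is already gone
found = wals_seen_both_tables(metadata, replication)   # the record is still visible
```

Reading the union of both tables is just one possible way to make the expected set insensitive to the Master's timing; whether that is the right change for the test still needs checking against the actual test logic.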

> ReplicationIT.replicationEntriesPrecludeWalDeletion failed because it missed 
> an expected WAL
> --------------------------------------------------------------------------------------------
>
>                 Key: ACCUMULO-3838
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3838
>             Project: Accumulo
>          Issue Type: Bug
>          Components: test
>    Affects Versions: 1.7.0
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>            Priority: Minor
>             Fix For: 1.8.0, 1.7.1
>
>
> Automated testing found a situation where we have 4 WALs in the replication 
> table, but only expected to find 3. Thus, the test failed.
> Looking at the master log, there shouldn't be any way that the test _didn't_ 
> see this log. My hunch is that the test accidentally removed one of the logs 
> that should have been left in the expected set.
> Need to revisit the logic to make sure it's solid.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
