[ 
https://issues.apache.org/jira/browse/HDFS-13610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16524657#comment-16524657
 ] 

Erik Krogen commented on HDFS-13610:
------------------------------------

Looking through the test failures, I noticed some testing issues, as well as 
some bugs with the existing code. Given that this is meant to be a "cleanup" 
JIRA I plan to fix the bugs here; if anyone disagrees let me know and I can 
file separate JIRAs.

* Due to changes in HDFS-13609 to enable use of the RPC mechanism whenever 
inProgressOK is true, there were codepaths which would use the RPC mechanism 
even when dfs.ha.tail-edits.in-progress was false, meaning that the JNs did not 
enable the cache. I have updated the QJM logic to only use 
{{selectRpcInputStreams}} if this config is true.
* Fixed a bug where there was a false error logged when the layout version 
changed. Also fixed the logging when a layout version change occurs to avoid 
printing out a placeholder value ({{Integer.MAX_VALUE}}).
* Fixed a test in {{TestQuorumJournalManager}}.
* The tests in {{TestStandbyInProgressTail}}, now that they use the RPC 
mechanism, were subject to the failure scenario described below. I updated the 
tests to be resilient to this by allowing them to wait briefly for the expected 
edit to be tailed.
** JNs A, B, and C are at txID 1.
** A write is persisted to a quorum of JNs, say A and B. They are now at txID 
2, and C is at txID 1.
** The edit tailing succeeds from a different quorum of JNs, say B and C. Only 
transactions up to txID 1 will be tailed.
** An assertion requiring that the standby sees txID 2 fails.

I have uploaded a v002 patch incorporating these changes.

> [Edit Tail Fast Path Pt 4] Cleanup: integration test, documentation, remove 
> unnecessary dummy sync
> --------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-13610
>                 URL: https://issues.apache.org/jira/browse/HDFS-13610
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: ha, journal-node, namenode
>            Reporter: Erik Krogen
>            Assignee: Erik Krogen
>            Priority: Major
>         Attachments: HDFS-13610-HDFS-12943.000.patch, 
> HDFS-13610-HDFS-12943.001.patch, HDFS-13610-HDFS-12943.002.patch
>
>
> See HDFS-13150 for full design.
> This JIRA is targeted at cleanup tasks:
> * Add in integration testing. We can expand {{TestStandbyInProgressTail}}
> * Documentation in HDFSHighAvailabilityWithQJM
> * Remove the dummy sync added as part of HDFS-10519; it is unnecessary since 
> now in-progress tailing does not rely as heavily on the JN committedTxnId



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to