[
https://issues.apache.org/jira/browse/HBASE-20642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16491404#comment-16491404
]
stack commented on HBASE-20642:
-------------------------------
bq. If the master is swapped, nonces map will be rebuilt from uncompleted
procedure during the replay so we should not have a problem checking on the new
master as well. right?
That is not my understanding. The nonces are in an in-memory-only map in the
Master process. They will not be migrated from one Master to the new one....
so, even if you put calls behind a nonce-check, it'll fail since the nonce-map
is empty on new Master.
bq. Yes, they will get this on their first submission if the master goes down
in between.
Because the Master is failing which broke the synchronous wait on add column?
Maybe add a check if master is going down and if it is throw that for an
exception instead of doing this pre-flight check against current state of table
descriptor? Would that be more meaningful?
bq. This is addColumnFamily() synchronous call and it is getting moved to the
new master.
It is pretty cool that the call keeps going though the Master has crashed... I
think it is a bit much to expect that this call can pick up where it left off
on the old Master though. It has no reference to the original transaction (it
does not have a Future .... ). We want to move folks over to the async calls
where they check to see if the Procedure is completed..... Thats the style we'd
prefer.
Meantime, I agree this exception message is confusing. Lets fix it (see above
for suggestion).
bq. No problem, probably I'm not putting the problem in right words stack
Nah. I think its the receiving end that has the problem (smile).
Thanks.
> IntegrationTestDDLMasterFailover throws 'InvalidFamilyOperationException
> -------------------------------------------------------------------------
>
> Key: HBASE-20642
> URL: https://issues.apache.org/jira/browse/HBASE-20642
> Project: HBase
> Issue Type: Bug
> Reporter: Ankit Singhal
> Assignee: Ankit Singhal
> Priority: Major
> Attachments: HBASE-20642.patch
>
>
> [~romil.choksi] reported that IntegrationTestDDLMasterFailover is failing
> while adding column family during the time master is restarting.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)