[kudu-CR] KUDU-2191 (10/n): Hive Metastore notification log event listener

Adar Dembo (Code Review) Wed, 11 Apr 2018 11:32:07 -0700

Adar Dembo has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/8313 )

Change subject: KUDU-2191 (10/n): Hive Metastore notification log event listener
......................................................................

Patch Set 7:

(13 comments)

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.h
File src/kudu/master/catalog_manager.h:

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.h@980
PS7, Line 980: Protected by leader_lock_;
I think it makes more sense to protect with 'lock_', since this is cached
in-memory state much like the info maps.

But see the comment I left around StoreLatestNotificationLogEventId; I think
this will be cleanest if converted into a CowObject.

http://gerrit.cloudera.org:8080/#/c/8313/4/src/kudu/master/catalog_manager.cc
File src/kudu/master/catalog_manager.cc:

http://gerrit.cloudera.org:8080/#/c/8313/4/src/kudu/master/catalog_manager.cc@2285
PS4, Line 2285: es while the HMS is
: // unavailable, because that would cause the catalogs t
> I've added more commentary, let me know if it's still confusing.
I was actually wrong in my statement anyway: we don't roll back HMS-initiated
DROP TABLE either.

The only inconsistency between the two in behavior is that we'll send an ALTER
to HMS even if it was originally initiated by HMS. You wrote that this helps
keep the catalogs converged. Can you provide an example scenario? I'm asking
because this seems a little fragile to me: I think it'd be easy for someone to
accidentally (or intentionally) change HMS's no-op ALTER behavior to publish
notification events anyway, at which point this ALTER would generate an
infinite loop.

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.cc
File src/kudu/master/catalog_manager.cc:

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.cc@2304
PS7, Line 2304: Rollback is skipped when the
: // alter is in response to a notification log event, since
unhandled
: // notification log events are automatically retried during
the next polling
: // period.
This seems somewhat fragile since there's nothing actually enforcing that the
polling task retries events whose Kudu-side changes failed. Meaning, someone
could change the retry behavior tomorrow and then we'd have diverging catalogs.

Wouldn't it be simpler to do the rollback here anyway? SysCatalogTable::Write()
failures are super rare, so it's not like it'd be a perf hit to rollback, right?

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.cc@2423
PS7, Line 2423: // Update the cached HMS notification log event ID, if it
changed.
I think it'd be cleaner if you routed all requests to update
hms_notification_log_event_id_ through StoreLatestNotificationLogEventId(). For
example, it'll let you centralize all of the locking in one place. To do that
you'll need to parameterize the SysCatalogTable::Write().

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.cc@3935
PS7, Line 3935: Status CatalogManager::InitLatestNotificationLogEventId() {
DCHECK(hms_catalog_) here too, and in StoreLatest?

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/catalog_manager.cc@3949
PS7, Line 3949: // It wouldn't typically be correct to write to the field
under only a read
: // lock, however this instance is safe because it's only
called by the
: // HmsNotificationLogListenerTask singleton thread.
I'm not a fan of this shortcut. Why not use proper locking? Something like this:
1. Create lock::guard for some mutex (maybe a new one).
2. Create unique_lock lock_
3. Short-circuit success if hms_notification_log_event_id_ >= event_id
4. Release lock_
5. Perform the Write()
6. If Write() failed, return the error
7. Lock the unique_lock for lock_
8. Update hms_notification_log_event_id_
9. Return success

It's actually not much more work to store hms_notification_log_event_id_ as a
CowObject and protect it via CowLock. It may even be simpler, since it'll hide
the new mutex from the rest of CatalogManager by building it (well, an RWCLock)
into the CowObject. Then this looks something like this:
1. CowLock hms_notification_log_event_id_ for write mode
2. Short-circuit success if hms_notification_log_event_id_ >= event_id
3. Update the value of hms_notification_log_event_id_ via mutable_data
4. Perform the Write()
5. If Write() failed, return the error
6. Commit the CowLock
7. Return success

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/hms_notification_log_listener.h
File src/kudu/master/hms_notification_log_listener.h:

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/hms_notification_log_listener.h@86
PS7, Line 86: // Initialize the HMS notification log listener. The catalog
manager must be
Nit: Initializes (to match the tense in Handle*Event). Please do the same for
Init, Shutdown, and Poll too.

http://gerrit.cloudera.org:8080/#/c/8313/4/src/kudu/master/hms_notification_log_listener.cc
File src/kudu/master/hms_notification_log_listener.cc:

http://gerrit.cloudera.org:8080/#/c/8313/4/src/kudu/master/hms_notification_log_listener.cc@35
PS4, Line 35: #include "kudu/hms/hive_metastore_constants.h"
> I'm always in favor of shorter flag names over longer, however I feel it's
OK, that's a reasonable argument.

http://gerrit.cloudera.org:8080/#/c/8313/4/src/kudu/master/hms_notification_log_listener.cc@116
PS4, Line 116: ask::Poll(
> Isn't that just a special case of 'not leader'?
I suppose, but if you look at the various exit paths out of the
ScopedLeaderSharedLock constructor, you'll see that not every error message is
about not being the leader.

Anyway, it's just a VLOG, so I don't really care.

http://gerrit.cloudera.org:8080/#/c/8313/4/src/kudu/master/hms_notification_log_listener.cc@291
PS4, Line 291: // Kudu catalog's metadata for the table. Checking the name in
addition to the
: // ID prevents the HMS from altering a Kudu table when it has
an outdated
: /
> No, the columns could have changed or some other piece of important metadat
Yeah but you're not including the new schema in 'req'. I assumed that was
intentional; that this is for renames only. Was not the case?

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/hms_notification_log_listener.cc
File src/kudu/master/hms_notification_log_listener.cc:

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/hms_notification_log_listener.cc@87
PS7, Line 87: CHECK_OK(ThreadJoiner(thread_.get()).Join());
Should probably null out thread_ when you're done so the CHECK on L85 will fire
if someone calls Shutdown twice.

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/hms_notification_log_listener.cc@201
PS7, Line 201: // The durable event ID gets updated every time we make a
change in response
: // to a log notification, however not every log notification
results in a
: // change (for instance, a notification pertaining to a
Parquet table). To
: // avoid replaying these notifications we persist the latest
processed
: // notification log event ID after polling. This is best
effort, since failing
: // to update the ID should only results in wasted work, not an
unsynchronized
: // catalog.
Hmm, upon second thought, updating the event ID with every reflected operation
could lead to a lot of unnecessary deltas in the master tablet after having
worked through a particularly large poll. Perhaps we shouldn't do that, and
just update once here when the poll is finished?

The disadvantage to doing just one update is that if we lose leadership as
we're processing HMS events, the new leader master will have to retry the
entire HMS event set. But maybe that exact scenario is rare enough that the
trade-off (reduced delta compaction in the master) is worth it. Certainly the
code is less complex if we just update once.

http://gerrit.cloudera.org:8080/#/c/8313/7/src/kudu/master/hms_notification_log_listener.cc@290
PS7, Line 290: drop
alter

--
To view, visit http://gerrit.cloudera.org:8080/8313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I32ed099c44a593ffe514152135957018f21ed775
Gerrit-Change-Number: 8313
Gerrit-PatchSet: 7
Gerrit-Owner: Dan Burkert <danburk...@apache.org>
Gerrit-Reviewer: Adar Dembo <a...@cloudera.com>
Gerrit-Reviewer: Dan Burkert <danburk...@apache.org>
Gerrit-Reviewer: Hao Hao <hao....@cloudera.com>
Gerrit-Reviewer: Kudu Jenkins
Gerrit-Reviewer: Tidy Bot
Gerrit-Comment-Date: Wed, 11 Apr 2018 18:31:38 +0000
Gerrit-HasComments: Yes

[kudu-CR] KUDU-2191 (10/n): Hive Metastore notification log event listener

Reply via email to