[
https://issues.apache.org/jira/browse/SENTRY-872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308437#comment-15308437
]
Sravya Tirukkovalur commented on SENTRY-872:
--------------------------------------------
Thanks for uploading the updated deisgn doc [~cmccabe]! Some comments:
1. In Section "HIVEĀ7973: Hive Replication Support ", seems like there is some
text missing at the end.
2. In Section "Future work", "The HDFS Plugin Should Use Update Log IDs". In
current design, we apply deltas in the NN plugin. I do not believe we
necessarily buffer deltas in NN, as there is no reason. So we may want to
remove this section.
3. We might want to add a section about "Sentry passive" which follows active
versus "Sentry standby" which warms up only when it acquires leadership? I
think we are inclining towards a passive which can serve requests with minimal
downtime, that is acquiring leadership should not take too long. But might be
better if we state it explicitly, so that we evaluate the alternatives
thoroughly?
4. There are some slight alternatives we might want to consider in the path of
propagating HMS updates to Sentry and NN. In the proposed design, we will need
to replicate HMS <obj,path> information as well as delta changes of
it(add/delete <ob,path>) in Sentry db for the passive to follow. Other option
is for passive to directly talk to HMS to get these deltas. If the only
motivation for replicating this in sentry db is bringing passive upto speed, I
think the later approach is preferable as there is no real need to replicate
both info and deltas? But, other parameter to consider is around full update.
That is, when Sentry restarts in the later approach, we will have to trigger a
full update from HMS. But without a proper snapshot solution in HMS, this would
mean we will have to lock HMS writes for this period, which means HMS is not
available for writes for this period.
5. Would be useful to have a detailed protocol description especially around
what happens when different services restart, and what in memory state does
each service rely on.
Let me know what you think and we can update the doc accordingly. Thanks!
> Uber jira for HMS HA + Sentry HA with HDFS plugin improvements
> --------------------------------------------------------------
>
> Key: SENTRY-872
> URL: https://issues.apache.org/jira/browse/SENTRY-872
> Project: Sentry
> Issue Type: Improvement
> Components: Hdfs Plugin
> Affects Versions: 1.5.0
> Reporter: Sravya Tirukkovalur
> Assignee: Sravya Tirukkovalur
> Fix For: 1.8.0
>
> Attachments: SENTRY-872.0.patch, SENTRY-872.pdf, SENTRY-872_design.pdf
>
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)