[ 
https://issues.apache.org/jira/browse/SENTRY-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16156250#comment-16156250
 ] 

Hadoop QA commented on SENTRY-1915:
-----------------------------------

Here are the results of testing the latest attachment
https://issues.apache.org/jira/secure/attachment/12885689/SENTRY-1915.02.patch 
against master.

{color:green}Overall:{color} +1 all checks pass

{color:green}SUCCESS:{color} all tests passed

Console output: 
https://builds.apache.org/job/PreCommit-SENTRY-Build/3253/console

This message is automatically generated.

> Sentry is doing a lot of work to convert list of paths to HMSPaths structure
> ----------------------------------------------------------------------------
>
>                 Key: SENTRY-1915
>                 URL: https://issues.apache.org/jira/browse/SENTRY-1915
>             Project: Sentry
>          Issue Type: Bug
>          Components: Sentry
>    Affects Versions: 2.0.0
>            Reporter: Alexander Kolbasov
>            Assignee: Alexander Kolbasov
>         Attachments: SENTRY-1915.01.patch, SENTRY-1915.02.patch
>
>
> It turns out that in 2.0 we changed the way full snapshots are sent from 
> Sentry to HDFS. Before they were using {{HMSPaths}} which used tree structure 
> and eliminated some duplication. Also SENTRY-1827 helped to compressed this 
> on the serialization side.
> Now we are using {{TPathChanges}} structure that is not tree-based and 
> contains very non-efficient way of representing paths: {{required 
> list<list<string>> addPaths;}} so we split each paths on slashes and store 
> list of elements instead of storing a tree. As a result we may use much more 
> memory.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to