[ 
https://issues.apache.org/jira/browse/OAK-8167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16810862#comment-16810862
 ] 

Vikas Saurabh edited comment on OAK-8167 at 4/5/19 2:06 PM:
------------------------------------------------------------

[~anchela], while I agree it can leak (not right away - but that's a bad 
arguement) information such that one can get an estimate of number of items 
that matched a query. But do note that by default we do "secure" facet 
evaluation - for obvious reason that's unscalable and not useful for any kind 
of practical facet implementation. Maybe we should document this as a warning 
that "statistical" facet evaluation can potentially leak information about 
number of items for a given query. How worrisome is that leakage though is 
beyond my abilities to assess.


was (Author: catholicon):
[~anchela], while I agree it can leak (not right away - but that's a bad 
arguement) information such that one can get an estimate of number of items 
that matched a query. But do note that by default we do "secure" facet 
evaluation - for obvious reason that's unscalable and useful for any kind of 
practical facet implementation. Maybe we should document this as a warning that 
"statistical" facet evaluation can potentially leak information about number of 
items for a given query. How worrisome is that leakage though is beyond my 
abilities to assess.

> With uneven distribution of ACL restriction across facet labels statistical 
> facet count become too inaccurate
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: OAK-8167
>                 URL: https://issues.apache.org/jira/browse/OAK-8167
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: lucene, query
>    Affects Versions: 1.6.16
>            Reporter: Kelvin Xu
>            Priority: Major
>              Labels: vulnerability
>
> With the statistical mode, facet count is updated proportionally to the 
> percentage of accessible samples, which works for secured contents scattered 
> across different facets. For edge case where the whole facet (results) is not 
> accessible, the count still shows a number after the sampling percent is 
> applied. Even if the number is small, user experience is 
> misleading/inaccurate as nothing would return when the facet is clicked 
> (applied as a query condition).
> For example, a ACLs/CUGs guarded "private" folder, in which all the assets 
> are tagged with the same facet value. Non authorized user may still see this 
> facet with a count but gets nothing when clicking on the facet.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to