[ 
https://issues.apache.org/jira/browse/HADOOP-10755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059369#comment-14059369
 ] 

Andrew Wang commented on HADOOP-10755:
--------------------------------------

Hi Eddy, thanks for working on this. Overall looks good, had a few review 
comments:

Text in core-default.xml:
* typo: nagative
* Should mention that setting a negative value disables it
* Suggested text something like this:

{quote}
Expiration time for entries in the the negative user-to-group mapping caching, 
in seconds. This is useful when invalid users are retrying frequently. It is 
suggested to set a small value for this expiration, since a transient error in 
group lookup could temporarily lock out a legitimate user.

Set this to a negative value to disable negative user-to-group caching.
{quote}

* We have a behavior change in getGroups. Previously it would always throw an 
exception if getGroups().isEmpty(), now it depends on this new config. I think 
the config should only control the caching, not the return value. Would be good 
to add a test to enforce this behavior.
* The test looks racy, depends on the sleeps/timings to work. Any possibility 
of improvement here with mocks or reaching in with @VisibleForTesting?

> Support negative caching of user-group mapping
> ----------------------------------------------
>
>                 Key: HADOOP-10755
>                 URL: https://issues.apache.org/jira/browse/HADOOP-10755
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: security
>    Affects Versions: 2.2.0
>            Reporter: Andrew Wang
>            Assignee: Lei (Eddy) Xu
>         Attachments: HADOOP-10755.000.patch, HADOOP-10755.001.patch, 
> HADOOP-10755.002.patch, HDFS-5369.000.patch
>
>
> We've seen a situation at a couple of our customers where interactions from 
> an unknown user leads to a high-rate of group mapping calls. In one case, 
> this was happening at a rate of 450 calls per second with the shell-based 
> group mapping, enough to severely impact overall namenode performance and 
> also leading to large amounts of log spam (prints a stack trace each time).
> Let's consider negative caching of group mapping, as well as quashing the 
> rate of this log message.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to