[ 
https://issues.apache.org/jira/browse/HBASE-18023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16051205#comment-16051205
 ] 

David Harju commented on HBASE-18023:
-------------------------------------

Thanks [~elserj]!

Good suggestions, I'll work to implement them and then post a new patch.

I added the "(See https://issues.apache.org/jira/browse/HBASE-18023)" reference 
in response to [~stack]'s suggestion to add to the log line, "...a pointer to 
doc or issue on why many small batches will go down better than a few massive 
ones".  If there's a better doc or issue to reference I can replace it but 
otherwise I can remove the reference altogether.

As for the mocking suggestion at the bottom, I was resistant to expanding / 
creating any new public methods for objects in the patch (which I may need to 
do in order to do the mock verification you suggest), which is why I went with 
the more brittle verification you saw, but there does seem to be precedent in 
the code for creating methods for testing purposes only so I'll go ahead and 
make those "for testing purposes only" public access points (either public 
methods delivering the logging string or a public ctor for RSRpcServices which 
takes in some kind of logging delegate).



> Log multi-* requests for more than threshold number of rows
> -----------------------------------------------------------
>
>                 Key: HBASE-18023
>                 URL: https://issues.apache.org/jira/browse/HBASE-18023
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver
>            Reporter: Clay B.
>            Assignee: David Harju
>            Priority: Minor
>         Attachments: HBASE-18023.master.001.patch
>
>
> Today, if a user happens to do something like a large multi-put, they can get 
> through request throttling (e.g. it is one request) but still crash a region 
> server with a garbage storm. We have seen regionservers hit this issue and it 
> is silent and deadly. The RS will report nothing more than a mysterious 
> garbage collection and exit out.
> Ideally, we could report a large multi-* request before starting it, in case 
> it happens to be deadly. Knowing the client, user and how many rows are 
> affected would be a good start to tracking down painful users.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to