[ 
https://issues.apache.org/jira/browse/HBASE-12364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14208524#comment-14208524
 ] 

Gunnar Tapper commented on HBASE-12364:
---------------------------------------

I don't know what CP means?

Thank you,

Gunnar

Download a free version of HPDSM, a unified big-data administration tool for 
Vertica and Hadoop at: HP DSM Download

“People don’t know what they want until you show it to them… Our task is to 
read things that are not yet on the page.” — Steve Jobs



> API for query metrics
> ---------------------
>
>                 Key: HBASE-12364
>                 URL: https://issues.apache.org/jira/browse/HBASE-12364
>             Project: HBase
>          Issue Type: Improvement
>          Components: metrics
>    Affects Versions: 0.98.6.1
>         Environment: Any Hadoop distribution.
>            Reporter: Gunnar Tapper
>
> Request based on a discussion with Nick Dimiduk at Strata.
> Background: IT organizations operate on reports based on metrics. They look 
> for comparative statistics such as number of queries per user per day per 
> connection. Further, troubleshooting is often based on questions such as 
> "what queries are each running, which user is using the most resources, and 
> what query among the queries that a specific user need to be tuned?"
> Currently, the slow-query log does not provide the instrumentation needed for 
> management applications to obtain and analyze this level of information. In 
> itself, they slow-query doesn't contain important information that allows 
> mapping of the information to user, connection, application, and so on.
> Further, the slow-query log doesn't log each get and scan, which means that 
> it's not possible to see all queries that have been run against the HBase 
> database.
> Preferably, a REST API is provided to obtain the required information, which 
> should be extended so that each query can be mapped to environment 
> information; for example:
> • Account
> • Account string
> • Client IP address
> • Query text
> • Session ID
> • Date and time
> • User name
> Start, Status, and Completion records should be provided so that it's 
> possible to determine the progress and outcome of any given query. Further, 
> status and completion information should contain information about resource 
> usage; for example:
> • CPU time
> • Memory 
> • I/Os
> • Rows read/written
> • Objects accessed
> • Wait times
> Preferably, counters are provided in both cumulative and delta (since query 
> start) formats.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to