[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2022-03-15 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17507211#comment-17507211
 ] 

Nick Dimiduk commented on HBASE-25865:
--

I have converted HBASE-25958 into an independent JIRA because I don't think it 
needs to block 2.5.0.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Blocker
> Fix For: 2.5.0, 2.6.0, 3.0.0-alpha-3
>
> Attachments: image1.png, image2.png
>
>
> After several months of debugging and tuning the balancer and normalizer on a 
> large production cluster, we found that working from visualizations of the 
> current region state was very useful for understanding behaviors and 
> quantifying improvements we made along the way. Specifically, we found that a 
> chart of total assigned region count and total assigned region store file 
> size per table per host was immensely useful for tuning the balancer. 
> Histograms of store file size made understanding normalizer activity much 
> more intuitive.
> Our scripts would parse the output of the shell's {{status 'detailed'}} 
> command, extract the desired metric, and produce charts. I'd like to build 
> the equivalent functionality into the master UI, with data coming directly 
> from the {{ClusterMetrics}} object and rendered as an interactive chart in 
> the browser.
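
For readers who want a concrete picture of the per-table, per-host rollup 
described above, here is a minimal sketch in Python. It assumes the cluster 
state has already been flattened into one record per region; the field names 
(host, table, store_file_size_mb) and the sample values are hypothetical and 
are not the ClusterMetrics schema or the output format of the shell command.

```python
from collections import defaultdict

# Hypothetical flattened records, one per assigned region. The field names
# are illustrative only; they are not the ClusterMetrics JSON schema.
regions = [
    {"host": "rs1.example.com", "table": "t1", "store_file_size_mb": 512},
    {"host": "rs1.example.com", "table": "t1", "store_file_size_mb": 256},
    {"host": "rs1.example.com", "table": "t2", "store_file_size_mb": 128},
    {"host": "rs2.example.com", "table": "t1", "store_file_size_mb": 1024},
]


def summarize(records):
    """Total region count and store file size for each (host, table) pair."""
    summary = defaultdict(lambda: {"region_count": 0, "store_file_size_mb": 0})
    for record in records:
        cell = summary[(record["host"], record["table"])]
        cell["region_count"] += 1
        cell["store_file_size_mb"] += record["store_file_size_mb"]
    return dict(summary)


summary = summarize(regions)
for (host, table), totals in sorted(summary.items()):
    print(host, table, totals["region_count"], totals["store_file_size_mb"])
```

Each (host, table) cell corresponds to one segment of a per-host chart, so an 
unbalanced table would show up as uneven segments across hosts.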



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2022-03-07 Thread Andrew Kyle Purtell (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17502539#comment-17502539
 ] 

Andrew Kyle Purtell commented on HBASE-25865:
-

I approved the PRs; I just have a couple of minor questions.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Blocker
> Fix For: 2.6.0, 3.0.0-alpha-3
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2022-03-07 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17502272#comment-17502272
 ] 

Nick Dimiduk commented on HBASE-25865:
--

Sorry for the delay here, [~apurtell]; I've been focused on tracing. This is 
basically ready with the first chart. Let me refresh the PR. Will you give it 
another look? Thanks.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Blocker
> Fix For: 2.6.0, 3.0.0-alpha-3
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-12-15 Thread Andrew Kyle Purtell (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17460214#comment-17460214
 ] 

Andrew Kyle Purtell commented on HBASE-25865:
-

Marked as a release blocker for 2.5.0. See 
https://lists.apache.org/thread/f86s3sg3ktjydxs5cf2c9ocml48k6s9c for context. 
/cc [~ndimiduk]

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Blocker
> Fix For: 2.5.0, 2.6.0, 3.0.0-alpha-3
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-12-09 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456812#comment-17456812
 ] 

Nick Dimiduk commented on HBASE-25865:
--

bq. Thinking of the Jersey and JAX-RS related changes in this regard, their 
potential consequences for anyone who may have implemented custom servlets.

It's true that people who rely on these ancient versions of Jersey and 
the JSRs via transitive dependency will be affected. In a [slightly different 
context|https://hbase.apache.org/book.html#hbase.versioning], we say that a 
client application MAY require recompilation when upgrading to a new minor 
version, which means they have the opportunity to discover the issue at compile 
time.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 2.5.0, 3.0.0-alpha-2, 2.6.0
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-12-09 Thread Andrew Kyle Purtell (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456751#comment-17456751
 ] 

Andrew Kyle Purtell commented on HBASE-25865:
-

bq. do you intend to update the hbase-thirdparty dependency in hbase-2.5.0 to a 
new 4.0.x release? 

[~ndimiduk] I don't have a strong opinion either way, although the minor 
increment is a good opportunity to move from 3.x to 4.x thirdparty given more 
tolerance for incompatible changes per our guidelines. 

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 2.5.0, 3.0.0-alpha-2, 2.6.0
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-12-09 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456740#comment-17456740
 ] 

Nick Dimiduk commented on HBASE-25865:
--

Not entirely. I was delayed waiting for the hbase-thirdparty 4.0.0 release, and 
then I shifted my attention to the OpenTelemetry verification. Now that 
thirdparty is nearly here, I'd like to return to this and land the one chart 
that I have.

[~apurtell] do you intend to update the hbase-thirdparty dependency in 
hbase-2.5.0 to a new 4.0.x release? If not, we can drop this from 2.5.0.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 2.5.0, 3.0.0-alpha-2, 2.6.0
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-12-08 Thread Andrew Kyle Purtell (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456079#comment-17456079
 ] 

Andrew Kyle Purtell commented on HBASE-25865:
-

Has this issue been abandoned?

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 2.5.0, 3.0.0-alpha-2
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-25 Thread Bharath Vissapragada (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17351368#comment-17351368
 ] 

Bharath Vissapragada commented on HBASE-25865:
--

thanks, lgtm. 

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-25 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17351263#comment-17351263
 ] 

Nick Dimiduk commented on HBASE-25865:
--

Attached are examples of the region visualizer as implemented, generated from a 
large cluster with two primary tables. The top chart is interactive, as you can 
see from the mouse-over. Selecting a legend item also brings that table's data 
into focus (not shown). The bottom chart is an interactive element for zooming 
the top chart: selecting a range of the bottom chart clips the viewport of the 
top chart, as demonstrated in the second screenshot.
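
The interaction described above (tooltip on mouse-over, legend selection, and 
a lower chart that controls what the upper chart shows) could be expressed in 
Vega-Lite roughly as follows. This is a sketch written as a Python dict, not 
the spec from the patch: the field names, the v5 schema URL, the parameter 
names "brush" and "focus", and the choice to filter the detail view by the 
brush are all assumptions made for illustration.

```python
import json


def overview_detail_spec(records):
    """Return a Vega-Lite spec (as a dict): a detail chart above an overview chart."""
    x = {"field": "host", "type": "nominal"}
    y = {"aggregate": "sum", "field": "store_file_size_mb", "type": "quantitative"}
    color = {"field": "table", "type": "nominal"}

    detail = {
        # Keep only the hosts covered by the brush drawn on the overview chart.
        "transform": [{"filter": {"param": "brush"}}],
        # Clicking a legend entry selects that table.
        "params": [{
            "name": "focus",
            "select": {"type": "point", "fields": ["table"]},
            "bind": "legend",
        }],
        "mark": "bar",
        "encoding": {
            "x": x,
            "y": y,
            "color": color,
            # Selected tables stay opaque; the rest are dimmed.
            "opacity": {"condition": {"param": "focus", "value": 1.0}, "value": 0.2},
            "tooltip": [
                {"field": "host", "type": "nominal"},
                {"field": "table", "type": "nominal"},
                y,
            ],
        },
    }
    overview = {
        "height": 60,
        # Dragging along the x axis defines the brush used by the detail chart.
        "params": [{"name": "brush", "select": {"type": "interval", "encodings": ["x"]}}],
        "mark": "bar",
        "encoding": {"x": x, "y": y, "color": color},
    }
    return {
        "$schema": "https://vega.github.io/schema/vega-lite/v5.json",
        "data": {"values": records},
        "vconcat": [detail, overview],
    }


spec = overview_detail_spec([
    {"host": "rs1.example.com", "table": "t1", "store_file_size_mb": 512},
    {"host": "rs2.example.com", "table": "t2", "store_file_size_mb": 128},
])
print(json.dumps(spec, indent=2))
```

The printed JSON can be pasted into the Vega-Lite online editor to try the 
interaction; whether filtering or a scale-domain binding gives the better zoom 
behavior on a given dataset is something to check there.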

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0
>
> Attachments: image1.png, image2.png


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-21 Thread Bharath Vissapragada (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17349473#comment-17349473
 ] 

Bharath Vissapragada commented on HBASE-25865:
--

Sure, let me apply the patch locally and see how it looks. I'm OK with skipping 
the screenshots if it is too much effort; I don't mean to create more work for 
you.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-21 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17349466#comment-17349466
 ] 

Nick Dimiduk commented on HBASE-25865:
--

[~bharathv] I'm still working on getting a screenshot through an approval 
process. In the meantime, here's how you can try out just the visualization 
using data collected from an existing cluster: 
https://gist.github.com/ndimiduk/2305c1ffd4e02b96e56bfc04db789740

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-20 Thread Bharath Vissapragada (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348709#comment-17348709
 ] 

Bharath Vissapragada commented on HBASE-25865:
--

Patch seems fine to me (and the shading approach too). Mind posting screenshots 
of what this looks like?

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-18 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17347215#comment-17347215
 ] 

Nick Dimiduk commented on HBASE-25865:
--

Also, please consider this change along with the questions I posed in 
https://issues.apache.org/jira/browse/HBASE-25844?focusedCommentId=17347212&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17347212

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-18 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17347214#comment-17347214
 ] 

Nick Dimiduk commented on HBASE-25865:
--

Here is a feature branch based off of HBASE-25864 that demonstrates the basic 
idea. Watchers, I'd love to hear what you think.

https://github.com/ndimiduk/hbase/compare/25864-shaded-javax.ws.rs...ndimiduk:25865-region-visualizer

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master, Operability, Usability
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.5.0


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-07 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340995#comment-17340995
 ] 

Nick Dimiduk commented on HBASE-25865:
--

My work on HBASE-25844 was motivated by this project, getting 
{{ClusterMetrics}} out to the browser without contributing to our manual 
servlet mess.

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major


[jira] [Commented] (HBASE-25865) Visualize current state of region assignment

2021-05-07 Thread Nick Dimiduk (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-25865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340992#comment-17340992
 ] 

Nick Dimiduk commented on HBASE-25865:
--

The original scripts were written in Python, used Pandas to manipulate the raw 
data, and used Plot.ly to render charts in a Jupyter notebook. I've started a PoC 
implementation using Vega-Lite to directly parse a JSON conversion of the 
{{ClusterMetrics}} object and create an interactive chart. Of note, Vega seems 
to handle the data volume produced by this cluster much better than Plot.ly did.
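
To illustrate the kind of spec such a PoC might feed to Vega-Lite, here is a 
minimal sketch that builds a stacked bar chart of assigned regions per host, 
colored by table. It is not the PoC itself: the record fields, the v5 schema 
URL, and the function name are assumptions, and the real {{ClusterMetrics}} 
JSON would need flattening into one record per region before use.

```python
import json


def region_count_chart(records):
    """Build a Vega-Lite spec (as a dict): regions per host, stacked by table."""
    return {
        "$schema": "https://vega.github.io/schema/vega-lite/v5.json",
        # Inline data: one record per region (hypothetical shape).
        "data": {"values": records},
        "mark": "bar",
        "encoding": {
            "x": {"field": "host", "type": "nominal"},
            # Counting records per bar segment gives the assigned region count.
            "y": {"aggregate": "count", "type": "quantitative", "title": "assigned regions"},
            "color": {"field": "table", "type": "nominal"},
            "tooltip": [
                {"field": "host", "type": "nominal"},
                {"field": "table", "type": "nominal"},
                {"aggregate": "count", "type": "quantitative"},
            ],
        },
    }


records = [
    {"host": "rs1.example.com", "table": "t1"},
    {"host": "rs1.example.com", "table": "t2"},
    {"host": "rs2.example.com", "table": "t1"},
]
spec = region_count_chart(records)
print(json.dumps(spec, indent=2))
```

Because the aggregation is declared in the spec, the browser does the counting 
and the server only has to ship the flattened records.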

> Visualize current state of region assignment
> 
>
> Key: HBASE-25865
> URL: https://issues.apache.org/jira/browse/HBASE-25865
> Project: HBase
>  Issue Type: New Feature
>  Components: master
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Major