Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-15 Thread Tamas Penzes
> > From: Peter Somogyi > Sent: 15 November 2018 16:48 > To: dev@hbase.apache.org > Subject: Re: [DISCUSS] Gathering metrics on HBase versions in use > > I like the idea to have some sort of metrics from the users. > > I agree with All

Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-15 Thread Reid Chan
to PMC's mail list. -- Best regards, R.C From: Peter Somogyi Sent: 15 November 2018 16:48 To: dev@hbase.apache.org Subject: Re: [DISCUSS] Gathering metrics on HBase versions in use I like the idea to have some sort of metrics

Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-15 Thread Peter Somogyi
I like the idea to have some sort of metrics from the users. I agree with Allan that in many cases the HBase cluster is on an internal network, making the data collection difficult or not even possible. It could lead us to an incorrect view if these generally bigger clusters do not appear in the

Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-14 Thread Duo Zhang
+1 on collecting the download information. And collecting data when starting up is a bit dangerous I'd say, both technically and legally... Maybe a possible way is to add a link on the master status page, or some ASCII art in the master startup log, to guide people to our survey? Allan Yang

Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-14 Thread Allan Yang
I also think having metrics about the downloads from Apache/archives is a doable action. Most HBase clusters are running in users' intranets with no public access, so sending anonymous data from them may not be possible. And also we need to find a way to obtain their authorization I think... Best

Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-14 Thread Zach York
Can we have metrics around the downloads from Apache/archives? I'm not sure how that is all set up, but might be a low cost way to get some metrics. On Wed, Nov 14, 2018, 12:12 PM Andrew Purtell While it seems you are proposing some kind of autonomous ongoing usage > metrics collection, please

Re: [DISCUSS] Gathering metrics on HBase versions in use

2018-11-14 Thread Andrew Purtell
While it seems you are proposing some kind of autonomous ongoing usage metrics collection, please note I ran an anonymous version usage survey via surveymonkey for 1.x last year. It was opt in and there were no PII concerns by its nature. All of the issues around data collection, storage, and

[DISCUSS] Gathering metrics on HBase versions in use

2018-11-14 Thread Misty Linville
When discussing the 2.0.x branch in another thread, it came up that we don’t have a good way to understand the version skew of HBase across the user base. Metrics gathering can be tricky. You don’t want to capture personally identifiable information (PII) and you need to be transparent about what