Re: [VOTE] Bigtop 0.7.0 RC0

2013-10-30 Thread Bruno Mahé
Since we are going to spin a new Apache Bigtop 0.7.0 RC1, here is my 
formal -1 for Apache Bigtop 0.7.0 RC0.


See inline.

On 10/29/2013 03:49 PM, Roman Shaposhnik wrote:

On Mon, Oct 28, 2013 at 11:14 PM, Bruno Mahé bm...@apache.org wrote:

I would agree with that. The only trouble is I can't repro it :-(
Could you please provide additional details on the JIRA?

But if it is easy to repro (and I'm just missing something)
I'd agree with you.



I updated the ticket with some more information.
If you still cannot reproduce it, feel free to ping me and I will give you
access to the instance.
At some point I was wondering if it is because I am mixing the amazon ami
with centos6 repos, but given every other service I try do work...


Thanks for the details instructions on how to repro. It is indeed
easily reproducible and it is also extremely easy to fix.

Now, at this point, I'm absolutely in favor of spinning up RC1
with the following fixes in: BIGTOP-1132 and BIGTOP-1129

Both of those are really isolated fixes and here's what I'd like
to propose: I respin on Wed and send out a new VOTE thread.
This time, however, the voting will only go till noon on Sun 11/3.

If anybody objects to that -- please let me know ASAP.



Awesome!
Sounds like a great plan.

I came home too late to verify the patch of BIGTOP-1129, but hopefully I 
will be able to do so either tomorrow or Thursday.





The rest of my comments inline:


I can help with that. ;-) At least with the default use case. Let me know
if you're still interested.



Sure. Any recommendations?


Basically, you can simply follow this setup docs:
 
http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-Installation-Guide/csig_install_search.html
and especially this part:
 
http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-Installation-Guide/csig_deploy_search_solrcloud.html

It also covers Hue Search app. It would very nice if somebody
can help come up with Bigtop-specific wiki docs, but for now
Cloudera Search is close enough.


At this point Puppet code is the most reliable way to deploy Hue. Part of
it has to do that Hue now depends on way more external services than
it uses to -- which means that you have to get configs 'just right'. Our
puppet works great for that -- but that's for a fully distributed cluster.



I was not using puppet (on purpose).
I will give it another shot by looking at what our puppet recipes are doing
and see if any of these changes can be baked directly into our packages.


Thanks! That would be appreciated.


Actually, it was the elasticsearch sink for flume which had me update Apache
Lucene jars.

I will probably open a ticket against Apache Flume directly sometimes this
week to ask for some clarification.


Please do. I believe I fixed some of those issues in upcoming Flume 1.5.0


WAT? Really. I'm pretty curious at this point



I had the same reaction.
But this was reported as blocked by noscript.
Also it seems to be a configuration activated by default:
https://github.com/cloudera/hue/blob/master/desktop/conf.dist/hue.ini#L63


Do you think we can convince Hue upstream to not spy
on its users by default? ;-)

I guess worst case scenario -- we can always disable it downstream in Bigtop.



I will start with opening a ticket with Hue and Bigtop.
And as you said, worst case scenario, that flag can be disabled over here.

I opened https://issues.apache.org/jira/browse/BIGTOP-1135 to track that 
effort.




Great! Would you be willing to help with a blog post on the release? ;-)



Sure.
Do you have anything particular in mind?


Just a bit more verbose version of the feedback you've provided on this
thread, I guess. Makes sense?




Makes sense.



Well, there's Hue Search app. Have you had a chance to try it?



Not yet. But will do.


Give it a try -- its like Google, but on your data ;-)

And the docs I quoted above cover it as well.

Thanks,
Roman.



Thanks!
Will take a look.


Thanks,
Bruno



Re: [VOTE] Bigtop 0.7.0 RC0

2013-10-28 Thread Bruno Mahé

On 10/18/2013 09:54 PM, Roman Shaposhnik wrote:

This is the seventh release for Apache Bigtop, version 0.7.0

It fixes the following issues:
   http://s.apache.org/Pkp

*** Please download, test and vote by Fri 10/25 noon PST

Note that we are voting upon the source (tag):
release-0.7.0-RC0

Source and binary files:
   
https://repository.apache.org/content/repositories/orgapachebigtop-194/org/apache/bigtop/bigtop/0.7.0/

Binary convenience artifacts:
http://bigtop01.cloudera.org:8080/view/Releases/job/Bigtop-0.7.0/

Documentation on how to install (just make sure to adjust the repos for 0.7.0):
  
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.6.0

Maven staging repo:
https://repository.apache.org/content/repositories/orgapachebigtop-194/

The tag to be voted upon:

https://git-wip-us.apache.org/repos/asf?p=bigtop.git;a=commit;h=fb628180d289335dcf95641b44482fb680f11573

Bigtop's KEYS file containing PGP keys we use to sign the release:
http://svn.apache.org/repos/asf/bigtop/dist/KEYS

Thanks,
Roman.




I am not voting yet since I still have some time, but so far I am 
leaning toward a -1.


I am learning toward a -1 because of 
https://issues.apache.org/jira/browse/BIGTOP-1129 and my issues with Hue.
Other than that, everything I tested either just works out of the box or 
is nitpick.
But BIGTOP-1129 is what I would consider a blocker since it is part of 
the basic use case of Apache Bigtop.


Things I tested:
* Apache Hadoop and some basic jobs
* Apache HBase and Phoenix. Just basic testing
* Apache Flume sending Apache Hadoop and Apache HBase logs to an 
Elasticsearch instance and visualized through Kibana

* Apache Hue smoke tests
* Everything running on OpenJDK 6 on ec2 instances

Things I still want to test (or rather, things I hope I can test by 
Tuesday evening):

* Apache Pig and datafu
* Apache Solr
* Load more data into Phoenix


Things we could do better:
* As described on BIGTOP-1129, I could not stop datanode/namenode 
through init scripts.
* We could provide some templates for Apache Hadoop. I wasted a few 
hours just to get the pi job running. Thankfully we have the init script 
for hdfs (which needs some tweaks for the staging directory) and 
templates for the configuration files in our puppet modules
* I enabled short-circuit in Apache HBase. Not sure if I missed 
something, but I got some 
org.apache.hadoop.security.AccessControlException: Can't continue with 
getBlockLocalPathInfo() authorization exceptions. From reading 
http://www.spaggiari.org/index.php/hbase/how-to-activate-hbase-shortcircuit 
it seems there are a few things we could do to make it work out of the box
* Not sure what I did wrong but although I could access Hue UI, most 
apps I tried were not working. Ex: all shells give me the error value 
222 for UID is less than the minimum UID allowed (500). And the file 
browser gives me the error Cannot access: /. Note: You are a Hue admin 
but not a HDFS superuser (which is hdfs).. Note that the first user I 
created was a user named ec2-user. Although it is not an hdfs super 
user, I would expect to have a working equivalent of what I can browse 
with the hdfs -ls command. Also creating a hue user named hdfs 
yields the same result. Note that I did not have time to dig further.
* Phoenix directly embeds Apache Hadoop, Apache HBase and Apache 
Zookeeper jars. These jars should be symlinks.
* Phoenix required me to delete some old Apache lucene jars from Apache 
Flume installation directory. From the output of the command mvn 
dependency:tree on the flume project, it appears these jars are only 
needed for the ElasticSearch and MorphlineSolrSink plugins. but Flume 
documentation for both of these plugin explicitly ask users to provide 
jars of Apache Lucene and Apache Solr/ElasticSearch themselves (since 
they may use a different version of Apache Lucene). So the dependency on 
Apache Lucene by Apache Flume should probably be marked as provided 
and we should probably provide some packages to manage these dependencies.
* I still need to figure out why my instance of Hue needs access to 
google-analytics.com



Other than that, it was an enjoyable experience to use Apache Bigtop 
0.7.0RC0.
Doing SQL queries through Phoenix was pretty impressive and did not 
require much work to setup.
Also seeing Apache Hadoop and Apache HBase logs being shipped by flume 
to ElasticSearch and then being able to query events and create some 
dynamic charts on kibana was exciting!



Also, since I am about to test Apache Solr, is there an equivalent to 
Kibana I can use for visualizing my indexed logs?



Thanks,
Bruno


Re: [VOTE] Bigtop 0.7.0 RC0

2013-10-27 Thread Anatoli Fomenko
+1
(Basic testing went well on Centos 6.3)

Anatoli 



On Friday, October 18, 2013 9:54 PM, Roman Shaposhnik r...@apache.org wrote:
 
This is the seventh release for Apache Bigtop, version 0.7.0

It fixes the following issues:
  http://s.apache.org/Pkp

*** Please download, test and vote by Fri 10/25 noon PST

Note that we are voting upon the source (tag):
   release-0.7.0-RC0

Source and binary files:
  
https://repository.apache.org/content/repositories/orgapachebigtop-194/org/apache/bigtop/bigtop/0.7.0/

Binary convenience artifacts:
  http://bigtop01.cloudera.org:8080/view/Releases/job/Bigtop-0.7.0/

Documentation on how to install (just make sure to adjust the repos for 0.7.0):
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.6.0

Maven staging repo:
  https://repository.apache.org/content/repositories/orgapachebigtop-194/

The tag to be voted upon:
  
https://git-wip-us.apache.org/repos/asf?p=bigtop.git;a=commit;h=fb628180d289335dcf95641b44482fb680f11573

Bigtop's KEYS file containing PGP keys we use to sign the release:
  http://svn.apache.org/repos/asf/bigtop/dist/KEYS

Thanks,
Roman.

Re: [VOTE] Bigtop 0.7.0 RC0

2013-10-25 Thread Bruno Mahé

On 10/18/2013 09:54 PM, Roman Shaposhnik wrote:

This is the seventh release for Apache Bigtop, version 0.7.0

It fixes the following issues:
   http://s.apache.org/Pkp

*** Please download, test and vote by Fri 10/25 noon PST

Note that we are voting upon the source (tag):
release-0.7.0-RC0

Source and binary files:
   
https://repository.apache.org/content/repositories/orgapachebigtop-194/org/apache/bigtop/bigtop/0.7.0/

Binary convenience artifacts:
http://bigtop01.cloudera.org:8080/view/Releases/job/Bigtop-0.7.0/

Documentation on how to install (just make sure to adjust the repos for 0.7.0):
  
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.6.0

Maven staging repo:
https://repository.apache.org/content/repositories/orgapachebigtop-194/

The tag to be voted upon:

https://git-wip-us.apache.org/repos/asf?p=bigtop.git;a=commit;h=fb628180d289335dcf95641b44482fb680f11573

Bigtop's KEYS file containing PGP keys we use to sign the release:
http://svn.apache.org/repos/asf/bigtop/dist/KEYS

Thanks,
Roman.




I haven't had time to test it yet, but should be able to get to it this 
week end.

Would it be possible to postpone the end of the vote until Sunday?

Also I could not find any jenkins job with all our tests running against 
this release.

It would be great to put such a link in the vote email.


Thanks,
Bruno


[VOTE] Bigtop 0.7.0 RC0

2013-10-18 Thread Roman Shaposhnik
This is the seventh release for Apache Bigtop, version 0.7.0

It fixes the following issues:
  http://s.apache.org/Pkp

*** Please download, test and vote by Fri 10/25 noon PST

Note that we are voting upon the source (tag):
   release-0.7.0-RC0

Source and binary files:
  
https://repository.apache.org/content/repositories/orgapachebigtop-194/org/apache/bigtop/bigtop/0.7.0/

Binary convenience artifacts:
   http://bigtop01.cloudera.org:8080/view/Releases/job/Bigtop-0.7.0/

Documentation on how to install (just make sure to adjust the repos for 0.7.0):
 
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.6.0

Maven staging repo:
   https://repository.apache.org/content/repositories/orgapachebigtop-194/

The tag to be voted upon:
   
https://git-wip-us.apache.org/repos/asf?p=bigtop.git;a=commit;h=fb628180d289335dcf95641b44482fb680f11573

Bigtop's KEYS file containing PGP keys we use to sign the release:
   http://svn.apache.org/repos/asf/bigtop/dist/KEYS

Thanks,
Roman.