Roman, your blog post also seems useful for answering the above questions: https://blogs.apache.org/bigtop/entry/bigtop_and_why_should_you ...
FYI, I think the links need to be updated to point to the respective Git repos now. For example,
https://svn.apache.org/repos/asf/incubator/bigtop/trunk/bigtop-deploy/puppet/
should be
https://github.com/apurtell/bigtop/tree/master/bigtop-deploy

--
On Mon, Jun 10, 2013 at 6:23 PM, Jay Vyas <[email protected]> wrote:

> Thanks Roman :) Yes, we once lived and died by the SolrOutputFormat for
> some time. It is a very nice extension to Hadoop's reduce outputs, but
> what I mean is that Solr is not part of the Hadoop ecosystem, in the sense
> that it doesn't natively depend on HDFS. Rather, it uses a standard file
> system and is a memory-intensive app, scaling via more cores, not more data
> nodes or task trackers.
>
> I think of "Hadoop ecosystem" tools as tools which rely on HDFS, or
> MapReduce, in order to run.
>
> But maybe the definition of the "Hadoop ecosystem" is broadening in the
> YARN / ZooKeeper era?
>

--
Jay Vyas
http://jayunit100.blogspot.com
