On Mon, Jun 10, 2013 at 06:23PM, Jay Vyas wrote:
> Thanks Roman :) Yes, we once lived and died by the SolrOutputFormat for
> some time. It is a very nice extension to Hadoop's reduce outputs, but
> what I mean is that Solr is not part of the Hadoop ecosystem, in the sense
> that it doesn't natively depend on HDFS. Rather, it uses a standard file
> system and is a memory-intensive app, scaling via more cores, not more data
> nodes or task trackers.
>
> I think of "Hadoop ecosystem" tools as tools which rely on HDFS, or
> MapReduce, in order to run.
HDFS, largely yes. YARN (not MR per se), not so much. For example, Bigtop is
integrating, or is about to integrate, in-memory analytic systems (Spark,
Shark) that don't rely on MR at all and only somewhat benefit from YARN.

> But maybe the definition of the "Hadoop ecosystem" is broadening in the YARN
> / ZooKeeper era?

See above.

Cos
