Thanks roman :) yes we once lived and died by the SolrOutputFormat for some time. It is a very nice extension to hadoop's reduce outputs - but What i mean is that SOLR is not part of the hadoop ecosystem, in the sense that it doesnt natively depend on HDFS . Rather it uses standard file system and is a memory intensive app, scaling via more cores, not more data nodes or task trackers .
I think of "hadoop ecosytem" tools as tools which rely on HDFS, or MapReduce, in order to run. But maybe the definition of the "hadoop ecosystem" is brodening in the YARN / Zookeeper era ?
