Dear All, What would be the best practice to index a large crawl using Solr? The crawl is performed on a multi node Hadoop cluster using HBase as the back end.. Would Solr become a bottleneck if we use just a single Solr instance? Is it possible to store the indexed data on HBase and to serve them from the HBase it self?
thanks a lot, Thilina -- https://www.cs.indiana.edu/~tgunarat/ http://www.linkedin.com/in/thilina http://thilina.gunarathne.org

