> Sorry if this is a naive question but can you explain why you consider
> that ElasticSearch isn't a distributed solution for realtime search?

I wasn't referring just to ES, mainly to Katta and Solr.  Taking a
step back, RT in Lucene should enable immediate consistency making it
symmetrical with HBase?  Outside of that there are 'containers' for
Lucene, some of which are Katta, Solr, and ES.  My opinion is that
they each have drawbacks compared to HBase as a Lucene container.

If one is running HBase in production, then adding a Lucene index on
that data shouldn't add more complexity to operating HBase.  And so if
one's primary data store is HBase, my opinion is that one'd be adding
significant additional complexity be adding 'another' cluster server
system alongside.  Especially given the requirements and symmetrical
(eg, write-once, immediate consistency) nature of HBase and Lucene.

Once everything is polished I think it'll be a nice solution that can
replace many MySQL depoyments for realtime data access.  One that'd
offer even more types of queries and scalability than MySQL.  If the
user wishes to perform joins they can use Hive?

Reply via email to