Hmm i am not sure i understand your questions. Hadoop is distributed storage system (HDFS) and Map-reduce framework (MR) (among other things) ES is distributed storage/search system (among other things)
So what es-hadoop is giving you: You can read data from ES , and do some complex analysis , taking benefits MR You can write data to ES - one can process some data stored on HDFS and write some pre-aggregated data to ES for example es-hadoop is basically connector between ES and Hadoop I hope this helps On Thursday, June 5, 2014 7:41:34 PM UTC+2, ES USER wrote: > > Try as I might and I have read all the stuff I can find on ES' website > about this I understand somewhat how the integration works but not the > actual nuts and bolts of it. > > For example: > > Is Hadoop just storing the files that would normally be stored in the > local filesystem for the ES indexes or is it storing the data that would > normally be in those indexes and just accessed through es-hadoop? > > If it is the latter how do you go about determining whatto set for the > number of nodes and shards. > > > If anyone has any information on this or even better yet a place to point > me to that has better references so that I can research this on my own it > would be much appreciated. > > Thanks. > -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f4019b07-a660-4a49-b9ec-b04bb1ad71e5%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
