You should look at raw option or better look at Logstash. My 2 cents.
David > Le 14 janv. 2015 à 23:29, Chinch Pokli <[email protected]> a écrit : > > Hi, > > I am using elasticsearch to index twitter stream. Until recently I was using > the official river which was working great but realized that it throwing out > much of the data (e.g. it is not storing number of followers etc. data). > > Is there a way to make the river to store all the data? If not, I am fine > with writing a streaming code which will stream and index. But have a > concern. How many documents can elasticsearch index per second? I might > eventually need to index almost 10,000 documents (each document = 2 KB) per > second (current requirement is of 100 documents per second). Is this even > feasible? If yes, do I need to make any special modifications? > > Thanks-in-advance!! > -- > You received this message because you are subscribed to the Google Groups > "elasticsearch" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/elasticsearch/da547692-903b-4793-a77e-fd5f0b5a01b7%40googlegroups.com. > For more options, visit https://groups.google.com/d/optout. -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/0C83816D-AD64-4C6D-B573-C287B8222F2B%40pilato.fr. For more options, visit https://groups.google.com/d/optout.
