> And one more thing, Is there any concept like Real time clustering and On demand clustering in ES?
I honestly think you've read two different marketing snippets that both refer to the same thing. "On-demand" and "real-time" clustering both run on a subset of documents from the index (typically the results of a query) and should return clusters within a reasonable amount of time, so that users can interact with the system. Off-line or batch clustering denotes the situation where you cluster all your documents, without the context of any query, and this can take significantly longer.

This paper has a write-up of on-line clustering techniques in the context of clustering search results (disclosure: I'm partially responsible for it):

http://dl.acm.org/citation.cfm?id=1541884&dl=ACM&coll=DL&CFID=298573694&CFTOKEN=23620876

@Jörg: clustering is a pretty established term in information retrieval and it nearly always denotes an unsupervised technique. I would be hesitant to talk about "supervised clustering", even if such a thing could be imagined, either by introducing a feedback control loop (clustering, human evaluation, reclustering) or by introducing a predefined concept ontology, in which case it effectively becomes a classification problem.

Dawid
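
To make the on-demand versus batch distinction concrete, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the toy corpus, the `search` stand-in, and the naive "most widely shared term" grouping are hypothetical and are not how ES or any real clustering algorithm works. The only point is that the same unsupervised routine is applied either to the hits of a query or to the whole collection.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus standing in for documents in an index.
DOCS = {
    1: "jaguar car dealer",
    2: "jaguar car review",
    3: "jaguar cat habitat",
    4: "jaguar cat diet",
    5: "python snake habitat",
    6: "used car dealer",
}


def search(query, docs):
    """Stand-in for a real search: ids of documents containing every query term."""
    terms = query.lower().split()
    return [doc_id for doc_id, text in docs.items()
            if all(term in text.split() for term in terms)]


def cluster(doc_ids, docs, ignore=()):
    """Naive unsupervised grouping (illustration only).

    Each document is labeled with the term it shares with the most other
    documents in the given set; ties are broken alphabetically.
    """
    ignore = set(ignore)
    doc_freq = Counter()
    for doc_id in doc_ids:
        doc_freq.update(set(docs[doc_id].split()) - ignore)

    clusters = defaultdict(list)
    for doc_id in doc_ids:
        words = [w for w in docs[doc_id].split() if w not in ignore]
        label = max(words, key=lambda w: (doc_freq[w], w)) if words else "other"
        clusters[label].append(doc_id)
    return dict(clusters)


def cluster_search_results(query, docs):
    """On-demand / real-time: cluster only the hits, in the context of the query."""
    hits = search(query, docs)
    return cluster(hits, docs, ignore=query.lower().split())


def cluster_everything(docs):
    """Off-line / batch: cluster the whole collection, with no query context."""
    return cluster(list(docs), docs)


if __name__ == "__main__":
    print(cluster_search_results("jaguar", DOCS))
    print(cluster_everything(DOCS))
```

With the query "jaguar" the hits split into a "car" group and a "cat" group, while the batch run over all six documents groups them differently because there is no query to give context. On a real index the batch variant also has to touch every document, which is why it takes so much longer.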
