I am considering using the percolator API to classify document, namely, by posting query like "football", "art" to the percolator, and then when adding new documents, percolator should return the right tags. My concerns is, suppose there is thousands of tag to be identified in this way, would it be a performance nightmare? Is there thousands of query that is implicitly running behind the scene?
And what would be the recommended way to tackle these kind of classification problem in Elasticsearch? It seems that Lucene has a classification api. Is it already integrated elsewhere in Elasticsearch? Is there any roadmap concerning its implementation? -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/8cd363be-5c9b-4b10-925c-fb4f1de4d4c3%40googlegroups.com. For more options, visit https://groups.google.com/groups/opt_out.
