I am considering using the percolator API to classify document, namely, by 
posting query like "football", "art" to the percolator, and then when 
adding new documents, percolator should return the right tags. My concerns 
is, suppose there is thousands of tag to be identified in this way, would 
it be a performance nightmare? Is there thousands of query that is 
implicitly running behind the scene?

And what would be the recommended way to tackle these kind of 
classification problem in Elasticsearch?

It seems that Lucene has a classification api. Is it already integrated 
elsewhere in Elasticsearch? Is there any roadmap concerning its 
implementation?

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/8cd363be-5c9b-4b10-925c-fb4f1de4d4c3%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to