Hey Binh, Thanks a lot and it is really nice to hear from someone with 
practical experience on this. Is it correct  to say if I had a thousand 
tags, I would need to make thousands of 

curl -XPUT 'localhost:9200/my-index1/.percolator/tagname1' 

to register each tags? In your implementation is there any pitfalls or nice 
tricks that is worth noting?




On Wednesday, January 22, 2014 8:27:03 AM UTC+8, Binh Ly wrote:
>
> Arthur,
>
> You should be able to use filters in your percolator queries so for 
> example you can use a term/terms filter. Also, in ES 1.0 you can shard the 
> percolator query index out so that percolation can distribute that load 
> around for better scalability. The best way is to experiment with it: 
> http://www.elasticsearch.org/downloads/1-0-0-RC1.
>
> I actually worked for a company that did content classification this way, 
> and the percolator was a perfect fit for that use-case.
>
> On Tuesday, January 21, 2014 10:01:36 AM UTC-5, Arthur Denning wrote:
>>
>> I am considering using the percolator API to classify document, namely, 
>> by posting query like "football", "art" to the percolator, and then when 
>> adding new documents, percolator should return the right tags. My concerns 
>> is, suppose there is thousands of tag to be identified in this way, would 
>> it be a performance nightmare? Is there thousands of query that is 
>> implicitly running behind the scene?
>>
>> And what would be the recommended way to tackle these kind of 
>> classification problem in Elasticsearch?
>>
>> It seems that Lucene has a classification api. Is it already integrated 
>> elsewhere in Elasticsearch? Is there any roadmap concerning its 
>> implementation?
>>
>>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/965b464c-1cf2-4ae5-83c1-5f18fe8d0228%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to