Custom update handler with deduplication

2013-12-15 Thread Jorge Luis Betancourt González
Currently I've the following Update Request Processor chain to prevent indexing very similar text items into a core dedicated to store queries that our users put into the web interface of our system. !-- Delete similar duplicated documents on index time, using some fuzzy text similary

Re: Custom update handler with deduplication

2013-12-15 Thread Shalin Shekhar Mangar
Firstly, I see that you have overwriteDupes=false in your configuration. This means that a signature will be generated but the similar documents will still be added to the index. Now to your main question about counting duplicate attempts, one simple way is to have another UpdateRequestProcessor