Currently I've the following Update Request Processor chain to prevent indexing
very similar text items into a core dedicated to store queries that our users
put into the web interface of our system.
!-- Delete similar duplicated documents on index time, using some fuzzy text
similary
Firstly, I see that you have overwriteDupes=false in your
configuration. This means that a signature will be generated but the
similar documents will still be added to the index. Now to your main
question about counting duplicate attempts, one simple way is to have
another UpdateRequestProcessor