The more I think about this, the more I think we should have a signature processor that uses MinHash. The MD5 signature processor was really easy to use.
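
To make the idea concrete, here is a minimal sketch in plain Python, not a Solr update processor. It takes the text of one field, builds word shingles, keeps the minimum hash per seeded hash function, and formats the results as hex strings for a stored string field. Everything in it is an assumption for illustration: the function names, the shingle size, the number of hashes, and the use of BLAKE2b. It is not Lucene's minhash token filter, and its values will not match that filter's output.

```python
import hashlib
import re


def shingles(text, size=5):
    """Yield overlapping word shingles from lowercased text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < size:
        # Short input: treat all the words as a single shingle.
        if words:
            yield " ".join(words)
        return
    for i in range(len(words) - size + 1):
        yield " ".join(words[i:i + size])


def minhash_hex(text, num_hashes=8, shingle_size=5):
    """Return one 16-character hex minimum per seeded hash function.

    Hypothetical helper: the seeds stand in for independent hash
    functions, which is the usual MinHash construction.
    """
    minima = [None] * num_hashes
    for shingle in shingles(text, shingle_size):
        data = shingle.encode("utf-8")
        for seed in range(num_hashes):
            digest = hashlib.blake2b(
                data, digest_size=8, salt=seed.to_bytes(8, "little")
            ).digest()
            value = int.from_bytes(digest, "big")
            if minima[seed] is None or value < minima[seed]:
                minima[seed] = value
    # Empty input produces no shingles, so no hashes are emitted.
    return [format(v, "016x") for v in minima if v is not None]


# Hypothetical field names: copy hashes from "body" into "body_minhash".
doc = {"body": "Is there an easy way to define an analyzer chain in schema.xml"}
doc["body_minhash"] = minhash_hex(doc["body"])
```

A real processor would do the same three steps (read the source field, run the analysis chain, write the formatted tokens to the hash field) inside Solr, with the chain taken from the schema rather than hard-coded.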

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)

> On Apr 7, 2018, at 4:55 AM, Emir Arnautović <emir.arnauto...@sematext.com> wrote:
>
> Hi Walter,
> I did this sample processor for the purpose of having doc values on analysed
> field: https://github.com/od-bits/solr-multivaluefield-processor
>
> (+ related blog:
> http://www.od-bits.com/2018/02/solr-docvalues-on-analysed-field.html)
>
> HTH,
> Emir
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>
>> On 6 Apr 2018, at 23:46, Walter Underwood <wun...@wunderwood.org> wrote:
>>
>> Is there an easy way to define an analyzer chain in schema.xml then run it
>> in an update request processor?
>>
>> I want to run a chain ending in the minhash token filter, then take those
>> minhashes, convert them to hex, and put them in a string field. I’d like the
>> values stored.
>>
>> It seems like this could all work in an update request processor. Grab the
>> text from one field, run it through the chain, format the output tokens and
>> add them to the field for hashes.
>>
>> wunder
>> Walter Underwood
>> wun...@wunderwood.org
>> http://observer.wunderwood.org/  (my blog)