As I think more about this, we should have a signature processor that uses
minhash. The MD5 signature processor was really easy to use.
http://observer.wunderwood.org/ (my blog)
> On Apr 7, 2018, at 4:55 AM, Emir Arnautović <emir.arnauto...@sematext.com>
> Hi Walter,
> I did this sample processor for the purpose of having doc values on analysed
> field: https://github.com/od-bits/solr-multivaluefield-processor
> (+ related blog:
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>> On 6 Apr 2018, at 23:46, Walter Underwood <wun...@wunderwood.org> wrote:
>> Is there an easy way to define an analyzer chain in schema.xml then run it
>> in an update request processor?
>> I want to run a chain ending in the minhash token filter, then take those
>> minhashes, convert them to hex, and put them in a string field. I’d like the
>> values stored.
>> It seems like this could all work in an update request processor. Grab the
>> text from one field, run it through the chain, format the output tokens and
>> add them to the field for hashes.
>> Walter Underwood
>> http://observer.wunderwood.org/ (my blog)