As I think more about this, we should have a signature processor that uses 
minhash. The MD5 signature processor was really easy to use.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)

> On Apr 7, 2018, at 4:55 AM, Emir Arnautović <emir.arnauto...@sematext.com> 
> wrote:
> 
> Hi Walter,
> I did this sample processor for the purpose of having doc values on analysed 
> field: https://github.com/od-bits/solr-multivaluefield-processor 
> <https://github.com/od-bits/solr-multivaluefield-processor>
> 
> (+ related blog: 
> http://www.od-bits.com/2018/02/solr-docvalues-on-analysed-field.html 
> <http://www.od-bits.com/2018/02/solr-docvalues-on-analysed-field.html>)
> 
> HTH,
> Emir
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
> 
> 
> 
>> On 6 Apr 2018, at 23:46, Walter Underwood <wun...@wunderwood.org> wrote:
>> 
>> Is there an easy way to define an analyzer chain in schema.xml then run it 
>> in an update request processor?
>> 
>> I want to run a chain ending in the minhash token filter, then take those 
>> minhashes, convert them to hex, and put them in a string field. I’d like the 
>> values stored.
>> 
>> It seems like this could all work in an update request processor. Grab the 
>> text from one field, run it through the chain, format the output tokens and 
>> add them to the field for hashes.
>> 
>> wunder
>> Walter Underwood
>> wun...@wunderwood.org
>> http://observer.wunderwood.org/  (my blog)
>> 
> 

Reply via email to