Re: format data at source or format data during indexing?

2017-03-30 Thread Derek Poh
Hi Alex The business use case for the field is - exact match - singular-plural stemmingon each terms in the field Eg. search for "dvd cases" must match "dvd case"and "dvds case". This is the field type currently and It satisfy the business use case. The 1 drawback of this is I need to add those

Re: format data at source or format data during indexing?

2017-03-30 Thread Alexandre Rafalovitch
What's you actual business use case? On 30 Mar 2017 1:53 AM, "Derek Poh" wrote: > Hi Erick > > So I could also not use the query analyzer stage to append the code to the > search keyword? > Have the front-end application append the code for every query it issue > instead? > > > On 3/30/2017 12:2

Re: format data at source or format data during indexing?

2017-03-29 Thread Derek Poh
Hi Alex Thank you for pointing out theUpdateRequestProcessor option. On 3/30/2017 11:43 AM, Alexandre Rafalovitch wrote: I am not sure I can tell how to decide on one or another. However, I wanted to mention that you also have an option of doing in in the UpdateRequestProcessor chain. That's st

Re: format data at source or format data during indexing?

2017-03-29 Thread Derek Poh
Hi Erick So I could also not use the query analyzer stage to append the code to the search keyword? Have the front-end application append the code for every query it issue instead? On 3/30/2017 12:20 PM, Erick Erickson wrote: I generally prefer index-time work to query-time work on the theo

Re: format data at source or format data during indexing?

2017-03-29 Thread Erick Erickson
I generally prefer index-time work to query-time work on the theory that the index-time work is done once and the query time work is done for each query. That said, for a corpus this size (and presumably without a large query rate) I doubt you'd be able to measure any difference. So basically cho

Re: format data at source or format data during indexing?

2017-03-29 Thread Alexandre Rafalovitch
I am not sure I can tell how to decide on one or another. However, I wanted to mention that you also have an option of doing in in the UpdateRequestProcessor chain. That's still within Solr (and therefore is consistent with multiple clients feeding into Solr) but is before individual field processi

format data at source or format data during indexing?

2017-03-29 Thread Derek Poh
Hi Ineed to create afield that will be prefix and suffix with code 'z01x'.This field needs to have the code in the index and during query. I can either 1. have the source data of the field formatted with the code before indexing (outside solr). use a charFilter in the query stage of the field