RE: Multivalued fields and the 'copyField' operator

2007-08-11 Thread Chris Hostetter

: If I index with this, the spellcheck Analyser class complains that I'm
: putting two values in a multiValued=false field. Since I have to make it
: multiValued, the same word in successive values is not collapsed into one
: mention of the word.

I think you are missunderstanding hte intent of the multiValued field
attribute -- it really has nothing to do with collapsing values or
removing duplicates, it just tells Solr wether or not you want to allow
multiple descreete values to ever be added -- wether it's by a copyField
or by you sending multiple explicit values when you add the document.

even if you take copyField out of the equation, and sent the values
explicitly; or took multiValued out of hte equation and did a string
concat of the values before adding the doc there would still be no
automatic collapsing of successive instances of a word.


-Hoss



Re: Multivalued fields and the 'copyField' operator

2007-08-09 Thread Yonik Seeley
On 8/9/07, Lance Norskog [EMAIL PROTECTED] wrote:
 I'm adding a field to be the source of the spellcheck database.  Since that
 is its only job, it has raw text lower-cased, de-Latin1'd, and
 de-duplicated.

 Since it is only for the spellcheck DB, it does not need to keep duplicates.

Duplicate token values (words) or duplicate field values?
Could you give some examples?

-Yonik


RE: Multivalued fields and the 'copyField' operator

2007-08-09 Thread Lance Norskog
If we have a field spellcheck_db, and have two copyField lines for it:

fieldType name=spellcheck ... Basically the text type without
stemming... 

field name=title type=string / 
field name=description type=string / 

field name=spellcheck_db multiValued=false
type=spellcheck indexed=true stored=false
required=true /

copyField source=title dest=spellcheck_db /
copyField source=description dest=spellcheck_db /

All I want to do is make a pile of words as input to the spellcheck feature.

If I index with this, the spellcheck Analyser class complains that I'm
putting two values in a multiValued=false field. Since I have to make it
multiValued, the same word in successive values is not collapsed into one
mention of the word.

I suppose this is an 'out' case, and not worth any major internal rework.

Thanks for your time,

Lance

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yonik Seeley
Sent: Thursday, August 09, 2007 5:28 PM
To: solr-user@lucene.apache.org
Subject: Re: Multivalued fields and the 'copyField' operator

On 8/9/07, Lance Norskog [EMAIL PROTECTED] wrote:
 I'm adding a field to be the source of the spellcheck database.  Since 
 that is its only job, it has raw text lower-cased, de-Latin1'd, and 
 de-duplicated.

 Since it is only for the spellcheck DB, it does not need to keep
duplicates.

Duplicate token values (words) or duplicate field values?
Could you give some examples?

-Yonik