I've got to ask _how_ are you intending to search this field? On the
surface, this feels like an XY problem.
It's a "string" type. Therefore, if this is the input:

102, 111, 114, 32, 97, 32, 114, 101, 118, 105, 101, 119, 32, 115, 101,
101, 32, 66, 114

you'll only ever get a match if you search exactly:
102, 111, 114, 32, 97, 32, 114, 101, 118, 105, 101, 119, 32, 115, 101,
101, 32, 66, 114

None of these will match
102
102,
32
32,
119, 32, 115

etc.

The idea of doing a match on a single _token_ that's over 32K long is
pretty far out there, thus
the check.

The entire multiValued discussion is _probably_ a red herring and
won't help you. multiValued
has nothing to do with multiple terms, that's all up to your field type.

So back up and tell us _how_ you intend to search this field. I'm
guessing you really want
to make it a text-based type instead. But that's just a guess.

Best,
Erick.

On Mon, May 11, 2015 at 8:43 AM, Bernd Fehling
<bernd.fehl...@uni-bielefeld.de> wrote:
> It turned out that I didn't recognized that dcdescription is not indexed,
> only stored. So the next in "chain" ist f_dcperson where dccreator and
> dcdescription is combined and indexed. And this is why the error
> shows up on f_dcperson. ("delay of error")
>
> Thanks for your help, regards.
> Bernd
>
>
> Am 11.05.2015 um 15:35 schrieb Shawn Heisey:
>> On 5/11/2015 7:19 AM, Bernd Fehling wrote:
>>> After reading https://issues.apache.org/jira/browse/LUCENE-5472
>>> one question still remains.
>>>
>>> Why is it complaining about f_dcperson which is a copyField when the
>>> origin problem field is dcdescription which definately is much larger
>>> than 32766?
>>>
>>> I would assume it complains about dcdescription field. Or not?
>>
>> If the value resulting in the error does come from a copyField source
>> that also uses a "string" type, then my guess here is that Solr has some
>> prioritization that causes the copyField destination to be indexed
>> before the sources.  This ordering might make things go a little faster,
>> because if it happens right after copying, all or most of the data for
>> the destination field would already be sitting in one or more of the CPU
>> caches.  Cache hits are wonderful things for performance.
>>
>> Thanks,
>> Shawn
>>

Reply via email to