[jira] Commented: (LUCENE-1221) DocumentsWriter truncates term text at \uFFFF

Marcel Reutegger (JIRA) Tue, 11 Mar 2008 11:24:16 -0700

    [ https://issues.apache.org/jira/browse/LUCENE-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12577544#action_12577544 ]


Marcel Reutegger commented on LUCENE-1221:
------------------------------------------

> How/why are you seeing/using this character in Jackrabbit

To avoid an excessive number of Lucene fields, we prefix term values with the 
JCR property name and put everything under the same Lucene field name. The 
character \uFFFF separates the property name from the property value.
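
As a rough illustration of that scheme, here is a minimal sketch. It is not the actual Jackrabbit code: the class name, the shared field name, and the sample property are made up for this example, and it uses plain strings rather than Lucene classes.

```java
// Hypothetical helper, for illustration only; not taken from Jackrabbit.
final class NamedValue {
    // Separator described above: U+FFFF, a Unicode noncharacter.
    static final char SEPARATOR = '\uFFFF';

    // Assumed name of the single Lucene field that holds all properties.
    static final String SHARED_FIELD = "properties";

    // Builds the term text: property name, separator, property value.
    static String encode(String propertyName, String value) {
        return propertyName + SEPARATOR + value;
    }

    // Returns the property name part, or "" if there is no separator.
    static String propertyName(String termText) {
        int i = termText.indexOf(SEPARATOR);
        return i < 0 ? "" : termText.substring(0, i);
    }

    // Returns the value part, or the whole text if there is no separator.
    static String value(String termText) {
        int i = termText.indexOf(SEPARATOR);
        return i < 0 ? termText : termText.substring(i + 1);
    }

    public static void main(String[] args) {
        String text = encode("jcr:title", "Hello");
        System.out.println(SHARED_FIELD + " -> "
                + propertyName(text) + " / " + value(text));
    }
}
```

With this layout, if only the text up to the \uFFFF is written, as the issue describes, every value of a property ends up as the bare property name and the values can no longer be told apart.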

See: JCR-106. That was before Lucene 2.1, when each field had a separate norm 
file.

> DocumentsWriter truncates term text at \uFFFF
> ---------------------------------------------
>
>                 Key: LUCENE-1221
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1221
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Index
>    Affects Versions: 2.3, 2.3.1
>            Reporter: Marcel Reutegger
>            Priority: Minor
>         Attachments: OddTermTest.java
>
>
> When a Term text contains the unicode 'character' \uFFFF, DocumentsWriter 
> will truncate the text and only write the text up to the \uFFFF character.
> This was introduced with the changes for LUCENE-843 to reduce memory usage 
> and improve performance.
> This change in behavior prevents us (Jackrabbit) from upgrading to Lucene 2.3.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


