[
https://issues.apache.org/jira/browse/OAK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Amit Jain resolved OAK-10384.
-----------------------------
Fix Version/s: 1.58.0
Resolution: Fixed
On trunk with -
https://github.com/apache/jackrabbit-oak/commit/1e55c01b22396239653549b3684bd9d71c606307
> Fix stripping of large indexed ordered properties
> -------------------------------------------------
>
> Key: OAK-10384
> URL: https://issues.apache.org/jira/browse/OAK-10384
> Project: Jackrabbit Oak
> Issue Type: Bug
> Components: lucene
> Reporter: Amit Jain
> Assignee: Amit Jain
> Priority: Major
> Fix For: 1.58.0
>
>
> Currently, the ordered indexed properties are truncated at the max length
> supported by lucene at 32766 in the
> [LuceneDocumentMaker|https://github.com/apache/jackrabbit-oak/blob/trunk/oak-lucene/src/main/java/org/apache/jackrabbit/oak/plugins/index/lucene/LuceneDocumentMaker.java#L290-L294].
> The problem is lucene uses a class {{BytesRef}} to represent strings which
> converts it to UTF-8. It then uses the length from this converted string to
> enforce the limit. The transformation between java unicode string to utf-8
> can cause the length to increase for non-ascii characters.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)