Thomas Mueller created OAK-9707:
-----------------------------------
Summary: Don't fail oak-run indexing on invalid data
Key: OAK-9707
URL: https://issues.apache.org/jira/browse/OAK-9707
Project: Jackrabbit Oak
Issue Type: Improvement
Components: indexing, oak-run
Reporter: Thomas Mueller
Assignee: Thomas Mueller
Error like the one below currently mean oak-run indexing will fail. Instead, a
warning should be logged, and just this field should be removed from the
document.
{noformat}
java.lang.IllegalArgumentException: DocValuesField
":dvjcr:content/metadata/dc:title" is too large, must be <= 32766
at
org.apache.lucene.index.SortedDocValuesWriter.addValue(SortedDocValuesWriter.java:68)
at
org.apache.lucene.index.DocValuesProcessor.addSortedField(DocValuesProcessor.java:125)
at
org.apache.lucene.index.DocValuesProcessor.addField(DocValuesProcessor.java:59)
at
org.apache.lucene.index.TwoStoredFieldsConsumers.addField(TwoStoredFieldsConsumers.java:36)
at
org.apache.lucene.index.DocFieldProcessor.processDocument(DocFieldProcessor.java:236)
at
org.apache.lucene.index.DocumentsWriterPerThread.updateDocument(DocumentsWriterPerThread.java:253)
at
org.apache.lucene.index.DocumentsWriter.updateDocument(DocumentsWriter.java:455)
at
org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:1534)
at
org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1204)
at
org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1185)
at
org.apache.jackrabbit.oak.plugins.index.lucene.writer.DefaultIndexWriter.updateDocument(DefaultIndexWriter.java:93)
at
org.apache.jackrabbit.oak.plugins.index.lucene.writer.DefaultIndexWriter.updateDocument(DefaultIndexWriter.java:54)
at
org.apache.jackrabbit.oak.plugins.index.lucene.writer.MultiplexingIndexWriter.updateDocument(MultiplexingIndexWriter.java:60)
at
org.apache.jackrabbit.oak.plugins.index.lucene.writer.MultiplexingIndexWriter.updateDocument(MultiplexingIndexWriter.java:37)
at
org.apache.jackrabbit.oak.index.indexer.document.LuceneIndexer.writeToIndex(LuceneIndexer.java:108)
at
org.apache.jackrabbit.oak.index.indexer.document.LuceneIndexer.index(LuceneIndexer.java:80)
at
org.apache.jackrabbit.oak.index.indexer.document.CompositeIndexer.index(CompositeIndexer.java:58)
at
org.apache.jackrabbit.oak.index.indexer.document.DocumentStoreIndexerBase.reindex(DocumentStoreIndexerBase.java:223)
{noformat}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)