[
https://issues.apache.org/jira/browse/CONNECTORS-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14615550#comment-14615550
]
Shinichiro Abe commented on CONNECTORS-1219:
--------------------------------------------
r1689485.
StringBuilder(int capacity) , this capacity was approximately 700 MB. In the
past I realized Solrj also have the same limitation, even though Solrj doesn't
use StringBuilder, but use String.getBytes().
org.apache.lucene.lucene.util.ArrayUtil.grow is also using byte array, maybe
occurs OOM when exceeding that size.
> Lucene Output Connector
> -----------------------
>
> Key: CONNECTORS-1219
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1219
> Project: ManifoldCF
> Issue Type: New Feature
> Reporter: Shinichiro Abe
> Assignee: Shinichiro Abe
> Attachments: CONNECTORS-1219-v0.1patch.patch,
> CONNECTORS-1219-v0.2.patch
>
>
> A output connector for Lucene local index directly, not via remote search
> engine. It would be nice if we could use Lucene various API to the index
> directly, even though we could do the same thing to the Solr or Elasticsearch
> index. I assume we can do something to classification, categorization, and
> tagging, using e.g lucene-classification package.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)