There is https://issues.apache.org/jira/browse/LUCENE-3837 but I suppose
it's too far from completion.

On Wed, Apr 4, 2012 at 2:48 PM, Ravish Bhagdev <ravish.bhag...@gmail.com>wrote:

> Updating a single field is not possible in solr.  The whole record has to
> be rewritten.
>
> 300 MB is still not that big a file.  Have you tried doing the indexing (if
> its only a one time thing) by giving it ~2 GB or xmx?
>
> A single file with that size is strange!  May I ask what is it?
>
> Rav
>
> On Tue, Apr 3, 2012 at 7:32 PM, vybe3142 <vybe3...@gmail.com> wrote:
>
> >
> > Some days ago, I posted about an issue with SOLR running out of memory
> when
> > attempting to index large text files (say 300 MB ). Details at
> >
> >
> http://lucene.472066.n3.nabble.com/Solr-Tika-crashing-when-attempting-to-index-large-files-td3846939.html
> >
> > Two things I need to point out:
> >
> > 1. I don't need Tika for content extraction as the files are already in
> > plain text format.
> > 2. The heap space error was caused by a futile Tika/SOLR attempt at
> > creating
> > the corresponding huge XML document in memory
> >
> > I've decided to develop a custom handler that
> > 1. reads the file text directly
> > 2. attempts to create a SOLR document and directly add the text data to
> the
> > corresponding field.
> >
> > One approach I've taken is to read manageable chunks of text data
> > sequentially from the file and process. We've used this approach
> > sucessfully
> > with Lucene in the past and I'm attempting to make it work with SOLR
> too. I
> > got most of the work done yesterday, but need a bit of guidance w.r.t.
> > point
> > 2.
> >
> > How can I achieve updating the same field multiple times. Looking at the
> > SOLR source, processor.addField() merely
> > a. adds to the in-memory field map and
> > b. attempts to write EVERYTHING to the index later on.
> >
> > In my situation, (a) eventually causes a heap space error.
> >
> >
> >
> >
> > Here's part of the handler code.
> >
> >
> >
> > Thanks much
> >
> > Thanks
> >
> > --
> > View this message in context:
> >
> http://lucene.472066.n3.nabble.com/Incremantally-updating-a-VERY-LARGE-field-Is-this-possibe-tp3881945p3881945.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
> >
>



-- 
Sincerely yours
Mikhail Khludnev
ge...@yandex.ru

<http://www.griddynamics.com>
 <mkhlud...@griddynamics.com>

Reply via email to