Look at the IndexWriter Javadocs.  One of the fields allows you to set
maximum term length.  This may also be a problem with the HTML parser
you are using.  You didn't share a lot of details, so I cannot help
more.

Otis

--- Syr�n_Per <[EMAIL PROTECTED]> wrote:
> Hi all,
> 
> Have a question concerning indexing of HTML files.
> 
> One of the files I'm trying to index have a <input type="image" ...>
> tag
> that also contain a call to a javascript with a string argument that
> is
> about 1300 characters long. At this point Lucene seems to stop
> indexing the
> remaining part the current document, but do index the other files in
> the
> same directory.
> 
> How do I workaround this?
> 
> psy 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 


__________________________________
Do you Yahoo!?
Yahoo! Hotjobs: Enter the "Signing Bonus" Sweepstakes
http://hotjobs.sweepstakes.yahoo.com/signingbonus

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to