Look at the IndexWriter Javadocs. One of the fields allows you to set maximum term length. This may also be a problem with the HTML parser you are using. You didn't share a lot of details, so I cannot help more.
Otis --- Syr�n_Per <[EMAIL PROTECTED]> wrote: > Hi all, > > Have a question concerning indexing of HTML files. > > One of the files I'm trying to index have a <input type="image" ...> > tag > that also contain a call to a javascript with a string argument that > is > about 1300 characters long. At this point Lucene seems to stop > indexing the > remaining part the current document, but do index the other files in > the > same directory. > > How do I workaround this? > > psy > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > __________________________________ Do you Yahoo!? Yahoo! Hotjobs: Enter the "Signing Bonus" Sweepstakes http://hotjobs.sweepstakes.yahoo.com/signingbonus --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
