120k documents?! You're asking a lot of the *demo*. It was not ever meant to be a production-quality HTML or text file processor. My suggestion is that you create your own custom indexer code using something more production quality for HTML parsing like Neko HTML or JTidy.

Erik


On Tuesday, November 4, 2003, at 04:04 PM, Chong, Herb wrote:
no change in 1.3RC2. the crash is at the same statement, although the absolute line number has moved.

this is the demo IndexFiles application with a change in the main() method to allow specifying more than one file/directory in the parameter list. i also modified the FileWriter class to save the first 300 bytes of each document in an Unindexed field. i used the 1.2 version of the demo application to as my base for these changes. the application is being run with a 1000M JVM size. there are about 120,000 documents averaging about 600 bytes in size.

Herb....
-----Original Message-----
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: Tuesday, November 04, 2003 1:42 PM
To: Lucene Users List
Subject: Re: crash in Lucene


Could you try the latest CVS version or 1.3 RC build and see if the problem has been resolved?

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to