RE: OutOfMemoryException while Indexing an XML file

2003-02-14 Thread Marcel Stor
-Original Message- From: Rob Outar [mailto:[EMAIL PROTECTED]] Sent: Freitag, 14. Februar 2003 14:13 To: Lucene Users List Subject: OutOfMemoryException while Indexing an XML file Hi all, I was using the sample code provided I believe by Doug Cutting to index an XML

Index entire filesystem

2003-11-05 Thread Marcel Stor
Hi all, I'm thinkin' about writing a search tool for my filesystem. I know such things exist already but programming it myself is much more fun ;-) So, I would have Lucene crawl through my filesystem and pass each file to an appropriate indexer (PDF - PDFbox, etc.). Yes, I run a Windows system

RE: Document Clustering

2003-11-11 Thread Marcel Stor
Stefan Groschupf wrote: Hi, How is document clustering different/related to text categorization? Clustering: try to find own categories and put documents that match in it. You group all documents with minimal distance together. Would I be correct to say that you have to define a distance