Re: document support for file system crawling

2007-01-15 Thread Chris Hostetter
: In that respect I agree with the original posting that Solr lacks : functionality with respect to desired functionality. One can argue that : more or less random data should be structured by the user writing a : decent application. However a more easy to use and configurable plugin : architectur

Re: document support for file system crawling

2006-08-30 Thread Chris Hostetter
: the text out of these types of documents. You could borrow the : document parsing pieces from Lucene's contrib and Nutch and glue them : together into your client that speaks to Solr, or perhaps Solr isn't : the right approach for your needs? It certainly is possible to add : these capabiliti

Re: document support for file system crawling

2006-08-30 Thread Erik Hatcher
On Aug 30, 2006, at 2:42 AM, Bruno wrote: browsing through the message thread I tried to find a trail addressing file system crawls. I want to implement an enterprise search over a networked filesystem, crawling all sorts of documents, such as html, doc, ppt and pdf. Nutch provides plugins