Hi Benny, Check out this mail thread
http://www.mail-archive.com/[email protected]/msg00340.html HTH, Praveen. On 8/22/05, Benny <[EMAIL PROTECTED]> wrote: > Hi, > > Can someone give me some hints how index local files? > > I have a lot of plain HTML files (more than 50K pages, the size is > around 2-3k/page). I don't prefer puting them in the web service and > using url to index them. I'd like NUTCH to index them from local HD. > Is it possible? if it is, what kind of url I need inject into db? for > example, if you use web service, we use the > > http://domain/file.html > > How about local HD file's format? I believe no more "http", what's > protocol supposed to be. These file are still in plain HTML format. > > > Benny > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nutch-general mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/nutch-general > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
