Hi,

Thanks for the pointer. If I index the pages as "local files", the search results will point to paths on the local machine. My requirement is different:

1) I have a saved HTML file, e.g. the xyz.com homepage.
2) While creating segments and fetching data, I want the content to be read from the saved HTML file instead of connecting to the net.
3) After indexing, the search results should point to xyz.com instead of the local HTML file.
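
To make the three requirements above concrete, here is a minimal sketch in plain Python. It is not Nutch code and uses no Nutch API; the names `load_saved_pages` and `search`, the file name `xyz_home.html`, and the URL `http://xyz.com/` are all hypothetical and only illustrate the idea of reading content from disk while keeping the original URL as the key of each record.

```python
from pathlib import Path
import tempfile


def load_saved_pages(url_to_file):
    """Build one record per page, reading the HTML from disk, not the network.

    url_to_file maps the ORIGINAL url (what search results should show)
    to the local file that holds the saved copy of that page.
    """
    records = []
    for url, path in url_to_file.items():
        html = Path(path).read_text(encoding="utf-8")
        # The record is keyed by the original URL, never by the local path.
        records.append({"url": url, "content": html})
    return records


def search(records, term):
    """Toy substring search that returns the original URLs of matching pages."""
    needle = term.lower()
    return [r["url"] for r in records if needle in r["content"].lower()]


# Small demonstration with a temporary file standing in for the saved page.
with tempfile.TemporaryDirectory() as tmp:
    saved = Path(tmp) / "xyz_home.html"  # hypothetical saved homepage
    saved.write_text("<html><body>Welcome to XYZ</body></html>", encoding="utf-8")

    records = load_saved_pages({"http://xyz.com/": saved})
    hits = search(records, "welcome")
    print(hits)
```

The point of the sketch is only the separation between where the bytes come from (the local file) and what identifies the document (the original URL).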
So is there a direct approach, such as modifying a config file, or do I need to modify the code itself?

Thanks & Regards,
Rajendra

-----Original Message-----
From: 牟晓峰 [mailto:[EMAIL PROTECTED]
Sent: Friday, September 02, 2005 6:52 AM
To: [email protected]
Subject: Re: how to generate segments with html pages as input

Take a look at the Nutch FAQ (http://wiki.apache.org/nutch/FAQ), in the Indexing section: "How do I index my local file system?"

2005/9/1, Rajendra Patil <[EMAIL PROTECTED]>:
> Hi,
> Any idea how to generate segments from some HTML pages instead of
> fetching/crawling from a urllist or dmoz? I have a bunch of HTML pages. Is
> it possible to create segments with these HTML pages as input?
>
> Thanks & Regards,
> Rajendra
