Hey, I'm sorry but I just see someone asked the same question and some one answered,I am just copied the answer .I haven't tried index the local files. :)
在 05-9-2,Rajendra Patil<[EMAIL PROTECTED]> 写道: > Hi, > Thanx for the poniter. > > If I index it with "local files", I'll get the search result relative to > local m/c. > But my requirement is something like this. > 1) I've a html file ex: xyz.com homepage... > 2) so while creating segments & fetching data, I want to fetch it from the > saved html file instead of connecting to net & extract the data. > 3) after indexing, search result should point to xyz.com instead of local > html file. > > So is there any direct approach like config file modification available or I > need to modify the code itself. > > Thanx & Regards, > Rajendra > > -----Original Message----- > From: 牟晓峰 [mailto:[EMAIL PROTECTED] > Sent: Friday, September 02, 2005 6:52 AM > To: [email protected] > Subject: Re: how to generate segments with html pages as input > > take a look at the Nutch FAQ ( > http://wiki.apache.org/nutch/FAQ), in section Indexing, "How do I index my > local file system?" > > > 2005/9/1, Rajendra Patil <[EMAIL PROTECTED]>: > > HI, > > Any idea how to generate segments with some html pages instead of > > fetching/crawling from urllist or dmoz . I have bunch of html pages & is > > it possible to create segments with these html pages as input??? > > > > Thanx & Regards, > > Rajendra > > > > > > >
