Renaud Richardet wrote:
> I see. I was thinking that I could index the feed items without having 
> to fetch them individually.

Okay, so if Parser#parse returned a Map<String,Parse> keyed by URL, then 
the URL for each item's parse should be that of the item's link, since 
you don't want to fetch that link separately.  Right?
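
Roughly, something like the sketch below. The FeedItem and Parse classes 
and the parse signature here are made-up stand-ins for illustration, not 
the actual Nutch types; the only point is that each item's parse is keyed 
by the item's link URL.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: these are hypothetical stand-ins,
// not the real Nutch Parser/Parse classes.
public class FeedParseSketch {

    /** Simplified stand-in for a parse result. */
    static final class Parse {
        final String title;
        final String text;

        Parse(String title, String text) {
            this.title = title;
            this.text = text;
        }
    }

    /** Simplified stand-in for one item of an already-fetched feed. */
    static final class FeedItem {
        final String link;
        final String title;
        final String description;

        FeedItem(String link, String title, String description) {
            this.link = link;
            this.title = title;
            this.description = description;
        }
    }

    /**
     * Returns one Parse per feed item, keyed by the item's link URL,
     * so each item can be indexed without fetching its link.
     */
    static Map<String, Parse> parse(List<FeedItem> items) {
        Map<String, Parse> parses = new LinkedHashMap<>();
        for (FeedItem item : items) {
            parses.put(item.link, new Parse(item.title, item.description));
        }
        return parses;
    }

    public static void main(String[] args) {
        List<FeedItem> items = Arrays.asList(
            new FeedItem("http://example.com/a", "First", "first item text"),
            new FeedItem("http://example.com/b", "Second", "second item text"));

        // A caller would then handle each (URL, Parse) pair as if it
        // had been fetched and parsed on its own.
        for (Map.Entry<String, Parse> e : parse(items).entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue().title);
        }
    }
}
```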

So now the question is, how much impact would this change to the Parser 
API have on the rest of Nutch?  It would require changes to all Parser 
implementations, to ParseSegment, to ParseUtil, and to Fetcher.  But, 
as far as I can tell, most of these changes look straightforward.

Doug

_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers
