Armel T. Nene wrote: > Andrzej, the feature that I am after can be implemented by this patch if I > just adapt it right. I am not sure of this but the patch seems a little bit > old to be implemented in the latest release of Nutch 0.8.1. >
Right, that's why I wrote it needs to be brought up-to-date with the current trunk/ . > I want to implement a feature where the fetcher will fetch files but only > add them if there have been modified after the latest fetch time. Now, I > want to implement that on a filesystem first and then update later for > network fetching. I would like to have a look at your full source code for > your patch in a zip file if possible. Once the feature implemented, I will > post it back here. I'd like to start working from your code first. You can > either make the source code available here or mail them to me at armel dot > nene @ idna-solutions dot com. > Patches attached to the JIRA issue already support this. Please bear in mind that the notion of "change" is dependent on how you compare the content of old and new pages, especially if you lack the Last-Modified header from the server. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
