Yes you can do it using the readdb command line option of nutch, as shown below:
nutch readdb <index db> -pageurl <pageurl> where indexdb is the db folder in your index, and pageurl is the url you want to know details for. This will print out the next fetch date and other details too. See http://wiki.apache.org/nutch/CommandLineOptions for more command line options for Nutch. - Ravi On 2/22/06, Ilya Kasnacheev <[EMAIL PROTECTED]> wrote: > I need to refetch specific pages after they've changed. To know that, > I have to find out date when page in WebDB was fetched. How do I do > that? Is it possible at all? > > Page objects do not have such attribute... > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
