Hello,

How could I inject metadata for urls that I provide?

In Injector.java :

/** This class takes a flat file of URLs and adds them to the of pages to be
 * crawled.  Useful for bootstrapping the system.
 * The URL files contain one URL per line, optionally followed by
custom metadata
 * separated by tabs with the metadata key separated from the
corresponding value by '='. <br>
 * Note that some metadata keys are reserved : <br>
 * - <i>nutch.score</i> : allows to set a custom score for a specific URL <br>
 * - <i>nutch.fetchInterval</i> : allows to set a custom fetch
interval for a specific URL <br>
 * e.g. http://www.nutch.org/ \t nutch.score=10 \t
nutch.fetchInterval=2592000 \t userType=open_source
 **/


could I extend this structure to store metadata about urls?

Best Regards,
-C.B.

Reply via email to