that's just the way the toString() method concatenates things, the key
values are stored correctly and this should not be a problem.
look at plugin/urlmeta for a way of propagating the features to the outlinks

On 25 July 2011 17:47, Cam Bazz <[email protected]> wrote:

> Hello,
>
> I have figured out that it can be done indeed. However when I
> inject/generate/readdb dump
>
> Score: 1.0
> Signature: null
> Metadata: status: 9catId: 1
>
> In the metadata part there is no space between 9 and catId, I wonder
> if that is a problem.
>
> Best Regards,
> C.B.
>
>
>
> On Mon, Jul 25, 2011 at 7:21 PM, Cam Bazz <[email protected]> wrote:
> > Hello,
> >
> > How could I inject metadata for urls that I provide?
> >
> > In Injector.java :
> >
> > /** This class takes a flat file of URLs and adds them to the of pages to
> be
> >  * crawled.  Useful for bootstrapping the system.
> >  * The URL files contain one URL per line, optionally followed by
> > custom metadata
> >  * separated by tabs with the metadata key separated from the
> > corresponding value by '='. <br>
> >  * Note that some metadata keys are reserved : <br>
> >  * - <i>nutch.score</i> : allows to set a custom score for a specific URL
> <br>
> >  * - <i>nutch.fetchInterval</i> : allows to set a custom fetch
> > interval for a specific URL <br>
> >  * e.g. http://www.nutch.org/ \t nutch.score=10 \t
> > nutch.fetchInterval=2592000 \t userType=open_source
> >  **/
> >
> >
> > could I extend this structure to store metadata about urls?
> >
> > Best Regards,
> > -C.B.
> >
>



-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

Reply via email to