To be accurate, the metaKey seeded does appear in the fetched Content
Metadata but the _pst_ does not.

Another question is: Will the seeded metadata pass to deeper links ?

Thanks.


On Wed, Dec 11, 2013 at 10:09 PM, Amit Sela <[email protected]> wrote:

> Hi all,
> I would like to add two types of metadata to the ContentData written to
> the Segments.
>
> 1) seeded metadata:
> example for seed row - http://www.xyz.com/\tmetaKey=metaValue
>
> 2) protocol status:
> CrawlDatum metadata corresponding to "_pst_" key.
>
> I thought this would work if I add urlmeta to plugins.include and add
> "_pst_,metaKey" to urlmeta.tags in nutch-site.xml.
> that didn't work...
>
> At which phase is the urlmeta scoring filter executed ? Is the urlmeta the
> way to go here ?
>
> Thanks.
>
>

Reply via email to