Hi all,
I would like to add two types of metadata to the ContentData written to the
Segments.

1) Seed metadata:
example seed row (\t stands for a tab) - http://www.xyz.com/\tmetaKey=metaValue

2) Protocol status:
the CrawlDatum metadata value stored under the "_pst_" key.

I thought this would work if I added urlmeta to plugins.include and set
urlmeta.tags to "_pst_,metaKey" in nutch-site.xml, but that didn't work.
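
As a rough sketch, the configuration described above would look something like the following in nutch-site.xml. The plugin list is illustrative only: everything in it other than urlmeta is an assumption, not a copy of the real file. Note also that stock Nutch spells the property plugin.includes, so the exact name is worth double-checking.

```xml
<!-- nutch-site.xml (sketch; the plugin list is an assumed example) -->
<property>
  <name>plugin.includes</name>
  <!-- Assumed value: keep the existing plugin regex and append urlmeta -->
  <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|scoring-opic|urlmeta</value>
</property>

<property>
  <name>urlmeta.tags</name>
  <!-- The two keys from the question: protocol status and the seeded key -->
  <value>_pst_,metaKey</value>
</property>
```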

At which phase is the urlmeta scoring filter executed? Is urlmeta the
way to go here?

Thanks.
