According to the documentation nutch inject is supposed to allow for an entry
with custom score and fetchInterval as well as custom metadata values.

I have tried injecting a tab delimited text file with entires like:
http://www.domain-one.com/ nutch.score=10 nutch.fetchInterval=172800     
memberType=pi
http://www.domain-two.com/   nutch.score=10 nutch.fetchInterval=172800     
memberType=pi
http://www.domain-three.com/  nutch.score=10 nutch.fetchInterval=172800      
memberType=pi

yet when I look them up with:
bin/nutch readdb crawldb -url http://www.domain-one.com/ while the metadata is
included the retry interval and score are the default values.
 ie.
URL: http://www.domain-one.com/
Version: 7
Status: 1 (db_unfetched)
Fetch time: Tue Jun 12 14:04:51 EDT 2012
Modified time: Wed Dec 31 19:00:00 EST 1969
Retries since fetch: 0
Retry interval: 2592000 seconds (30 days)
Score: 1.0
Signature: null
Metadata: memberType: pi


I am running Nutch 1.4 and had made sure the urls were not already in the
crawldb before injecting.

Is this feature of inject no longer supported and/or is the documentation
incorrect?

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.

Reply via email to