Hey,
The lucene document id , an integer, may not be same for 2 different
crawls.
I am not sure if this is wht u r looking for but U can store a hash
value of the url crawled ;)
- Sagar
Sagar Vibhute wrote:
Hello,
Does nutch/lucene provide for a unique ID for every item that it has
crawled?
I checked the Lucene docid but from what I understood, the lucene docid is
not unique for every item crawled. Is that so?
How can I get this unique ID, if it is available?
Thanks.
- Sagar
--
This message has been scanned for viruses and
dangerous content and is believed to be clean.