Hi All,

I need to add dmoz meta-data to my index. I see some people have commented
about it but I didn't find a solution. Can someone read the steps below and
give me some hints or pointers? This is the code that I added:

1) injector.java: datum.setCategory("dmoz-cat"); 

2) crawldatum.java: add a new private data 'category' along with set and get
methods for it. 

3) BasicIndexingFilter.java: doc.add(new Field("category",
datum.getCategory(),Field.Store.YES, Field.Index.UN_TOKENIZED));

However, the code breaks at the third step ( when I run index ) saying that
category is null. 

Another way I was thinking about is whether I am supposed to add the
category to the metadata in CrawlDatum. In that case do I have to modify the
readFields() method on CrawlDatum? 

Thanks in advance.



-- 
View this message in context: 
http://www.nabble.com/adding-dmoz-meta-data-to-index.-tf4760430.html#a13614050
Sent from the Nutch - Dev mailing list archive at Nabble.com.

Reply via email to