Hi All,
I need to add dmoz meta-data to my index. I see some people have commented
about it but I didn't find a solution. Can someone read the steps below and
give me some hints or pointers? This is the code that I added:
1) injector.java: datum.setCategory("dmoz-cat");
2) crawldatum.java: add a new private data 'category' along with set and get
methods for it.
3) BasicIndexingFilter.java: doc.add(new Field("category",
datum.getCategory(),Field.Store.YES, Field.Index.UN_TOKENIZED));
However, the code breaks at the third step ( when I run index ) saying that
category is null.
Another way I was thinking about is whether I am supposed to add the
category to the metadata in CrawlDatum. In that case do I have to modify the
readFields() method on CrawlDatum?
Thanks in advance.
--
View this message in context:
http://www.nabble.com/adding-dmoz-meta-data-to-index.-tf4760430.html#a13614050
Sent from the Nutch - Dev mailing list archive at Nabble.com.