Hello, I'm using Nutch 2.1 on top of Hadoop 1.0.4, with HBase 0.90.4 as storage system. I run Nutch in distributed mode.
I need to associate an id to each url inside the seed list of nutch and to store this information in HBase. I think that I have to create a new column family in HBase and modify the gora and hbase configuration files in the nutch conf folder. However, I think I need to modify the code of Nutch, but I don't know which classes I have to modify. I googled a bit, but I didn't find any documentation; I've searched inside the code but I wasn't able to solve my problem. Can anybody help me? Thank you! -- Adriana Farina

