Hi Anand,

Couple of things.

1. Good to hear you solved your problem. I will be working on the Jira issue regardless.
2. If you are interested in contributing your code to the Nutch community, it would be welcome. You can open a Jira issue and upload it, or alternatively you can open a wiki page and embed your code there. It's up to you.

Thanks for the feedback anyway,
Lewis

On Wednesday, March 13, 2013, Anand Bhagwat <[email protected]> wrote:
> Hi Lewis,
> I looked at the JIRA you mentioned and it's a little different than what I was
> looking for. What I need is a way to associate the seed URL with all the records
> which are derived from that URL. So I added seedUrl and its value to the
> metadata column during the inject phase if it is null, and later on in the updatedb
> phase I propagated it to subsequent outlinks / new records. So now all the
> records and any future child records will have the same seedUrl as one of
> their metadata entries.
>
> I was looking for some plugin which I could use, but in this case I did not
> find any suitable plugin.
>
> Regards,
> Anand.
>
> On 13 March 2013 22:40, Lewis John Mcgibbney <[email protected]> wrote:
>
>> Hi Anand,
>> The first step is to look at the issue over on NUTCH-1533.
>> If you feel like addressing anything then please do.
>> This particular issue has nothing to do with Gora or Hadoop, so you will
>> not need to look at any of the code there.
>> I will also be working on that issue when I get some time.
>> Thanks
>> Lewis
>>
>> On Mon, Mar 11, 2013 at 9:44 PM, Anand Bhagwat <[email protected]> wrote:
>>
>> > I would love to work on it but the thing is I am new to all the frameworks
>> > which are being used here. I mean Apache Hadoop, Apache Gora and Nutch
>> > itself. I am going through the source code of Nutch 2. But as you said, with
>> > a little bit of help I think I would be able to contribute.
>> >
>> > -Anand.
>> >
>> >

--
*Lewis*
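
For anyone following the thread, the seed-URL propagation Anand describes can be sketched roughly as below. This is only a plain-Python model of the idea, not Nutch code: the `inject` and `updatedb` functions, the dictionary-based table, and the shape of the `outlinks` argument are all assumptions made for illustration. Only the `seedUrl` metadata key and the two phase names come from the thread.

```python
SEED_KEY = "seedUrl"  # metadata key named in the thread


def inject(seed_urls):
    """Model of the inject phase: one record per seed URL.

    A record's seedUrl is set to the URL itself only if it is not set yet.
    The table layout (url -> {"metadata": {...}}) is a hypothetical stand-in
    for the real storage.
    """
    table = {}
    for url in seed_urls:
        record = table.setdefault(url, {"metadata": {}})
        if record["metadata"].get(SEED_KEY) is None:
            record["metadata"][SEED_KEY] = url
    return table


def updatedb(table, outlinks):
    """Model of the updatedb phase: add outlink records and copy the parent's seedUrl.

    `outlinks` maps a fetched URL to the list of URLs found on that page.
    An existing seedUrl on a child record is never overwritten.
    """
    for parent_url, children in outlinks.items():
        parent = table.get(parent_url)
        if parent is None:
            continue  # unknown parent: nothing to propagate
        seed = parent["metadata"].get(SEED_KEY)
        for child_url in children:
            child = table.setdefault(child_url, {"metadata": {}})
            if seed is not None and child["metadata"].get(SEED_KEY) is None:
                child["metadata"][SEED_KEY] = seed
    return table


# Two crawl rounds: the seed links to /a, and /a links to /a/b.
table = inject(["http://example.com/"])
updatedb(table, {"http://example.com/": ["http://example.com/a"]})
updatedb(table, {"http://example.com/a": ["http://example.com/a/b"]})
print(table["http://example.com/a/b"]["metadata"][SEED_KEY])
```

In this model a page reachable from two different seeds keeps whichever seedUrl reached it first, because the value is only written when it is missing. Whether that matches the behaviour of Anand's actual change is not stated in the thread.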

