Hi Alex,
On Tue, Jun 17, 2014 at 2:06 PM, <[email protected]> wrote:
>
> I am using nutch-2.x with GORA_97.
You mean GORA-94, the Avro upgrade?
With which gora- backend please?
> Further investigation shows that DbUpdateReducer
> calls
> inlinkedScoreData.clear();
>
I see this on line ~72 of DbUpdateReducer
>
> and it calls this function
>
> public void readFields(DataInput in) throws IOException {
>
Can you please point me to where ScoreDatum#readFields is called?
>
> And metaData.clear(); line clears all metadata.
>
Yes this should result in an empty HashMap data structure.
>
> Why metaData.clear(); line is needed in this function?
>
>
It is poorly documented and this Class has not be altered for some time so
off the top of my head I need to say that I do not know why. Based on the
Javadoc for Writable, @Override readFields should "...should attempt to
re-use storage in the existing object where possible." so I am not sure why
we clear the metadata from the HashMap structure. I would need to debug
this to understand.
If you can provide more context on where ScoreDatum#readFields is called
then I can set break point up until then.
Thanks Alex
Lewis