[
https://issues.apache.org/jira/browse/NUTCH-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16045450#comment-16045450
]
Kaidul Islam commented on NUTCH-2393:
-------------------------------------
Hi [~wastl-nagel] Thanks for pointing out, I've changed {{buf.array().length}}
to {{buf.remaining()}} and updated the PR. Didn't know the
{{page.getContent()}} can contain more than a single page, I will check the
code flow. As you checked with single page, is any further testing needed for
scenario when {{page.getContent()}} will contain more than one page?
> 2.x patch for MD5 duplication issue addressed in NUTCH-2391
> -----------------------------------------------------------
>
> Key: NUTCH-2393
> URL: https://issues.apache.org/jira/browse/NUTCH-2393
> Project: Nutch
> Issue Type: Bug
> Components: commoncrawl
> Affects Versions: 2.3.1
> Reporter: Kaidul Islam
> Assignee: Kaidul Islam
> Priority: Minor
> Fix For: 2.4
>
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> Equivalent patch for 2.x for issue addressed in NUTCH-2391
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)