[
https://issues.apache.org/jira/browse/NUTCH-3122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18015302#comment-18015302
]
ASF GitHub Bot commented on NUTCH-3122:
---------------------------------------
TamimEhsan commented on PR #859:
URL: https://github.com/apache/nutch/pull/859#issuecomment-3208872693
@lewismc It came from the discussion in
https://github.com/apache/stormcrawler/issues/1588 by Sebastian Nagel and the
resulting Jira issue
[NUTCH-3122](https://issues.apache.org/jira/browse/NUTCH-3122):
> According to https://www.ietf.org/rfc/rfc2616.txt (4.2 Message Headers),
> HTTP headers are case insensitive.
I am actually new to Nutch. Sorry if I missed a step in the contribution
process. I could not comment on the Jira ticket or ask for clarification there.
> Make SpellCheckedMetadata case-insensitive for all Metadata names
> -----------------------------------------------------------------
>
> Key: NUTCH-3122
> URL: https://issues.apache.org/jira/browse/NUTCH-3122
> Project: Nutch
> Issue Type: Improvement
> Components: metadata
> Affects Versions: 1.21
> Reporter: Sebastian Nagel
> Priority: Major
> Fix For: 1.22
>
>
> See NUTCH-3002 and the discussion in
> [stormcrawler#1588|https://github.com/apache/stormcrawler/issues/1588]:
> - we have CaseInsensitiveMetadata (used by protocol-okhttp) and
> - SpellCheckedMetadata (used elsewhere), which does case-insensitive look-ups
> only for the headers listed in
> [HttpHeaders|https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/metadata/HttpHeaders.java]
> We might consider making SpellCheckedMetadata case-insensitive in general.
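
To illustrate what "case-insensitive in general" could mean for a metadata container, here is a minimal sketch. It is not the code of Nutch's SpellCheckedMetadata or CaseInsensitiveMetadata; the class name `CaseInsensitiveMetadataSketch` and its methods are hypothetical, and the sketch assumes that backing the container with a `TreeMap` ordered by `String.CASE_INSENSITIVE_ORDER` is an acceptable way to get case-insensitive look-ups for every name, not only those in a fixed list.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; not taken from the Nutch code base.
class CaseInsensitiveMetadataSketch {

  // With CASE_INSENSITIVE_ORDER, "Content-Type" and "content-type" are the
  // same key. The spelling used by the first add() is the one that is kept.
  private final Map<String, List<String>> values =
      new TreeMap<>(String.CASE_INSENSITIVE_ORDER);

  /** Adds a value under the given name, ignoring the case of the name. */
  void add(String name, String value) {
    values.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
  }

  /** Returns the first value stored for the name, or null if there is none. */
  String get(String name) {
    List<String> stored = values.get(name);
    return stored == null ? null : stored.get(0);
  }

  /** Returns all values stored for the name (an empty list if there are none). */
  List<String> getValues(String name) {
    List<String> stored = values.get(name);
    return stored == null ? List.of() : List.copyOf(stored);
  }

  public static void main(String[] args) {
    CaseInsensitiveMetadataSketch metadata = new CaseInsensitiveMetadataSketch();
    metadata.add("Content-Type", "text/html");
    // A name that is not a well-known HTTP header still matches in any case.
    metadata.add("x-custom-header", "a");
    metadata.add("X-Custom-Header", "b");

    System.out.println(metadata.get("content-type"));          // text/html
    System.out.println(metadata.getValues("X-CUSTOM-HEADER")); // [a, b]
  }
}
```

`String.CASE_INSENSITIVE_ORDER` compares without regard to locale, which should be sufficient for ASCII header names; whether it suits arbitrary metadata names is a design question the sketch leaves open.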