[
https://issues.apache.org/jira/browse/NUTCH-3122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18015449#comment-18015449
]
ASF GitHub Bot commented on NUTCH-3122:
---------------------------------------
sebastian-nagel commented on PR #859:
URL: https://github.com/apache/nutch/pull/859#issuecomment-3210968382
Thanks, @TamimEhsan!
To add more details to @lewismc's comment: the "Metadata" class implements
"Writable" because it needs to be serialized when data is stored in the CrawlDb
and in segments, or when Nutch is run on a Hadoop cluster and data is exchanged
between distributed tasks. Ideally, serialization is backward-compatible, that
is Metadata written before this change is readable afterwards.
> make any comments on the Jira ticket
If you want, you can request a Apache Jira account
[here](https://selfserve.apache.org/jira-account.html).
> Make SpellCheckedMetadata case-insensitive for all Metadata names
> -----------------------------------------------------------------
>
> Key: NUTCH-3122
> URL: https://issues.apache.org/jira/browse/NUTCH-3122
> Project: Nutch
> Issue Type: Improvement
> Components: metadata
> Affects Versions: 1.21
> Reporter: Sebastian Nagel
> Priority: Major
> Fix For: 1.22
>
>
> See NUTCH-3002 and the discussion in
> [stormcrawler#1588|https://github.com/apache/stormcrawler/issues/1588]:
> - we have CaseInsensitiveMetadata (used by protocol-okhttp) and
> - SpellCheckedMetadata (used elsewhere) which does case-insensitive look-ups
> only for header listed in
> [HttpHeaders|https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/metadata/HttpHeaders.java]
> We might consider to make SpellCheckedMetadata case-insensitive in general.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)