[ https://issues.apache.org/jira/browse/NUTCH-3122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18015449#comment-18015449 ]
ASF GitHub Bot commented on NUTCH-3122:
---------------------------------------

sebastian-nagel commented on PR #859:
URL: https://github.com/apache/nutch/pull/859#issuecomment-3210968382

   Thanks, @TamimEhsan!

   To add more details to @lewismc's comment: the "Metadata" class implements "Writable" because it needs to be serialized when data is stored in the CrawlDb and in segments, or when Nutch is run on a Hadoop cluster and data is exchanged between distributed tasks. Ideally, serialization stays backward-compatible, that is, Metadata written before this change remains readable afterwards.

   > make any comments on the Jira ticket

   If you want, you can request an Apache Jira account [here](https://selfserve.apache.org/jira-account.html).


> Make SpellCheckedMetadata case-insensitive for all Metadata names
> -----------------------------------------------------------------
>
>                 Key: NUTCH-3122
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3122
>             Project: Nutch
>          Issue Type: Improvement
>          Components: metadata
>    Affects Versions: 1.21
>            Reporter: Sebastian Nagel
>            Priority: Major
>             Fix For: 1.22
>
>
> See NUTCH-3002 and the discussion in
> [stormcrawler#1588|https://github.com/apache/stormcrawler/issues/1588]:
> - we have CaseInsensitiveMetadata (used by protocol-okhttp) and
> - SpellCheckedMetadata (used elsewhere) which does case-insensitive look-ups
> only for headers listed in
> [HttpHeaders|https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/metadata/HttpHeaders.java]
> We might consider making SpellCheckedMetadata case-insensitive in general.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)