[ 
https://issues.apache.org/jira/browse/NUTCH-3122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18015449#comment-18015449
 ] 

ASF GitHub Bot commented on NUTCH-3122:
---------------------------------------

sebastian-nagel commented on PR #859:
URL: https://github.com/apache/nutch/pull/859#issuecomment-3210968382

   Thanks, @TamimEhsan! 
   
   To add more details to @lewismc's comment: the "Metadata" class implements 
"Writable" because it needs to be serialized when data is stored in the CrawlDb 
and in segments, or when Nutch is run on a Hadoop cluster and data is exchanged 
between distributed tasks. Ideally, serialization is backward-compatible, that 
is Metadata written before this change is readable afterwards.
   
   > make any comments on the Jira ticket
   
   If you want, you can request a Apache Jira account 
[here](https://selfserve.apache.org/jira-account.html).




> Make SpellCheckedMetadata case-insensitive for all Metadata names
> -----------------------------------------------------------------
>
>                 Key: NUTCH-3122
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3122
>             Project: Nutch
>          Issue Type: Improvement
>          Components: metadata
>    Affects Versions: 1.21
>            Reporter: Sebastian Nagel
>            Priority: Major
>             Fix For: 1.22
>
>
> See NUTCH-3002 and the discussion in 
> [stormcrawler#1588|https://github.com/apache/stormcrawler/issues/1588]:
> - we have CaseInsensitiveMetadata (used by protocol-okhttp) and
> - SpellCheckedMetadata (used elsewhere) which does case-insensitive look-ups 
> only for header listed in 
> [HttpHeaders|https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/metadata/HttpHeaders.java]
> We might consider to make SpellCheckedMetadata case-insensitive in general.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to