[
https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13399228#comment-13399228
]
Markus Jelsma commented on NUTCH-1406:
--------------------------------------
Hello, a few notes on your patch:
* Nutch uses double space for a single indentation, not tabs;
* convertIndicatior seems to be misspelled;
* yyyy-MM-dd doesn't look like Solr's supported DateField as it's missing time
and timezone Z.
> metadata-index plugin: conversion to Solr date format
> -----------------------------------------------------
>
> Key: NUTCH-1406
> URL: https://issues.apache.org/jira/browse/NUTCH-1406
> Project: Nutch
> Issue Type: Improvement
> Components: indexer, parser
> Reporter: Kristof
> Priority: Minor
> Labels: conversion, date
> Attachments: index-metadata.patch
>
>
> This improvement to the index-metatags plugin (sometimes also refered to
> parse-metatags plugin) allows for conversion of selected fields to the Solr
> date format. The main benefit of this conversion is the possibility to create
> range facets.
> In order to convert the values of selected metatags to Solr date format, you
> must specify in nutch-site.xml. This can be for example used with Dublin Core
> elements. A subdomain which would have pages with the meta tag
> dcterms.modified would be cic.gc.ca. dcterms.modified must also be defined in
> the metatags.names and index.parse.md properties.
>
> {code}
> <property>
> <name>index.dateconvert.md</name>
> <value>metatag.dcterms.modified</value>
> <description>For plugin index-metadata: Indicate here the name of the
> html meta tag that should be converted to Solr date format.
> </description>
> </property>
> {code}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira