[ 
https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kristof  updated NUTCH-1406:
----------------------------

    Description: 
This improvement to the index-metatags plugin (sometimes also refered to 
parse-metatags plugin) allows for conversion of selected fields to the Solr 
date format.

In order to convert the values of selected metatags to Solr date format, you 
must specify in nutch-site.xml. This can be for example used with Dublin Core 
elements.  dcterms.modified with the seed url http://www.cic.gc.ca 
dcterms.modified must also be defined in the metatags.names and index.parse.md 
propertie. 
{code}
<property>
        <name>index.dateconvert.md</name>
        <value>metatag.dcterms.modified</value>
        <description>For plugin index-metadata: Indicate here the name of the 
html meta tag that should be converted to Solr date format.
        </description>
</property>
{code}


  was:
This improvement to the index-metatags plugin (sometimes also refered to 
parse-metatags plugin) allows for conversion of selected fields to the Solr 
date format.

In order to convert the values of selected metatags to Solr date format, you 
must specify in nutch-site.xml. The example used is an extended Dublin Core 
element dcterms.modified with the seed url http://www.cic.gc.ca/. 
dcterms.modified must also be defined in the metatags.names property.
{code}
<property>
        <name>index.dateconvert.md</name>
        <value>dcterms.modified</value>
        <description>For plugin index-metadata: Indicate here the name of the 
html meta tag that should be converted to Solr date format.
        </description>
</property>
{code}


    
> metadata-index plugin: conversion to Solr date format
> -----------------------------------------------------
>
>                 Key: NUTCH-1406
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1406
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer, parser
>            Reporter: Kristof 
>            Priority: Minor
>              Labels: conversion, date
>         Attachments: index-metadata.patch
>
>
> This improvement to the index-metatags plugin (sometimes also refered to 
> parse-metatags plugin) allows for conversion of selected fields to the Solr 
> date format.
> In order to convert the values of selected metatags to Solr date format, you 
> must specify in nutch-site.xml. This can be for example used with Dublin Core 
> elements.  dcterms.modified with the seed url http://www.cic.gc.ca 
> dcterms.modified must also be defined in the metatags.names and 
> index.parse.md propertie. 
> {code}
> <property>
>       <name>index.dateconvert.md</name>
>       <value>metatag.dcterms.modified</value>
>       <description>For plugin index-metadata: Indicate here the name of the 
> html meta tag that should be converted to Solr date format.
>       </description>
> </property>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to