[ 
https://issues.apache.org/jira/browse/NUTCH-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13021732#comment-13021732
 ] 

Dietrich Schmidt commented on NUTCH-985:
----------------------------------------

Ideally org.apache.nutch.indexer.more.MoreIndexingFilter should store the 
lastModifiedDate in date format. Having limited knowledge about the Nutch 
source, I am not sure whether  dependencies exist that would break things by 
doing that, but at this point I can't see what that would be.  

> Problems indexing lastModifiedDate in Solr
> ------------------------------------------
>
>                 Key: NUTCH-985
>                 URL: https://issues.apache.org/jira/browse/NUTCH-985
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>            Reporter: Dietrich Schmidt
>         Attachments: indexlastmodifieddate.jar
>
>
> I am using the index-more plugin to parse the lastModified data in web
> pages in order to store it in a Solr data field.
> In solrindex-mapping.xml I am mapping lastModified to a field "changed" in 
> Solr:
>                 <field dest="changed" source="lastModified"/>
> However, when posting data to Solr the SolrIndexer posts it as a long,
> not as a date:
> <add><doc boost="1.0"><field
> name="changed">1079326800000</field><field
> name="tstamp">20110414144140188</field><field
> name="date">20040315</field>
> Solr rejects the data because of the improper data type.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to