[ https://issues.apache.org/jira/browse/NUTCH-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13021712#comment-13021712 ]
Markus Jelsma commented on NUTCH-985:
-------------------------------------
This is similar to another issue reported today about dedup failing.
Although I believe it would be a good idea to convert longs to properly
formatted dates for 1.3, I think it will be quite a task, since it involves
more than reformatting the values before sending them to Solr. Dedup, for
example, relies on dates being stored as longs in Solr in order to work. I'm
also unsure whether a simple reformat in the Solr indexer is a better idea
than changing the plugins themselves.
Thoughts?
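
For illustration, a minimal sketch of what the "simple reformat" could look
like, using the value from the report quoted below. This is not Nutch code:
the class and method names are hypothetical, and it assumes a JDK that
provides java.time. Solr date fields expect ISO 8601 timestamps in UTC (for
example 1995-12-31T23:59:59Z), so the long would have to be converted before
posting, and anything that still needs the long (dedup, for instance) would
have to convert it back.

```java
import java.time.Instant;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, not part of Nutch or SolrJ.
public class SolrDateSketch {

    // Epoch milliseconds -> ISO 8601 UTC string, the form Solr date fields accept.
    static String toSolrDate(long epochMillis) {
        return DateTimeFormatter.ISO_INSTANT.format(Instant.ofEpochMilli(epochMillis));
    }

    // ISO 8601 UTC string -> epoch milliseconds, for code that still works on longs.
    static long fromSolrDate(String solrDate) {
        return Instant.parse(solrDate).toEpochMilli();
    }

    public static void main(String[] args) {
        long lastModified = 1079326800000L; // "changed" value from the report
        String formatted = toSolrDate(lastModified);
        System.out.println(formatted);               // 2004-03-15T05:00:00Z
        System.out.println(fromSolrDate(formatted)); // 1079326800000
    }
}
```

Whether such a conversion belongs in the Solr indexer or in the plugins is
exactly the open question; the sketch only shows that the round trip between
the two representations is lossless at millisecond precision.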
> Problems indexing lastModifiedDate in Solr
> ------------------------------------------
>
> Key: NUTCH-985
> URL: https://issues.apache.org/jira/browse/NUTCH-985
> Project: Nutch
> Issue Type: Bug
> Components: indexer
> Reporter: Dietrich Schmidt
> Attachments: indexlastmodifieddate.jar
>
>
> I am using the index-more plugin to parse the lastModified data in web
> pages in order to store it in a Solr data field.
> In solrindex-mapping.xml I am mapping lastModified to a field "changed" in
> Solr:
> <field dest="changed" source="lastModified"/>
> However, when posting data to Solr the SolrIndexer posts it as a long,
> not as a date:
> <add><doc boost="1.0">
> <field name="changed">1079326800000</field>
> <field name="tstamp">20110414144140188</field>
> <field name="date">20040315</field>
> Solr rejects the data because of the improper data type.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira