[
https://issues.apache.org/jira/browse/NUTCH-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13021759#comment-13021759
]
Dietrich Schmidt commented on NUTCH-985:
----------------------------------------
Markus,
the source code is the JAR. It's a custom plugin (not a hacked
MoreIndexingFilter) that I use as a workaround. The code also might be useful
to demonstrate how to properly format the date for Solr. It has been tested
with hundreds of thousands of web pages.
> Problems indexing lastModifiedDate in Solr
> ------------------------------------------
>
> Key: NUTCH-985
> URL: https://issues.apache.org/jira/browse/NUTCH-985
> Project: Nutch
> Issue Type: Bug
> Components: indexer
> Reporter: Dietrich Schmidt
> Attachments: indexlastmodifieddate.jar
>
>
> I am using the index-more plugin to parse the lastModified data in web
> pages in order to store it in a Solr data field.
> In solrindex-mapping.xml I am mapping lastModified to a field "changed" in
> Solr:
> <field dest="changed" source="lastModified"/>
> However, when posting data to Solr the SolrIndexer posts it as a long,
> not as a date:
> <add><doc boost="1.0"><field
> name="changed">1079326800000</field><field
> name="tstamp">20110414144140188</field><field
> name="date">20040315</field>
> Solr rejects the data because of the improper data type.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira