On Fri, 11 Nov 2011, Swapna Vuppala wrote:
Am using Tika to index .msg files of Outlook. It has been working very
good for me but am facing problem while indexing some .msg files. The
indexing fails with the below Solr exception
SEVERE: org.apache.solr.common.SolrException: Invalid Date String:' Fri, 14 Oct
2011 12:35:51 +0100'
at org.apache.solr.schema.DateField.parseMath(DateField.java:165)
at
org.apache.solr.schema.TrieField.createField(TrieField.java:387)
at
org.apache.solr.schema.TrieDateField.createField(TrieDateField.java:120)
at
org.apache.solr.schema.SchemaField.createField(SchemaField.java:104)
at
org.apache.solr.update.DocumentBuilder.addField(DocumentBuilder.java:203)
This would look to be a SOLR issue. Tika will try to return an ISO-8601
date string where it can, but sometimes it gets a random String.
Downstream apps will need to handle this
Nick