[ https://issues.apache.org/jira/browse/SOLR-3707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13428489#comment-13428489 ]
Jan Høydahl edited comment on SOLR-3707 at 8/4/12 12:09 AM:
------------------------------------------------------------

Patch for trunk upgrading to Tika 1.2.

There are two new JARs included:
* xz-1.0.jar for more compression formats
* juniversalchardet-1.0.3.jar for new charset detection

We have also removed two unused JARs:
* scannotation-1.0.2.jar
* javassist-3.6.0.GA.jar

Tests pass after updating some tests to ignore the extra metadata fields parsed out by the enhanced metadata parser in Tika 1.2.

> Upgrade Solr to Tika 1.2
> ------------------------
>
>         Key: SOLR-3707
>         URL: https://issues.apache.org/jira/browse/SOLR-3707
>         Project: Solr
>         Issue Type: Improvement
>         Components: contrib - LangId, contrib - Solr Cell (Tika extraction)
>         Reporter: Jan Høydahl
>         Assignee: Jan Høydahl
>         Fix For: 4.0, 5.0
>
>         Attachments: SOLR-3707.patch
>
>
> Tika 1.2 has been released with these improvements:
> http://tika.apache.org/1.2/index.html