Hi,
We have solved our problem in a different way.
The file \org\apache\jackrabbit\core\query\lucene\tika-config.xml
is located in jackrabbit-core.jar.
We added this section:
<parser class="org.apache.tika.parser.EmptyParser">
<mime>application/vnd.openxmlformats-officedocument.spreadsheetml.sheet</mime>
</parser>
And commented out this section:
<parser name="parse-pdf" class="org.apache.tika.parser.pdf.PDFParser">
<mime>application/pdf</mime>
</parser>
Is there a way to configure this without editing the jar? Otherwise we will
have to change tika-config.xml manually each time we make a build. Maybe
your suggestion about parameters in workspace.xml will solve the problem?
Regards,
Anton
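
On the question of avoiding the manual edit on every build: a minimal sketch,
assuming the SearchIndex class in your Jackrabbit version accepts a
tikaConfigPath parameter (I believe 2.x does, but please verify against your
release). The location ${wsp.home}/tika-config.xml is only an illustration,
not a required path. The idea is to keep a modified copy of tika-config.xml
outside jackrabbit-core.jar and point the workspace at it.

```xml
<!-- Sketch only: tikaConfigPath support and the file location are assumptions to verify -->
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <param name="path" value="${wsp.home}/index"/>
  <!-- Custom copy of tika-config.xml with the EmptyParser entry added
       and the PDFParser entry commented out -->
  <param name="tikaConfigPath" value="${wsp.home}/tika-config.xml"/>
</SearchIndex>
```

If this works in your version, the jar stays untouched and the customized
file survives rebuilds.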
One more thing ...
If you have problems starting Jackrabbit, you could add the following to the
workspace.xml of the failing workspace:
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
...
<param name="forceConsistencyCheck" value="true"/>
<param name="autoRepair" value="true"/>
<param name="onWorkspaceInconsistency" value="log"/>
...
</SearchIndex>
See also:
https://issues.apache.org/jira/browse/JCR-2651
greets
claus