Re: AW: Jackrabbit indexing in a separate thread

Anton Bachevsky Wed, 22 Feb 2012 04:56:32 -0800

Hi,

We have solved our problem in another way.

There is a file \org\apache\jackrabbit\core\query\lucene\tika-config.xmlthat is located in jackrabbit-core.jar


We added section:
<parser class="org.apache.tika.parser.EmptyParser">
<mime>application/vnd.openxmlformats-officedocument.spreadsheetml.sheet</mime>
</parser>

And commented this section:
<parser name="parse-pdf" class="org.apache.tika.parser.pdf.PDFParser">
<mime>application/pdf</mime>
</parser>

Is there a chance to configure it in another way? Otherwise we will haveto change tika-config.xml manually each time we make a build.. Maybeyour solution about parameters in workspace.xml will solve the problem?


Regards,
Anton

One thing more ...

If you have problems to start jackrabbit you could add following in the 
workspace.xml
in the failing workspace.

<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
...
<param name="forceConsistencyCheck" value="true"/>
<param name="autoRepair" value="true"/>
<param name="onWorkspaceInconsistency" value="log"/>
...


see also
https://issues.apache.org/jira/browse/JCR-2651

greets
claus

Re: AW: Jackrabbit indexing in a separate thread

Reply via email to