Re: textFilterClasses deprecated. How to specify extractors?

Mark Herman Thu, 05 Apr 2012 17:53:20 -0700

Robert Siska wrote
> 
> How does it know which binary files it should index when I'm not
> specifying any extractors? How can I disable/enable them?
>


I'm not an expert, but what I do know is that JR uses Tika to extract text,
and it decides how to extract based on the jcr:mimeType property. If you
don't supply a mimetype, it won't know how to extract the text (although I
wouldn't recommend that as a practice). I believe there is a way to supply
JR with a Tika config that might give you what you want.
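
For illustration, here is a rough sketch of how the search index might be
pointed at a custom Tika config. It assumes a Jackrabbit 2.x SearchIndex
that accepts a tikaConfigPath parameter; the file name tika-config.xml and
its location are placeholders, so please check the parameter against the
documentation for your version before relying on it.

```xml
<!-- workspace.xml (or the Workspace template in repository.xml) -->
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <param name="path" value="${wsp.home}/index"/>
  <!-- Assumption: tikaConfigPath is supported by your Jackrabbit version.
       tika-config.xml is a hypothetical file name. -->
  <param name="tikaConfigPath" value="${wsp.home}/tika-config.xml"/>
</SearchIndex>
```

The Tika config file would then be the place to limit which parsers are
used for which mime types.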

Additionally, you can specify an indexing config in the repository/workspace
XML files, where you can set rules on what gets indexed by Lucene and how.
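
As a minimal sketch of what that could look like: the SearchIndex element
gets an indexingConfiguration parameter pointing at a separate rules file.
The file name indexing_configuration.xml and the property name Text are
only examples, not something taken from your setup.

```xml
<!-- workspace.xml: point the search index at the rules file -->
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <param name="path" value="${wsp.home}/index"/>
  <param name="indexingConfiguration"
         value="${wsp.home}/indexing_configuration.xml"/>
</SearchIndex>
```

```xml
<?xml version="1.0"?>
<!DOCTYPE configuration SYSTEM "http://jackrabbit.apache.org/dtd/indexing-configuration-1.0.dtd">
<!-- indexing_configuration.xml (example file name) -->
<configuration xmlns:nt="http://www.jcp.org/jcr/nt/1.0">
  <!-- For nt:unstructured nodes, index only the listed property.
       "Text" is a hypothetical property name. -->
  <index-rule nodeType="nt:unstructured">
    <property>Text</property>
  </index-rule>
</configuration>
```

With a rule like this, properties that are not listed for the matching node
type should be left out of the index, so it is one way to keep binary
content from being indexed for those nodes.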


--
View this message in context: 
http://jackrabbit.510166.n4.nabble.com/textFilterClasses-deprecated-How-to-specify-extractors-tp4534050p4536443.html
Sent from the Jackrabbit - Users mailing list archive at Nabble.com.
