Re: Search in binary Content

Marcel Reutegger Wed, 05 Mar 2008 06:56:19 -0800

Hi Katia,

Katia Santos wrote:

I'm trying to search in PDF binary content. The text is being extracted, but
when I run the query, I get no results :(
Does anyone have the same problem, or does anyone know what the problem is?


my query is:

//*[jcr:contains(.,'myword')]

Did you set the textFilterClasses parameter in your workspace.xml? Please also make sure you put all the jar files the extractors depend on into your classpath.
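
For illustration, here is a minimal sketch of what such a setting might look like inside the SearchIndex element of workspace.xml. The parameter is spelled textFilterClasses in Jackrabbit's configuration. The extractor class names and the index path below are assumptions based on the jackrabbit-text-extractors module, so check them against the list of supported classes for your version:

```xml
<!-- Sketch only: class names and path are assumptions, verify them for your Jackrabbit version -->
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <param name="path" value="${wsp.home}/index"/>
  <!-- Comma-separated list of text extractor classes used for binary properties -->
  <param name="textFilterClasses"
         value="org.apache.jackrabbit.extractor.PlainTextExtractor,org.apache.jackrabbit.extractor.MsWordTextExtractor,org.apache.jackrabbit.extractor.PdfTextExtractor"/>
</SearchIndex>
```

Changing this parameter probably only affects content indexed afterwards, so documents stored earlier may have to be re-indexed before they show up in query results.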

Here's a list of supported classes and the corresponding mime types that are recognized:

http://jackrabbit.apache.org/jackrabbit-text-extractors.html

See also the query section in the FAQ:
http://jackrabbit.apache.org/frequently-asked-questions.html

regards
 marcel

I have another problem: text extraction works fine for xls, odt, odp, and ods
files, but not for pdf, xml, txt, rtf, doc, or ppt files :(
No text is extracted from these latter file types. If someone could help me
with that...

Thanks
