Hi Katia,

Katia Santos wrote:
I´m trying to search in PDF binary content, the text is being extracted, but
when I do the query, I get no results :(
Do anyone has the same problem, or anyone knows what the problem is?

my query is:

//*[jcr:contains(.,'myword')]

Did you set the testFilterClasses parameter in your workspace.xml? Please also make sure you put all depending jar files into your classpath.

Here's a list of supported classes and the corresponding mime types that are recognized:
http://jackrabbit.apache.org/jackrabbit-text-extractors.html

See also query section in the FAQ:
http://jackrabbit.apache.org/frequently-asked-questions.html

regards
 marcel

I have another problem....When the text is being extracted, in xls, odt,
odp, and ods files  works fine, but in pdf, xml, txt, rtf , doc, ppt doesnt
:(
No text is extracted in this last file types. If some one could help me wiht
that...

Thanks


Reply via email to