Hi,

On 7/24/07, Ishai Borovoy <[EMAIL PROTECTED]> wrote:
> Is it possible to perform full text search on an unstructured node type
> that contains binary files (e.g. Word, PDF, Excel)?

See https://issues.apache.org/jira/browse/JCR-729 for a related feature request.

Currently Jackrabbit only indexes binary "jcr:data" properties that
have a sibling "jcr:mimeType" property indicating the MIME type of
the content.
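
To make the rule above concrete, here is a rough sketch in plain Java. It
is only a model of the behaviour described in this mail, not Jackrabbit's
actual indexing code: the class name, the method name, and the use of a
Map to stand in for a node's properties are all made up for illustration.

```java
import java.util.Map;

// Hypothetical model of the rule described above; not Jackrabbit's code.
final class IndexingRuleSketch {

    // Returns true when the named property would be picked up for
    // full-text extraction under the rule stated in this mail: it must be
    // called "jcr:data", hold binary content, and have a sibling
    // "jcr:mimeType" string property on the same node.
    static boolean isTextExtracted(Map<String, Object> nodeProperties,
                                   String propertyName) {
        if (!"jcr:data".equals(propertyName)) {
            return false; // binary properties under other names are skipped
        }
        Object data = nodeProperties.get("jcr:data");
        Object mimeType = nodeProperties.get("jcr:mimeType");
        return data instanceof byte[] && mimeType instanceof String;
    }
}
```

Going by the rule as stated, a binary stored under some other property
name, or stored as "jcr:data" without a "jcr:mimeType" sibling, would not
be searchable with full text queries.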

There is currently no active effort to implement JCR-729. I guess we
will do that once the incubating Tika project
(http://incubator.apache.org/tika/) or some other project comes up
with a generic library that saves us from having to deal with all
the complexities of automatic MIME type detection and the various
parser libraries ourselves.

BR,

Jukka Zitting
