Hi, On 7/24/07, Ishai Borovoy <[EMAIL PROTECTED]> wrote:
Is it possible to perform full text search on unstructured node type that contains binary file/s (e.g.: word,pdf,excel)?
See https://issues.apache.org/jira/browse/JCR-729 for a related feature request. Currently Jackrabbit only indexes binary "jcr:data" properties that have a sibling "jcr:mimeType" property that indicates the relevant mime type. There is currently no active effort to implement JCR-729, I guess we will do that once the incubating Tika project (http://incubator.apache.org/tika/) or some other project comes up with a generic library that allows us to avoid having to deal with all the complexities of automatic mime type detection and various different parser libraries. BR, Jukka Zitting
