[ https://issues.apache.org/jira/browse/JCR-728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12469564 ]
Paco Avila commented on JCR-728: -------------------------------- Why LGPL is troublesome? Source code using a LGPL library does not have to be LGPL or GPL. A port of libmagic to Java should be nice because there is lots of MIME definitions in its format. And yes, I think that is more useful to add more functionality to jackrabbit-index-filters. By the way some MS Office files thows errors when they are indexed. I know this is a POI issue, but is this project abandoned? There is no updates since 04-08-2004 :( > Automatic MIME type detection > ----------------------------- > > Key: JCR-728 > URL: https://issues.apache.org/jira/browse/JCR-728 > Project: Jackrabbit > Issue Type: Improvement > Components: indexing > Reporter: Jukka Zitting > Priority: Minor > > Currently only the jcr:mimeType property is used to determine the MIME type > and thus the applicable text extractor to use for indexing a document. If the > jcr:mimeType property is not available or is set to a generic value like > "application/octet-stream", then the indexer could also use some heuristics > based on the node name or magic numbers within the binary stream to determine > the type of the document. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.