[ https://issues.apache.org/jira/browse/JCR-728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12469982 ]
Jukka Zitting commented on JCR-728:
-----------------------------------

Thanks for the POI update! A commons project for MIME type detection seems a nice prospect. I'll try to come up with something along these lines in the near future.

> Automatic MIME type detection
> -----------------------------
>
>                 Key: JCR-728
>                 URL: https://issues.apache.org/jira/browse/JCR-728
>             Project: Jackrabbit
>          Issue Type: Improvement
>          Components: indexing
>            Reporter: Jukka Zitting
>            Priority: Minor
>
> Currently only the jcr:mimeType property is used to determine the MIME type
> and thus the applicable text extractor to use for indexing a document. If the
> jcr:mimeType property is not available or is set to a generic value like
> "application/octet-stream", then the indexer could also use some heuristics
> based on the node name or magic numbers within the binary stream to determine
> the type of the document.
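
The fallback described in the issue could look roughly like the sketch below. It is only an illustration under stated assumptions, not Jackrabbit code: the class name MimeTypeSniffer, the detect() signature, the tiny extension table, and the order of the checks (magic numbers before node name) are all hypothetical. Only the PDF and ZIP signatures are checked, and the caller is assumed to pass the first few bytes of the binary stream.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper (not part of Jackrabbit): falls back to heuristics
// when jcr:mimeType is missing or set to a generic value.
final class MimeTypeSniffer {

    static final String GENERIC = "application/octet-stream";

    // Illustrative extension table; a real one would be far larger.
    private static final Map<String, String> BY_EXTENSION = new HashMap<>();
    static {
        BY_EXTENSION.put("txt", "text/plain");
        BY_EXTENSION.put("html", "text/html");
        BY_EXTENSION.put("pdf", "application/pdf");
        BY_EXTENSION.put("doc", "application/msword");
    }

    // Well-known file signatures ("magic numbers").
    private static final byte[] PDF_MAGIC = "%PDF-".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] ZIP_MAGIC = {'P', 'K', 3, 4};

    // declaredType: value of jcr:mimeType, or null if the property is absent.
    // nodeName:     name of the node, e.g. "report.pdf" (may be null).
    // head:         first bytes of the binary stream (may be null or empty).
    static String detect(String declaredType, String nodeName, byte[] head) {
        // 1. Trust a specific declared type.
        if (declaredType != null && !declaredType.isEmpty()
                && !GENERIC.equalsIgnoreCase(declaredType)) {
            return declaredType;
        }
        // 2. Look for a known signature at the start of the content.
        if (startsWith(head, PDF_MAGIC)) {
            return "application/pdf";
        }
        if (startsWith(head, ZIP_MAGIC)) {
            return "application/zip";
        }
        // 3. Fall back to the extension of the node name.
        if (nodeName != null) {
            int dot = nodeName.lastIndexOf('.');
            if (dot >= 0 && dot < nodeName.length() - 1) {
                String ext = nodeName.substring(dot + 1).toLowerCase(Locale.ROOT);
                String type = BY_EXTENSION.get(ext);
                if (type != null) {
                    return type;
                }
            }
        }
        // 4. Nothing matched: report the generic type.
        return GENERIC;
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data == null || data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }
}
```

With this sketch, a node named "notes.txt" with no jcr:mimeType and no recognizable signature would be indexed as text/plain, while content starting with "%PDF-" would be treated as application/pdf even if the declared type is application/octet-stream.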