[ https://issues.apache.org/jira/browse/JCR-728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12469982 ]
Jukka Zitting commented on JCR-728:
-----------------------------------
Thanks for the POI update!
A commons project for MIME type detection seems like a nice prospect. I'll try
to come up with something along these lines in the near future.
> Automatic MIME type detection
> -----------------------------
>
> Key: JCR-728
> URL: https://issues.apache.org/jira/browse/JCR-728
> Project: Jackrabbit
> Issue Type: Improvement
> Components: indexing
> Reporter: Jukka Zitting
> Priority: Minor
>
> Currently only the jcr:mimeType property is used to determine the MIME type
> and thus the applicable text extractor to use for indexing a document. If the
> jcr:mimeType property is not available or is set to a generic value like
> "application/octet-stream", then the indexer could also use some heuristics
> based on the node name or magic numbers within the binary stream to determine
> the type of the document.
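
For illustration, the fallback described in the issue could look roughly like
the sketch below. This is not Jackrabbit code: the class name MimeTypeSniffer,
the detect() signature, and the small tables of magic numbers and file
extensions are all assumptions made up for this example. A real implementation
would need a much larger table and would read the leading bytes from the
binary stream itself.

```java
import java.nio.charset.StandardCharsets;
import java.util.Locale;

// Hypothetical helper, not part of Jackrabbit: picks a MIME type for a node.
final class MimeTypeSniffer {

    static final String GENERIC = "application/octet-stream";

    // A few well-known magic numbers (leading bytes of the binary content).
    private static final byte[] PDF = "%PDF-".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] PNG = {(byte) 0x89, 'P', 'N', 'G', 0x0D, 0x0A, 0x1A, 0x0A};
    private static final byte[] ZIP = {'P', 'K', 0x03, 0x04};

    // declared: value of jcr:mimeType (may be null)
    // nodeName: name of the node, e.g. "report.pdf" (may be null)
    // head:     first bytes of the binary stream (may be null or empty)
    static String detect(String declared, String nodeName, byte[] head) {
        // 1. Trust jcr:mimeType unless it is missing or generic.
        if (declared != null && !declared.isEmpty() && !GENERIC.equals(declared)) {
            return declared;
        }
        // 2. Try magic numbers at the start of the content.
        if (startsWith(head, PDF)) return "application/pdf";
        if (startsWith(head, PNG)) return "image/png";
        if (startsWith(head, ZIP)) return "application/zip";
        // 3. Fall back to the extension of the node name.
        String byName = fromName(nodeName);
        return byName != null ? byName : GENERIC;
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data == null || data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) return false;
        }
        return true;
    }

    private static String fromName(String nodeName) {
        if (nodeName == null) return null;
        int dot = nodeName.lastIndexOf('.');
        if (dot < 0 || dot == nodeName.length() - 1) return null;
        String ext = nodeName.substring(dot + 1).toLowerCase(Locale.ROOT);
        switch (ext) {
            case "txt":  return "text/plain";
            case "htm":
            case "html": return "text/html";
            case "xml":  return "application/xml";
            case "pdf":  return "application/pdf";
            case "doc":  return "application/msword";
            default:     return null;
        }
    }
}
```

In this sketch the magic-number check runs before the name check, on the
assumption that the content is a more reliable signal than the node name. The
opposite order would be cheaper, since it avoids opening the stream at all.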