Raimund Merkert created TIKA-1047:
-------------------------------------
Summary: Provide a JAX-RS to detect only the mediatype
Key: TIKA-1047
URL: https://issues.apache.org/jira/browse/TIKA-1047
Project: Tika
Issue Type: Wish
Components: server
Reporter: Raimund Merkert
Priority: Critical
Currently I can use the JAX-RS server to detect the mediatype using the meta
endpoint. The problem I have with this is that I need to send the entire
document to get all metadata.
To detect the mediatype, only a few bytes are often necessary and so I'd like
to only send, say 8K or so, to the server and let it tell me the mediatype.
In order to accomplish this, it would be good to modify the /meta endpoint to
address the individual fields that might be returned:
/meta/mediatype
/meta/author
/meta/lastModified
The parts currently following the path could be turned into a query parameter,
which I think is more appropriate anyways (also easier to manipulate with tools
like jquery).
If sufficient data is not available, I'd just return with a BAD_REQUEST.
If this would be of interest to TIKA, I think I could possibly implement this.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira