Reto Bachmann-Gmür created STANBOL-762:
------------------------------------------
Summary: XMP Extractor Engine
Key: STANBOL-762
URL: https://issues.apache.org/jira/browse/STANBOL-762
Project: Stanbol
Issue Type: Improvement
Reporter: Reto Bachmann-Gmür
Assignee: Reto Bachmann-Gmür
Many file formats (images, pdfs, videos) may contain XMP metadata. The XMP
syntax is a subset of RDF/XML and can thus be parsed as RDF. While some of this
data is aleready extracted by the tika engine the XMP block often contains more
information. As the relevance of this information depends on usage scenarios a
dedicated xmpextractor engine should be created so that clients can decide if
they want that engine in the chain or not.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira