Reto Bachmann-Gmür created STANBOL-762:
------------------------------------------

             Summary: XMP Extractor Engine
                 Key: STANBOL-762
                 URL: https://issues.apache.org/jira/browse/STANBOL-762
             Project: Stanbol
          Issue Type: Improvement
            Reporter: Reto Bachmann-Gmür
            Assignee: Reto Bachmann-Gmür


Many file formats (images, pdfs, videos) may contain XMP metadata. The XMP 
syntax is a subset of RDF/XML and can thus be parsed as RDF. While some of this 
data is aleready extracted by the tika engine the XMP block often contains more 
information. As the relevance of this information depends on usage scenarios a 
dedicated xmpextractor engine should be created so that clients can decide if 
they want that engine in the chain or not.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to