Microsoft Project (MPP) basic support
-------------------------------------
Key: TIKA-789
URL: https://issues.apache.org/jira/browse/TIKA-789
Project: Tika
Issue Type: New Feature
Components: parser
Affects Versions: 1.0
Reporter: Nick Burch
Assignee: Nick Burch
The Microsoft Project file format (MPP) could fairly easily be better supported
by Tika. Gaps to fill are:
* Correct mimetype definition (it's OLE2 based)
* OLE2 detection for MPP
* Common OLE2 metadata extraction
For fuller support (such as text contents), we'd probably want a parser which
used MPXJ. However, as MPXJ is LGPL, it'd need to be an external 3rd party
parser. (MPXJ is based on top of POI, but it's under a more copyleft license.
POI itself doesn't have MPP support)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira