Tim,
I opened issue TIKA-2069 https://issues.apache.org/jira/browse/TIKA-2069
Regarding support for Microsoft Office macro-enabled files, where is it
stated that extraction is only metadata and content? If it isn't stated,
it would be a nice addition to the files supported web section.
Cheers,
Jeff
On 9/7/2016 8:42 PM, Allison, Timothy B. wrote:
I don't think we're extracting macros at this point. Any chance you could open
a ticket with example files/unit tests/patch :)
Cheers,
Tim
-----Original Message-----
From: Jeff Swindle [mailto:[email protected]]
Sent: Wednesday, September 7, 2016 5:31 PM
To: [email protected]
Subject: Extract macro content from Microsoft Office macro enabled files
I did a quick search of the tika user and dev mailing lists back 2 years and a
Google search and didn't come up with an answer.
Is there a configuration setting that allows the macros to be extracted from
Microsoft Office macro enabled files?
I used tika-app-1.13.jar to extract from a Word.docm and Excel.xlsm files. I
get the metadata and content but neither contain the macro embedded in the
files.
Cheers,
Jeff