[
https://issues.apache.org/jira/browse/TIKA-2302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943467#comment-15943467
]
Hudson commented on TIKA-2302:
------------------------------
FAILURE: Integrated in Jenkins build tika-2.x-windows #186 (See
[https://builds.apache.org/job/tika-2.x-windows/186/])
TIKA-2302 -- make macro extraction configurable and set default to false
(tallison: rev 1826112e6c3bfd4001cef896279263ccbe0a1923)
* (edit)
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/OOXMLExtractorFactory.java
* (add)
tika-test-resources/src/test/resources/org/apache/tika/parser/microsoft/tika-config-sax-docx.xml
* (edit)
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
* (edit)
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/AbstractOfficeParser.java
* (edit)
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java
* (add)
tika-test-resources/src/test/resources/org/apache/tika/parser/microsoft/ooxml/tika-config-sax-macros.xml
* (edit)
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/OfficeParserConfig.java
* (add)
tika-test-resources/src/test/resources/org/apache/tika/parser/microsoft/tika-config-macros.xml
* (edit)
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/AbstractOOXMLExtractor.java
* (edit)
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/PowerPointParserTest.java
* (edit)
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java
* (add)
tika-test-resources/src/test/resources/org/apache/tika/parser/microsoft/ooxml/tika-config-dom-macros.xml
* (edit)
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/SXWPFExtractorTest.java
* (edit)
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/SXSLFExtractorTest.java
* (edit) CHANGES.txt
* (edit)
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/OfficeParser.java
> Make handling of macros equivalent btwn VBA in MSOffice and JS in PDFs
> ----------------------------------------------------------------------
>
> Key: TIKA-2302
> URL: https://issues.apache.org/jira/browse/TIKA-2302
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
>
> The current default behavior is to extract VBA macros from MSOffice files but
> not to extract JS from PDFs. Now that we have a config for MSOffice files, I
> propose changing the default behavior to NOT extract VBA macros from MSOffice
> files. Users can opt in to extraction of macros via configuration.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)