[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15511744#comment-15511744 ]
Hudson commented on TIKA-2069: ------------------------------ FAILURE: Integrated in Jenkins build tika-2.x-windows #50 (See [https://builds.apache.org/job/tika-2.x-windows/50/]) TIKA-2069 -- extract macros from MSOffice files. (tallison: rev 66f433471f59d5af931f0a49bf8bddd33a7f27a7) * (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/AbstractOOXMLExtractor.java * (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java * (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSLFPowerPointExtractorDecorator.java * (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFExcelExtractorDecorator.java * (add) tika-test-resources/src/test/resources/test-documents/testWORD_macros.doc * (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/OfficeParser.java * (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/PowerPointParserTest.java * (add) tika-test-resources/src/test/resources/test-documents/testEXCEL_macro.xlsm * (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java * (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java * (add) tika-test-resources/src/test/resources/test-documents/testPPT_macros.pptm * (edit) CHANGES.txt * (add) tika-test-resources/src/test/resources/test-documents/testEXCEL_macro.xls * (add) tika-test-resources/src/test/resources/test-documents/testPPT_macros.ppt * (edit) tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java * (add) tika-test-resources/src/test/resources/test-documents/testWORD_macros.docm > Extract Macro text from Microsoft Office documents > -------------------------------------------------- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser > Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat > Reporter: Jeff Swindle > Labels: features > Fix For: 2.0, 1.14 > > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)