[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15511744#comment-15511744
 ] 

Hudson commented on TIKA-2069:
------------------------------

FAILURE: Integrated in Jenkins build tika-2.x-windows #50 (See 
[https://builds.apache.org/job/tika-2.x-windows/50/])
TIKA-2069 -- extract macros from MSOffice files. (tallison: rev 
66f433471f59d5af931f0a49bf8bddd33a7f27a7)
* (edit) 
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/AbstractOOXMLExtractor.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSLFPowerPointExtractorDecorator.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFExcelExtractorDecorator.java
* (add) 
tika-test-resources/src/test/resources/test-documents/testWORD_macros.doc
* (edit) 
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/OfficeParser.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/PowerPointParserTest.java
* (add) 
tika-test-resources/src/test/resources/test-documents/testEXCEL_macro.xlsm
* (edit) 
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
* (add) 
tika-test-resources/src/test/resources/test-documents/testPPT_macros.pptm
* (edit) CHANGES.txt
* (add) 
tika-test-resources/src/test/resources/test-documents/testEXCEL_macro.xls
* (add) tika-test-resources/src/test/resources/test-documents/testPPT_macros.ppt
* (edit) 
tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
* (add) 
tika-test-resources/src/test/resources/test-documents/testWORD_macros.docm


> Extract Macro text from Microsoft Office documents
> --------------------------------------------------
>
>                 Key: TIKA-2069
>                 URL: https://issues.apache.org/jira/browse/TIKA-2069
>             Project: Tika
>          Issue Type: Improvement
>          Components: detector, parser
>    Affects Versions: 1.13
>         Environment: RHEL 5.x, Apache Tomcat
>            Reporter: Jeff Swindle
>              Labels: features
>             Fix For: 2.0, 1.14
>
>         Attachments: excel-macro.PNG, test-macro-doc.docm, 
> test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, 
> xlsmacro.xlsm.tika-app-output.txt
>
>
> Tika supports macro-enabled Microsoft Office documents by extracting metadata 
> and contents, however, macros within the document are not in the metadata or 
> content output.
> Desire is to have the macro text extracted also.
> Info regarding macro extraction: http://www.decalage.info/vba_tools



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to