[ https://issues.apache.org/jira/browse/TIKA-2024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15354182#comment-15354182 ]

Hudson commented on TIKA-2024:
------------------------------

SUCCESS: Integrated in tika-2.x #116 (See [https://builds.apache.org/job/tika-2.x/116/])
TIKA-2024 extract original file name/path where possible, take 1 (tallison: rev e62f2305783763aad0a2c587f96b162ae4be1c36)
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/apple/AppleSingleFileParser.java
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/OfficeParser.java
* tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/AbstractOOXMLExtractor.java
* tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java
* tika-test-resources/src/test/resources/test-documents/testExcel_embeddedPDF.xlsx
* tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/WordExtractor.java
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/rtf/RTFObjDataParser.java
* tika-test-resources/src/test/resources/test-documents/testPPT_EmbeddedPDF.ppt
* tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/xml/XML2003ParserTest.java
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/JackcessExtractor.java
* tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/apple/AppleSingleFileParserTest.java
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/xml/AbstractXML2003Parser.java
* tika-test-resources/src/test/resources/test-documents/testExcel_embeddedPDF.xls
* tika-test-resources/src/test/resources/test-documents/testAppleSingleFile.pdf
* tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/xml/WordMLParser.java
* tika-parser-modules/tika-parser-pdf-module/src/main/java/org/apache/tika/parser/pdf/AbstractPDF2XHTML.java
* tika-test-resources/src/test/resources/test-documents/testPPT_EmbeddedPDF.pptx


> Extract original filename/path when possible
> --------------------------------------------
>
>                 Key: TIKA-2024
>                 URL: https://issues.apache.org/jira/browse/TIKA-2024
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>
> Several file formats include original file names or original file paths for 
> themselves or for embedded documents.  Let's extract that information.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
