[
https://issues.apache.org/jira/browse/TIKA-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13853845#comment-13853845
]
Vikram edited comment on TIKA-1212 at 12/20/13 10:23 AM:
---------------------------------------------------------
Please find the attached standalone program. You will need to change the
package declaration. I am using Tika 1.4. I use this program only to produce
the output; that output is already included in the other attached files.
Note: Please refer to the attached abc.zip, as the output information relates
to that file.
Moreover, you can refer to the following page:
http://wiki.apache.org/tika/RecursiveMetadata#Main_from_Jukka.27s_Example
was (Author: vikrama):
Please find the attached standalone program. You will need to change the
package declaration. I am using Tika 1.4. I use this program only to produce
the output; that output is already included in the other attached files.
Moreover, you can refer to the following page:
http://wiki.apache.org/tika/RecursiveMetadata#Main_from_Jukka.27s_Example
> Recursive Extraction of Archive File
> ------------------------------------
>
> Key: TIKA-1212
> URL: https://issues.apache.org/jira/browse/TIKA-1212
> Project: Tika
> Issue Type: Bug
> Reporter: Vikram
> Priority: Critical
> Attachments: RecursiveMetadataParserZukka.java, TIKA-Output.xlsx,
> abc.zip, abc.zip
>
>
> Please refer to the code:
> http://wiki.apache.org/tika/RecursiveMetadata#Main_from_Jukka.27s_Example
> Requirement:
> -----------------
> abc.zip
> ---> a.doc
> ---> b.xls
> ---> pqr.zip
> -------------> m.ppt
> There are two issues with Tika:
> 1. How can the separate extraction of embedded documents be optionally
> blocked?
> 2. When I extract recursively, the file name (or resourceKeyName) is not
> reported properly. For example:
> --> a.doc should have the value abc.zip/a.doc, and similarly for b.xls. This
> is fine, BUT m.ppt has the resource file name pqr/m.ppt, which is WRONG. It
> should have the value abc.zip/pqr.zip/m.ppt.
> --> For embedded documents, only a random name is reported, without even the
> proper file path.
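To make the expected naming in point 2 concrete, here is a minimal sketch that
walks a nested zip and reports every entry with its full chain of containers.
It deliberately uses only java.util.zip and is not Tika's API or the wiki
example; the class NestedZipPaths and the methods listPaths, collect and zip are
hypothetical names, and detecting a nested archive by its ".zip" extension is a
simplification (a real parser would detect the type from the content).

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

class NestedZipPaths {

    // Returns the full path of every file entry, e.g. abc.zip/pqr.zip/m.ppt.
    public static List<String> listPaths(String rootName, InputStream in) throws IOException {
        List<String> out = new ArrayList<>();
        collect(rootName, in, out);
        return out;
    }

    // Carries the accumulated container path down into each nested archive.
    static void collect(String parentPath, InputStream in, List<String> out) throws IOException {
        // Not closed here: the caller owns the underlying stream.
        ZipInputStream zin = new ZipInputStream(in);
        ZipEntry entry;
        while ((entry = zin.getNextEntry()) != null) {
            if (entry.isDirectory()) {
                continue;
            }
            String fullPath = parentPath + "/" + entry.getName();
            out.add(fullPath);
            // Assumption: a ".zip" suffix marks a nested archive.
            if (entry.getName().toLowerCase().endsWith(".zip")) {
                // Buffer the nested archive, then recurse with the full path as the new parent.
                ByteArrayOutputStream nested = new ByteArrayOutputStream();
                byte[] buf = new byte[8192];
                int n;
                while ((n = zin.read(buf)) != -1) {
                    nested.write(buf, 0, n);
                }
                collect(fullPath, new ByteArrayInputStream(nested.toByteArray()), out);
            }
        }
    }

    // Demo helper: builds an in-memory zip from entry names and contents.
    public static byte[] zip(Map<String, byte[]> entries) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zout = new ZipOutputStream(bytes)) {
            for (Map.Entry<String, byte[]> e : entries.entrySet()) {
                zout.putNextEntry(new ZipEntry(e.getKey()));
                zout.write(e.getValue());
                zout.closeEntry();
            }
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Rebuilds the layout from the issue with empty placeholder files.
        Map<String, byte[]> inner = new LinkedHashMap<>();
        inner.put("m.ppt", new byte[0]);
        Map<String, byte[]> outer = new LinkedHashMap<>();
        outer.put("a.doc", new byte[0]);
        outer.put("b.xls", new byte[0]);
        outer.put("pqr.zip", zip(inner));
        for (String path : listPaths("abc.zip", new ByteArrayInputStream(zip(outer)))) {
            System.out.println(path);
        }
    }
}
```

The key design choice is that the parent path is passed down on each recursive
call instead of being derived from the nested container alone, which is what
would yield abc.zip/pqr.zip/m.ppt rather than pqr/m.ppt for the layout above.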
--
This message was sent by Atlassian JIRA
(v6.1.4#6159)