[ https://issues.apache.org/jira/browse/TIKA-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-232. -------------------------------- Resolution: Duplicate Assignee: Jukka Zitting With TIKA-238 resolved, the former case above is now the default: Parser parser = new ZipParser(); And the latter case is much simpler: TikaConfig config = TikaConfig.getDefaultConfig(); // without a delegate parser Parser parser = new AutoDetectParser(config); Resolving this as a Duplicate of TIKA-238. > Scanning of archive files > ------------------------- > > Key: TIKA-232 > URL: https://issues.apache.org/jira/browse/TIKA-232 > Project: Tika > Issue Type: New Feature > Components: parser > Affects Versions: 0.3 > Environment: All > Reporter: Karl Heinz Marbaise > Assignee: Jukka Zitting > Priority: Minor > > If i parse an archive all the files inside the archive will be extracted with > their text as well. It would be nice to have the choice to extract only the > list of files (directory) of an archive instead of extracting the whole > contents. This seemed to be usable only for zip, tar, tar.gz, tar.bz2, .jar. > May be this could be realized by using a different calling or by a run time > configuration. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.