[
https://issues.apache.org/jira/browse/CONNECTORS-118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13015019#comment-13015019
]
Karl Wright commented on CONNECTORS-118:
----------------------------------------
This ticket is stalled.
The driver behind it was being able to support a feature that Aperture has.
The way it would need to be done in ManifoldCF is to have individual connectors
deal with the feature. Each connector that supports it would know how to
generate a specialized URL which referred to the archive contents, and the
document identifiers for such connectors would also need to be changed to be
able to represent archive contents as well. The connectors under consideration
would be the file system connector, the JCIFS connector, and the Web connector.
> Crawled archive files should be expanded into their constituent files
> ---------------------------------------------------------------------
>
> Key: CONNECTORS-118
> URL: https://issues.apache.org/jira/browse/CONNECTORS-118
> Project: ManifoldCF
> Issue Type: New Feature
> Components: Framework crawler agent
> Reporter: Jack Krupansky
>
> Archive files such as zip, mbox, tar, etc. should be expanded into their
> constituent files during crawling of repositories so that any output
> connector would output the flattened archive.
> This could be an option, defaulted to ON, since someone may want to implement
> a "copy" connector that maintains crawled files as-is.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira