[
https://issues.apache.org/jira/browse/CONNECTORS-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14711740#comment-14711740
]
Karl Wright commented on CONNECTORS-1233:
-----------------------------------------
Yes, in general we try to avoid duplicating functionality wherever possible.
Since 1.7, we've also tried hard to make sure common functionality appears in
transformation connectors, rather than repository or output connectors. So if
you can explore using the Tika Extractor to extract your content and metadata,
that would be helpful. If it doesn't work for you, we should figure out why
not.
> AmazonS3 Repository Connector
> -----------------------------
>
> Key: CONNECTORS-1233
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1233
> Project: ManifoldCF
> Issue Type: New Feature
> Reporter: Gunaratnam Kuhajeyan
> Assignee: Karl Wright
> Labels: features
> Attachments: amazons3patch.diff
>
> Original Estimate: 240h
> Remaining Estimate: 240h
>
> Feature Patch
> AmazonS3 Repository Connector
> AmazonS3 Repository Connector
> A. Overview
> 1. Connects to Amazons3 buckets, and indexes the artifact. if any buckets to
> be avoided it can be skipped ( it can be configured in job)
> 2. Internally documents are parsed and meta data are extracted using Tika
> 3. Support Locale - English US ( Currently common_en_US.properties,
> available, looking for support from some to do the translation for the keys)
> B. Documentation - Work in progress, will be attached issue on the following
> days
> C. Dependencies - (common-lib)
> 1. aws-java-sdk-{version}.jar
> 2. aws-java-sdk-core-{version}.jar
> 3. aws-java-sdk-s3-{version}.jar
> 4. joda-time-2.2.jar
> D. Connectors.xml
> <!-- Add your authority connectors here -->
> <authorityconnector name="Amazons3"
> class="org.apache.manifoldcf.authorities.authorities.amazons3.AmazonS3Authority"/>
>
> <!-- Add your repository connectors here -->
> <repositoryconnector name="AmazonS3"
> class="org.apache.manifoldcf.crawler.connectors.amazons3.AmazonS3Connector"/>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)