[ https://issues.apache.org/jira/browse/OOZIE-426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13900621#comment-13900621 ]
Robert Kanter commented on OOZIE-426:
-------------------------------------
Kevin is correct. If the WAR file is ever redeployed, you'll lose anything you
changed in WEB-INF/lib. In the tarball installation, the supported and
documented way to add extra jars is to put them in the libext directory and run
the {{oozie-setup.sh prepare-war}} command, which repackages them into the WAR
file (this is how the hadoop jars and extjs zip are added).
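As a rough sketch of the libext workflow described above: the helper name {{add_oozie_jars}} is hypothetical, and it assumes OOZIE_HOME points at the unpacked tarball with the setup script at bin/oozie-setup.sh. Adjust both to match your install.

```shell
# Hypothetical helper: stage extra jars in libext, then rebuild the WAR.
# Assumes OOZIE_HOME points at the unpacked Oozie tarball installation.
add_oozie_jars() {
    oozie_home="${OOZIE_HOME:?set OOZIE_HOME to the Oozie install dir}"

    # Create libext if it does not exist yet.
    mkdir -p "$oozie_home/libext" || return 1

    # Copy each jar given on the command line into libext.
    for jar in "$@"; do
        cp "$jar" "$oozie_home/libext/" || return 1
    done

    # Repackage the WAR so it picks up everything in libext.
    "$oozie_home/bin/oozie-setup.sh" prepare-war
}
```

With OOZIE_HOME exported, you would call it as {{add_oozie_jars /path/to/extra.jar}} (the path is a placeholder). Because the jars live in libext rather than WEB-INF/lib, rerunning prepare-war after a redeploy picks them up again.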
I know that CDH has a /var/lib/oozie dir that is automatically included when
starting Oozie, so in CDH you can simply add extra jars there without having to
worry about the WAR file. I'm not a packaging expert, but CDH uses Bigtop for
packaging, so Bigtop likely works the same way; [~mackrorysd] would know for
sure.
To make using S3 with Oozie easier for users, all we need to do is update the
hadooplibs pom files to depend on whatever includes the jets3t jar. That way,
when users put the hadoop libs into libext and run oozie-setup.sh to prepare
the WAR file, it will include the jets3t jar.
> OOZIE-9: Support (Amazon) s3/s3n schemes
> ----------------------------------------
>
> Key: OOZIE-426
> URL: https://issues.apache.org/jira/browse/OOZIE-426
> Project: Oozie
> Issue Type: New Feature
> Reporter: Hadoop QA
> Assignee: Roman Shaposhnik
>
> Currently Oozie supports only the 'hdfs' scheme. For people running their
> Hadoop cluster in Amazon EC2 and using S3 for storage, it would be very
> useful if Oozie supported the 's3n/s3' scheme(s). The use case I am talking
> about is as follows:
> - Hadoop cluster in Amazon EC2
> - Uses hdfs for intermediate storage
> - Uses s3 for getting input for, and storing output of, map-reduce jobs.
> More details on the above use case, and the exceptions/failures I have seen,
> are documented here (http://tech.groups.yahoo.com/group/Oozie-users/message/1138).
> There can be other use cases as well, e.g. using s3 as the DFS instead of HDFS.