Alejandro
(phone typing)

On Sep 30, 2013, at 9:15, Robert Kanter <rkan...@cloudera.com> wrote:

>> 
>>  I am not clear as to what the problem here is.
> 
> Suppose you have a script that starts up your cluster (e.g. it starts HDFS,
> Oozie, JobTracker, etc).  Now the script has to ensure that HDFS has
> finished starting up before starting Oozie; previously, the script could
> start them in any order and also didn't have to wait for any of them to
> finish starting up as there were no dependencies.  You'll have HDFS
> eventually, but not necessarily when Oozie starts.
> 
> 
> thanks
> - Robert
> 
> 
> 
> On Sun, Sep 29, 2013 at 5:42 PM, Rohini Palaniswamy <rohini.adi...@gmail.com
>> wrote:
> 
>>> By adding this dependency on HDFS to be running, you can no longer start
>> your cluster in any order; HDFS has to come before Oozie.  We should have
>> it defer the loading of the jars until it either somehow detects that HDFS
>> is up or at the first job submission.
>> 
>>  I am not clear as to what the problem here is. Even now with 3.3.x and
>> 4.0 if HDFS is not up the Oozie's jobs cannot be submitted to the cluster,
>> nor can users submit jobs to oozie as parsing workflow.xml will fail. What
>> is being made worse with Oozie copying the sharelib jars during startup or
>> before submitting the first job? Even if the admin has to run
>> oozie-setup.sh before starting Oozie, HDFS needs to be up.
>> 
>> Regards,
>> Rohini
>> 
>> 
>> 
>> 
>> On Fri, Sep 27, 2013 at 6:36 PM, Alejandro Abdelnur <t...@cloudera.com
>>> wrote:
>> 
>>> IMO, making a required order of services startup is a no go.
>>> 
>>> 
>>> On Fri, Sep 27, 2013 at 5:10 PM, Robert Kanter <rkan...@cloudera.com>
>>> wrote:
>>> 
>>>> By adding this dependency on HDFS to be running, you can no longer
>> start
>>>> your cluster in any order; HDFS has to come before Oozie.  We should
>> have
>>>> it defer the loading of the jars until it either somehow detects that
>>> HDFS
>>>> is up or at the first job submission.
>>>> 
>>>> An admin can put oozie in safe mode after it starts and then issues
>>> series
>>>>> of commands to do the necessary maintenance (install/upgrade/purge)
>> of
>>>>> sharelibs.
>>>> 
>>>> Also, I agree with Alejandro's point earlier: if the purpose of all
>> this
>>> is
>>>> to make this all automatic so that the admin doesn't have to do
>> anything,
>>>> then this won't work; it sounds like more steps for the admin to deal
>>> with.
>>>> 
>>>> 
>>>> What if we just enhanced the oozie-setup.sh script to add the logic for
>>>> temp and staging sharelib stuff and didn't have Oozie try to do
>> anything
>>>> itself.  e.g. the admin wants to transition to a new set of sharelibs,
>>>> they'd run oozie-setup.sh upgrade blah blah.  Or perhaps make the admin
>>>> command handle this logic and do it only when the command is run; then
>> we
>>>> don't have to worry about starting up and it would simplify the HA work
>>>> because the different Oozies won't be changing the sharelib at startup.
>>>> Thoughts?
>>>> 
>>>> - Robert
>>>> 
>>>> 
>>>> On Fri, Sep 27, 2013 at 11:58 AM, Rohini Palaniswamy <
>>>> rohini.adi...@gmail.com> wrote:
>>>> 
>>>>> Basically I am trying to answer the question is what was possible
>>> before
>>>>> when Oozie was up and hdfs is down?
>>>>>  - Before oozie could still get job submissions if hdfs was down.
>> But
>>> we
>>>>> are going to try validate the workflow.xml from hdfs and submission
>> is
>>>>> anyway going to fail. So don't think that is a big issue
>>>>>  - It can't submit new jobs and input dependency checks will anyway
>>>> fail.
>>>>> So that is also not a issue
>>>>>  - Users can continue to query the status of oozie which will now be
>>> not
>>>>> possible as Oozie will be down.
>>>>> 
>>>>> Is there something else that I am missing?
>>>>> 
>>>>> One thing that I can think of to address this is to defer loading to
>>> job
>>>>> submission time if hdfs is down during startup time.
>>>>> 
>>>>> -Rohini
>>>>> 
>>>>> On Fri, Sep 27, 2013 at 11:51 AM, Rohini Palaniswamy <
>>>>> rohini.adi...@gmail.com> wrote:
>>>>> 
>>>>>>> Unless I'm missing something, is not just a missing hadoop conf
>>>> issue,
>>>>>> now
>>>>>> Oozie won't start if HDFS is not running.
>>>>>> 
>>>>>>   Why would this be a issue? If sharelib is in hdfs and that is
>> not
>>>>>> accessible all the jobs would anyway fail and things are not going
>> to
>>>>> work.
>>>>>> 
>>>>>> Regards,
>>>>>> Rohini
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> On Fri, Sep 27, 2013 at 8:08 AM, Alejandro Abdelnur <
>>> t...@cloudera.com
>>>>>> wrote:
>>>>>> 
>>>>>>> afaik, the whole point of these changes was to make things
>>> adminless,
>>>>>>> thus the extra complexity. if we need admin intervention, i'd go
>> to
>>>> the
>>>>>>> previous model.
>>>>>>> 
>>>>>>> thx
>>>>>>> 
>>>>>>> Alejandro
>>>>>>> (phone typing)
>>>>>>> 
>>>>>>> On Sep 26, 2013, at 22:46, Virag Kothari <vi...@yahoo-inc.com>
>>> wrote:
>>>>>>> 
>>>>>>>> That¹s a good point. An alternative is to do share lib install
>>> after
>>>>>>> Oozie
>>>>>>>> starts.
>>>>>>>> An admin can put oozie in safe mode after it starts and then
>>> issues
>>>>>>> series
>>>>>>>> of commands to do the necessary maintenance
>>> (install/upgrade/purge)
>>>> of
>>>>>>>> sharelibs.
>>>>>>>> (OOZIE-1519 is already tracking admin upgrade of sharelibs)
>>>>>>>> Also all this commands can be optionally made to accept a
>>>>> configuration
>>>>>>>> parameter so the hadoop conf's are not required in oozie's class
>>>> path.
>>>>>>>> 
>>>>>>>> Thanks,
>>>>>>>> Virag
>>>>>>>> 
>>>>>>>> On 9/26/13 1:54 PM, "Alejandro Abdelnur" <t...@cloudera.com>
>>> wrote:
>>>>>>>> 
>>>>>>>>> Unless I'm missing something, is not just a missing hadoop conf
>>>>> issue,
>>>>>>> now
>>>>>>>>> Oozie won't start if HDFS is not running.
>>>>>>>>> 
>>>>>>>>> This does not seem right.
>>>>>>>>> 
>>>>>>>>> If we can sort this out  i would prefer the previous manual
>>> update
>>>> fo
>>>>>>> the
>>>>>>>>> sharelib via oozie-setup.sh
>>>>>>>>> 
>>>>>>>>> thanks.
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> On Thu, Sep 26, 2013 at 1:22 PM, Virag Kothari <
>>>> vi...@yahoo-inc.com>
>>>>>>>>> wrote:
>>>>>>>>> 
>>>>>>>>>> There is no need for this setting if you have the hadoop
>> configs
>>>>> under
>>>>>>>>>> conf/hadoop-conf. Having this setting would be more useful if
>>> you
>>>>> have
>>>>>>>>>> oozie configured to talk with multiple hadoops.
>>>>>>>>>> This configs are now required at startup as the share lib
>>> service
>>>>>>> needs
>>>>>>>>>> to
>>>>>>>>>> connect to the filesystem on which the share lib jars need to
>> be
>>>>>>> copied
>>>>>>>>>> (probably only require the core-site.xml, need to check)
>>>>>>>>>> Before also it was recommended to have the hadoop configs on
>>>>>>>>>> oozie-server,
>>>>>>>>>> so users don't need to define some of  this configurations in
>>>> their
>>>>>>>>>> workflows. As now its mandatory, we should make it clear in
>> our
>>>>>>>>>> documentation.
>>>>>>>>>> 
>>>>>>>>>> Thanks,
>>>>>>>>>> Virag
>>>>>>>>>> 
>>>>>>>>>> From: bowen zhang
>>>>>>>>>> <bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com
>>>>>>>>>> Reply-To: bowen zhang <bowenzhang...@yahoo.com<mailto:
>>>>>>>>>> bowenzhang...@yahoo.com>>
>>>>>>>>>> Date: Thursday, September 26, 2013 12:20 PM
>>>>>>>>>> To: "dev@oozie.apache.org<mailto:dev@oozie.apache.org>" <
>>>>>>>>>> dev@oozie.apache.org<mailto:dev@oozie.apache.org>>, Virag
>>>> Kothari <
>>>>>>>>>> vi...@yahoo-inc.com<mailto:vi...@yahoo-inc.com>>
>>>>>>>>>> Subject: Re: issue after OOZIE-1461
>>>>>>>>>> 
>>>>>>>>>> I second on Robert's concern. Right now, even the easiest way
>>> for
>>>> me
>>>>>>> to
>>>>>>>>>> get around this is to point "
>>>>>>>>>> oozie.service.HadoopAccessorService.hadoop.configurations" to
>>>>>>>>>> "*=Absolute
>>>>>>>>>> path of my hadoop conf".
>>>>>>>>>> Bowen
>>>>>>>>>> 
>>>>>>>>>> ________________________________
>>>>>>>>>> From: Robert Kanter <rkan...@cloudera.com<mailto:
>>>>> rkan...@cloudera.com
>>>>>>>>> 
>>>>>>>>>> To: Virag Kothari <vi...@yahoo-inc.com<mailto:
>>> vi...@yahoo-inc.com
>>>>>> 
>>>>>>>>>> Cc: "dev@oozie.apache.org<mailto:dev@oozie.apache.org>" <
>>>>>>>>>> dev@oozie.apache.org<mailto:dev@oozie.apache.org>>; bowen
>>> zhang <
>>>>>>>>>> bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com>>
>>>>>>>>>> Sent: Thursday, September 26, 2013 11:11 AM
>>>>>>>>>> Subject: Re: issue after OOZIE-1461
>>>>>>>>>> 
>>>>>>>>>> Is there any way to make that step not required, or to at
>> least
>>>>>>>>>> streamline
>>>>>>>>>> it somehow?  I imagine we'll see many questions from users
>>>> wondering
>>>>>>> why
>>>>>>>>>> their Oozie server doesn't start because of this.  Also, Oozie
>>>> used
>>>>> to
>>>>>>>>>> work
>>>>>>>>>> out-of-the-box after running a few scripts; now it requires
>>>> manually
>>>>>>>>>> setting the
>>>>> oozie.service.HadoopAccessorService.hadoop.configurations
>>>>>>>>>> property.
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> thanks
>>>>>>>>>> - Robert
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> On Mon, Sep 23, 2013 at 2:57 PM, Virag Kothari <
>>>> vi...@yahoo-inc.com
>>>>>>>>>> <mailto:vi...@yahoo-inc.com>> wrote:
>>>>>>>>>> 
>>>>>>>>>>> Bowen,
>>>>>>>>>>> 
>>>>>>>>>>> Including hadoop configs on oozie-server is documented at
>>> https://oozie.apache.org/docs/3.3.2/AG_HadoopConfiguration.html.
>>>>>>>>>>> Documentation for sharelib installation can be updated once
>>>>>>> OOZIE-1518
>>>>>>>>>> and
>>>>>>>>>>> OOZIE-1519 goes in. We can also update quick start guide
>> during
>>>>> that
>>>>>>>>>> time.
>>>>>>>>>>> 
>>>>>>>>>>> Regards,
>>>>>>>>>>> Virag
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> On 9/23/13 2:31 PM, "bowen zhang" <bowenzhang...@yahoo.com
>>>> <mailto:
>>>>>>>>>> bowenzhang...@yahoo.com>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>> Virag,
>>>>>>>>>>>> Can you add documentation for this change since the current
>>>> build
>>>>>>> and
>>>>>>>>>>>> setup of oozie doesn't cover this?
>>>>>>>>>>>> Bowen
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> ________________________________
>>>>>>>>>>>> From: Virag Kothari
>>>>>>>>>> <vi...@yahoo-inc.com<mailto:vi...@yahoo-inc.com>>
>>>>>>>>>>>> To: bowen zhang
>>>>>>>>>> <bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com>>;
>>>>>>>>>> "dev@oozie.apache.org<mailto:dev@oozie.apache.org>"
>>>>>>>>>>>> <dev@oozie.apache.org<mailto:dev@oozie.apache.org>>; Robert
>>>>> Kanter
>>>>>>> <
>>>>>>>>>> rkan...@cloudera.com<mailto:rkan...@cloudera.com>>
>>>>>>>>>>>> Sent: Monday, September 16, 2013 2:58 PM
>>>>>>>>>>>> Subject: Re: issue after OOZIE-1461
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> Hi Robert/Bowen,
>>>>>>>>>>>> 
>>>>>>>>>>>> The hadoop configs need to be there in class path
>> (hadoop-conf
>>>> dir
>>>>>>> or
>>>>>>>>>>>> oozie-server/lib). So the HadoopAccessorService can create
>> the
>>>>>>>>>>>> appropriate filesystem object. This will fix your current
>>> issue.
>>>>>>>>>>>> But there is one more problem you might face while running a
>>> job
>>>>>>>>>> where
>>>>>>>>>>>> permissions are not recursively applied. This fix is in
>>>> OOZIE-1528
>>>>>>>>>> and
>>>>>>>>>>>> will be checked in shortly.
>>>>>>>>>>>> 
>>>>>>>>>>>> Thanks,
>>>>>>>>>>>> Virag
>>>>>>>>>>>> 
>>>>>>>>>>>> From: bowen zhang
>>>>>>>>>>>> <bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com
>>>>> <mailto:
>>>>>>>>>> bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com>>>
>>>>>>>>>>>> Reply-To: bowen zhang
>>>>>>>>>>>> <bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com
>>>>> <mailto:
>>>>>>>>>> bowenzhang...@yahoo.com<mailto:bowenzhang...@yahoo.com>>>
>>>>>>>>>>>> Date: Monday, September 16, 2013 2:49 PM
>>>>>>>>>>>> To: "dev@oozie.apache.org<mailto:dev@oozie.apache.org
>>>> <mailto:
>>>>>>>>>> dev@oozie.apache.org<mailto:dev@oozie.apache.org>>"
>>>>>>>>>>>> <dev@oozie.apache.org<mailto:dev@oozie.apache.org><mailto:
>>>>>>>>>> dev@oozie.apache.org<mailto:dev@oozie.apache.org>>>, Virag
>>>> Kothari
>>>>>>>>>>>> <vi...@yahoo-inc.com<mailto:vi...@yahoo-inc.com><mailto:
>>>>>>>>>> vi...@yahoo-inc.com<mailto:vi...@yahoo-inc.com>>>
>>>>>>>>>>>> Subject: Re: Fwd: issue after OOZIE-1461
>>>>>>>>>>>> 
>>>>>>>>>>>> what I found is the variable uri from tmpShareLibPath has
>>>>> authority
>>>>>>>>>> of
>>>>>>>>>>>> "null".
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> ________________________________
>>>>>>>>>>>> From: Robert Kanter <rkan...@cloudera.com<mailto:
>>>>>>> rkan...@cloudera.com
>>>>>>>>>>> <mailto:rkan...@cloudera.com<mailto:rkan...@cloudera.com>>>
>>>>>>>>>>>> To: Virag Kothari <vi...@yahoo-inc.com<mailto:
>>>> vi...@yahoo-inc.com
>>>>>>>>>>> <mailto:vi...@yahoo-inc.com<mailto:vi...@yahoo-inc.com>>>;
>>>>>>>>>>>> "dev@oozie.apache.org<mailto:dev@oozie.apache.org><mailto:
>>>>>>>>>> dev@oozie.apache.org<mailto:dev@oozie.apache.org>>"
>>>>>>>>>>>> <dev@oozie.apache.org<mailto:dev@oozie.apache.org><mailto:
>>>>>>>>>> dev@oozie.apache.org<mailto:dev@oozie.apache.org>>>
>>>>>>>>>>>> Sent: Monday, September 16, 2013 2:39 PM
>>>>>>>>>>>> Subject: Fwd: issue after OOZIE-1461
>>>>>>>>>>>> 
>>>>>>>>>>>> Hi Virag,
>>>>>>>>>>>> 
>>>>>>>>>>>> After OOZIE-1461, Bowen (and I too) have run into this
>>> exception
>>>>>>> when
>>>>>>>>>>>> starting Oozie, so it fails and won't start.  I checked, and
>>> for
>>>>> me
>>>>>>>>>> at
>>>>>>>>>>>> least, the share/lib/ dir looks like it has the correct
>>>>> permissions.
>>>>>>>>>>>> Any thoughts?
>>>>>>>>>>>> 
>>>>>>>>>>>> thanks
>>>>>>>>>>>> - Robert
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> On Mon, Sep 16, 2013 at 2:26 PM, Bowen Zhang
>>>>>>>>>>>> <bzh...@hortonworks.com<mailto:bzh...@hortonworks.com
>>>> <mailto:
>>>>>>>>>> bzh...@hortonworks.com<mailto:bzh...@hortonworks.com>>>
>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>> Hi Robert,
>>>>>>>>>>>>> After rebasing to the trunk, I hit this error when trying
>> to
>>>>> bring
>>>>>>>>>> up
>>>>>>>>>>>>> oozie.
>>>>>>>>>>>>> org.apache.oozie.service.ServiceException: E0100: Could not
>>>>>>>>>> initialize
>>>>>>>>>>>>> service [org.apache.oozie.service.ShareLibService], Failed
>> to
>>>> set
>>>>>>>>>>>>> permissions of path:
>>>>>>>>>> /user/bzhang/share/lib/tmp-20130916135406/oozie
>>>>>>>>>> to
>>>>>>>>>>>>> 0755
>>>>>>>>>>>>>      at
>> org.apache.oozie.service.ShareLibService.init(ShareLibService.java:81)
>>>>>>>>>>>>>      at
>>>> org.apache.oozie.service.Services.setServiceInternal(Services.java:368)
>>>>>>>>>>>>>      at
>>> org.apache.oozie.service.Services.setService(Services.java:354)
>>>>>>>>>>>>>      at
>>>> org.apache.oozie.service.Services.loadServices(Services.java:287)
>>>>>>>>>>>>>      at
>>>>> org.apache.oozie.service.Services.init(Services.java:208)
>>>>>>>>>>>>>      at
>>>>>>>>>> 
>>>> org.apache.oozie.servlet.ServicesLoader.contextInitialized(ServicesLoad
>>>>>>>>>>>> er
>>>>>>>>>>>>> .java:45)
>>>>>>>>>>>>>      at
>>>>>>>>>> 

Reply via email to