Here is a guess: a remote repo went missing during an install, and the package system was left in a locked state that was never cleared.

What if Whirr forced the dpkg lock to clear? Does it rely on that lock for serialization?
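
A less drastic alternative to forcing the lock clear would be to wait and retry while another process holds it. Below is a minimal Python sketch of that idea, not what Whirr actually does: the function name run_with_lock_retry, the retry counts, and the injectable run/sleep parameters are all hypothetical, and the only thing taken from the log is the error text itself.

```python
import subprocess
import time

# Error text as it appears in the stderr output quoted in this thread.
LOCK_ERROR = "Could not get lock /var/lib/dpkg/lock"


def run_with_lock_retry(cmd, attempts=5, delay=10.0, run=None, sleep=time.sleep):
    """Run cmd, retrying only while stderr reports the dpkg lock error.

    Returns (exit_code, attempts_used). Any other failure is returned
    immediately, since retrying would not help.
    """
    if run is None:
        # Default runner: capture stderr as text so it can be inspected.
        def run(c):
            proc = subprocess.run(c, stderr=subprocess.PIPE, text=True)
            return proc.returncode, proc.stderr

    code = None
    for attempt in range(1, attempts + 1):
        code, err = run(cmd)
        if code == 0 or LOCK_ERROR not in err:
            return code, attempt
        if attempt < attempts:
            sleep(delay)  # give the other dpkg/apt process time to finish
    return code, attempts
```

Usage would look something like run_with_lock_retry(["apt-get", "install", "-y", "some-package"]), where the package name is a placeholder. This only helps if the lock holder eventually exits; it does nothing for a lock held by a process that is stuck forever.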

Paul


On 20111112 10:44, Paul Baclace wrote:
I am seeing this error, not due to any change I made:

E: Could not get lock /var/lib/dpkg/lock - open (11: Resource temporarily unavailable)
E: Unable to lock the administration directory (/var/lib/dpkg/), is another process using it?

What causes this intermittent problem? At the moment, it is very repeatable.


Paul

On 20111111 22:23, Andrei Savu wrote:
Can you make the S3 files public? Is this happening on all machines?

You should probably consider using whirr.instance-templates-max-percent-failures as described here:
http://whirr.apache.org/docs/0.6.0/configuration-guide.html
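
A sketch of what that setting might look like in the cluster properties file. The percentages and role groups here are illustrative assumptions, not values taken from the config in this thread, and the exact value syntax and semantics should be checked against the configuration guide linked above:

```
# Assumed example: require every master instance to start, but tolerate
# some worker instances failing before the cluster launch is abandoned.
whirr.instance-templates-max-percent-failures=100 hadoop-namenode+hadoop-jobtracker,60 hadoop-datanode+hadoop-tasktracker
```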

Cheers,

-- Andrei Savu / andreisavu.ro <http://andreisavu.ro>

On Sat, Nov 12, 2011 at 2:22 AM, Arun Ramakrishnan <[email protected]> wrote:

    Guys,

    It looks like the apt hadoop packages aren't getting installed.
    Any ideas ?

    ###################################################

    2011-11-11 12:31:31,893 DEBUG [jclouds.compute] (user thread 6)
    << stderr from jclouds-script-1321043482986 as [email protected]
    sed: can't read /etc/hadoop-0.20/conf.dist/hadoop-env.sh: No such
    file or directory
    sed: can't read /etc/hadoop-0.20/conf.dist/hadoop-env.sh: No such
    file or directory
    chgrp: invalid group: `hadoop'
    chgrp: invalid group: `hadoop'
    E: Could not get lock /var/lib/dpkg/lock - open (11: Resource
    temporarily unavailable)
    E: Unable to lock the administration directory (/var/lib/dpkg/),
    is another process using it?
    hadoop-0.20-datanode: unrecognized service
    E: Could not get lock /var/lib/dpkg/lock - open (11: Resource
    temporarily unavailable)
    E: Unable to lock the administration directory (/var/lib/dpkg/),
    is another process using it?
    hadoop-0.20-tasktracker: unrecognized service

    ##################################################

    I am using binaries that I built from 0.7 a few weeks back.


    Full log : http://incentica-public.s3.amazonaws.com/whirr-ccore44.log
    Config  :
    http://incentica-public.s3.amazonaws.com/whirr_cdh.properties


    This seems to happen non-deterministically, and more often for larger
    clusters (10+ nodes).


    thanks
    Arun



