Re: [galaxy-dev] Setting up Galaxy with SGE

2011-06-15 Thread Peter Cock
On Wed, Jun 15, 2011 at 2:30 AM, Ka Ming Nip km...@bcgsc.ca wrote:
 Hello,

 I have been trying to set up Galaxy to run its jobs on my SGE
 cluster using the Unified Method, based on the steps described
 in the wiki:

 https://bitbucket.org/galaxy/galaxy-central/wiki/Config/Cluster

 My local instance of Galaxy is installed under my home directory,
 but my cluster is on a different file system. I need to ssh to the
 head node before I can submit jobs.

 Is there any way to set up Galaxy with a SGE cluster on a different
 file system?

I haven't looked into it in great detail, but we currently have a
similar network setup, and would also like to link Galaxy to SGE.

The main problem is the lack of a shared file system - in order
to run a job on the cluster we (Galaxy) has to get the data onto
the cluster, run the job, and get the results back.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] [galaxy-user] Error running latest Galaxy Distribution: numpy.dtype exception

2011-06-15 Thread Michael Steder


On Jun 15, 2011, at 3:36 PM, Kanwei Li wrote:

 What is the version for numpy in your eggs.ini?

eggs.ini has numpy = 1.3.0.  I have no local edits this is just a clean 
checkout of galaxy-dist.  

~Mike

 On Wed, Jun 15, 2011 at 3:14 PM, Michael Steder ste...@ci.uchicago.edu 
 wrote:
 Hi Nate,
 
 On Jun 15, 2011, at 12:18 PM, Nate Coraor wrote:
 
 It should be working, this would have been a problem for a short time
 using galaxy-central, but should not have affected -dist.  Could you
 send the output of:
 
  python ./scripts/get_platforms.py
  python -V
 
 $ python ./scripts/get_platforms.py 
 macosx-10.6-universal-ucs2
 $ python -V
 Python 2.6.1
 
 
 Should I be running galaxy-central instead?
 
 No, galaxy-dist is fine (and should be less prone to issues due to
 cloning/updating at the wrong moment).
 
 Thanks,
 --nate
 
 Just in case it helps:
 
 I'm currently running tip of galaxy-dist:
 
 hg tip
 changeset:   5585:8c11dd28a3cf
 tag: tip
 user:Nate Coraor n...@bx.psu.edu
 date:Thu May 19 10:07:53 2011 -0400
 summary: Add Picard and fastqc tools to Main
 
 My prior version which is still working for me is:
 
 changeset:   5355:50e249442c5a
 user:Daniel Blankenberg d...@bx.psu.edu
 date:Thu Apr 07 08:39:07 2011 -0400
 summary: Additional fix for typo propagated from manual_builds, which was 
 updated in 5350:cd2aff5b117c. 
 
 ~Mike
 
 
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
  http://lists.bx.psu.edu/
 

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] Setting up Galaxy with SGE

2011-06-15 Thread Ravi Madduri
Hi
We have been able to get the following setup working pretty reliably:
Automated deployment of Galaxy clusters on EC2. These clusters have the 
following setup:
A NFS server providing global directories for home directories, software, and 
scratch space. We are also experimenting with glusterfs
A NIS server providing an authentication domain within the cluster. The list of 
users must be specified when the cluster is created
A GridFTP server, which is automatically registered as a Globus Online 
(www.globusonline.org) endpoint for high speed and reliable data transfer.
Ability to script transfer tasks as part of the workflow using Globus Online 
Galaxy tools (To get large quantities of data in and out of galaxy EC2 cluster 
reliably)
A Condor pool with jobs running using user's credentials (using users accounts 
that are provisioned in the EC2 cluster)
A Galaxy server with the modifications outlined above, which we plan to 
contribute back to the galaxy community soon

I gave a talk at the recent Galaxy users conference about this setup. You can 
find slides here: PowerPoint

Please let me know if you are interested and would like more information.

Regards
On Jun 15, 2011, at 5:28 AM, Marina Gourtovaia wrote:

 There are two slightly different problems here. One with the files ystem and 
 the second is whether the machine Galaxy is running on can submit jobs to the 
 cluster (ssh is mentioned in the original e-mail, suggesting that it's 
 impossible to communicate with the cluster from the machine the job is 
 running on). The Galaxy instance is constantly communicating with the cluster 
 job scheduling system (SGE or other) in order to get updates on the status of 
 the jobs. It should be possible to do it over ssh, but, in my experience, 
 this slows down the code.
 
 We run the Galaxy server on one of the cluster nodes. I imagine that other 
 people using Galaxy with the cluster do the same. Are they?
 
 Marina
 
 On 15/06/2011 08:51, Peter Cock wrote:
 On Wed, Jun 15, 2011 at 2:30 AM, Ka Ming Nipkm...@bcgsc.ca  wrote:
 Hello,
 
 I have been trying to set up Galaxy to run its jobs on my SGE
 cluster using the Unified Method, based on the steps described
 in the wiki:
 
 https://bitbucket.org/galaxy/galaxy-central/wiki/Config/Cluster
 
 My local instance of Galaxy is installed under my home directory,
 but my cluster is on a different file system. I need to ssh to the
 head node before I can submit jobs.
 
 Is there any way to set up Galaxy with a SGE cluster on a different
 file system?
 I haven't looked into it in great detail, but we currently have a
 similar network setup, and would also like to link Galaxy to SGE.
 
 The main problem is the lack of a shared file system - in order
 to run a job on the cluster we (Galaxy) has to get the data onto
 the cluster, run the job, and get the results back.
 
 Peter
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
   http://lists.bx.psu.edu/
 
 
 -- 
 The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a 
 charity registered in England with number 1021457 and a company registered in 
 England with number 2742969, whose registered office is 215 Euston Road, 
 London, NW1 2BE. ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
 http://lists.bx.psu.edu/

--
Ravi K Madduri
The Globus Alliance | Argonne National Laboratory | University of Chicago
http://www.mcs.anl.gov/~madduri

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] Setting up Galaxy with SGE

2011-06-15 Thread Chorny, Ilya
Ravi,

Do your galaxy jobs submit to the cluster as the galaxy user or as the user?

Thanks,

Ilya

Sent from my iPhone

On Jun 15, 2011, at 2:43 PM, Ravi Madduri 
madd...@mcs.anl.govmailto:madd...@mcs.anl.gov wrote:

Hi
We have been able to get the following setup working pretty reliably:

  *   Automated deployment of Galaxy clusters on EC2. These clusters have the 
following setup:
 *   A NFS server providing global directories for home directories, 
software, and scratch space. We are also experimenting with glusterfs
 *   A NIS server providing an authentication domain within the cluster. 
The list of users must be specified when the cluster is created
 *   A GridFTP server, which is automatically registered as a Globus Online 
(http://www.globusonline.orgwww.globusonline.orghttp://www.globusonline.org)
 endpoint for high speed and reliable data transfer.
 *   Ability to script transfer tasks as part of the workflow using Globus 
Online Galaxy tools (To get large quantities of data in and out of galaxy EC2 
cluster reliably)
 *   A Condor pool with jobs running using user's credentials (using users 
accounts that are provisioned in the EC2 cluster)
 *   A Galaxy server with the modifications outlined above, which we plan 
to contribute back to the galaxy community soon

I gave a talk at the recent Galaxy users conference about this setup. You can 
find slides here: 
PowerPointhttp://wiki.g2.bx.psu.edu/GCC2011?action=AttachFiledo=gettarget=GalaxyGridFTPCondorAndGlobusOnline.pptx

Please let me know if you are interested and would like more information.

Regards
On Jun 15, 2011, at 5:28 AM, Marina Gourtovaia wrote:

There are two slightly different problems here. One with the files ystem and 
the second is whether the machine Galaxy is running on can submit jobs to the 
cluster (ssh is mentioned in the original e-mail, suggesting that it's 
impossible to communicate with the cluster from the machine the job is running 
on). The Galaxy instance is constantly communicating with the cluster job 
scheduling system (SGE or other) in order to get updates on the status of the 
jobs. It should be possible to do it over ssh, but, in my experience, this 
slows down the code.

We run the Galaxy server on one of the cluster nodes. I imagine that other 
people using Galaxy with the cluster do the same. Are they?

Marina

On 15/06/2011 08:51, Peter Cock wrote:
On Wed, Jun 15, 2011 at 2:30 AM, Ka Ming 
Nipmailto:km...@bcgsc.cakm...@bcgsc.camailto:km...@bcgsc.ca  wrote:
Hello,

I have been trying to set up Galaxy to run its jobs on my SGE
cluster using the Unified Method, based on the steps described
in the wiki:

https://bitbucket.org/galaxy/galaxy-central/wiki/Config/Clusterhttps://bitbucket.org/galaxy/galaxy-central/wiki/Config/Cluster

My local instance of Galaxy is installed under my home directory,
but my cluster is on a different file system. I need to ssh to the
head node before I can submit jobs.

Is there any way to set up Galaxy with a SGE cluster on a different
file system?
I haven't looked into it in great detail, but we currently have a
similar network setup, and would also like to link Galaxy to SGE.

The main problem is the lack of a shared file system - in order
to run a job on the cluster we (Galaxy) has to get the data onto
the cluster, run the job, and get the results back.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/ http://lists.bx.psu.edu/


--
The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a 
charity registered in England with number 1021457 and a company registered in 
England with number 2742969, whose registered office is 215 Euston Road, 
London, NW1 2BE. ___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

http://lists.bx.psu.edu/http://lists.bx.psu.edu/

--
Ravi K Madduri
The Globus Alliance | Argonne National Laboratory | University of Chicago
http://www.mcs.anl.gov/~maddurihttp://www.mcs.anl.gov/~madduri

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

 http://lists.bx.psu.edu/ http://lists.bx.psu.edu/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] The new hg based Galaxy Tool Shed

2011-06-15 Thread Ravi Madduri
I apologize for jumping on to this thread a bit late. I read below that there 
is a plan to pull tools into a galaxy installation automagically. I wonder if 
you plan on providing some kind of API to query the tool registry and discover 
the tools and install them into an existing galaxy installation.

PS: The link : How to upload, download and install tools under Help seems to be 
broken.
On Jun 1, 2011, at 3:00 PM, Nate Coraor wrote:

 Peter Cock wrote:
 
 Well, use of the dependency system isn't required, so just setting
 things up on the $PATH is always a possibility.  I was going to suggest
 that your patch could be applied if it was conditional on the local
 runner and checked after any requirement type=package
 dependencies were setup, ...
 
 Is that a request for me to update the patch? I've not delved into the
 job runner code before, so it might take me a bit longer that it would
 take you. Hint hint ;) I'd help with testing though.
 
 It's not a completely trivial thing, which is why I didn't do it at the
 time.  It's probably something that should be added to the DRM wrapper
 script so that a nice error message can be supplied.  I can't think of a
 way to check at tool load that wouldn't be painfully slow.
 
 ... but there's still the problem of people running jobs through
 the local runner which are actually sent to the cluster without Galaxy's
 knowledge.  Perhaps this is something we shouldn't worry too much about,
 but I know there are people doing it.
 
 You mean if Galaxy blindly calls a tool or script, and that script
 then submits the job to the cluster? I'd say checking the cluster
 dependencies there was the tool author's responsibility.
 
 Yeah, that's the idea.  Unfortunately, if the binary isn't installed on
 the Galaxy server (which is irrelevant), the tool won't load, which is
 certainly not what we want.
 
 --nate
 
 
 Peter
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
  http://lists.bx.psu.edu/

--
Ravi K Madduri
The Globus Alliance | Argonne National Laboratory | University of Chicago
http://www.mcs.anl.gov/~madduri

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/