Re: [galaxy-dev] cheetah chaet sheet

2015-06-19 Thread Martin Čech
I usually check this page:
http://www.cheetahtemplate.org/docs/users_guide_html/users_guide.html#SECTION00060

M.
On Fri, Jun 19, 2015 at 4:34 PM Alexander Vowinkel 
vowinkel.alexan...@gmail.com wrote:

 Hi,

 someone knows a cheat sheet with all the commands supported by cheetah?

 Best,
 Alexander
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   https://lists.galaxyproject.org/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] package_bowtie_2_2_4 Executable Permissions - can't run bowtie as non-galaxy user

2015-06-19 Thread Nicola Soranzo
  Hi David,
can you open an issue at
https://github.com/galaxyproject/docker-build ?

Thanks,
Nicola

Il
18.06.2015 21:55 David Trudgian ha scritto: 

 Installed bowtie2 from
the toolshed, which brings in package_bowtie_2_2_4, which downloads
bowtie binaries from depot.galaxyproject.org 
 

http://depot.galaxyproject.org/package/linux/x86_64/bowtie2/bowtie2-2.2.4-linux-x86_64.tgz

 
 Jobs submitted on our setup, running as a 'real user' (not galaxy
user) via pulsar + SLURM fail with a permissions error. This is due to
the permissions on some of the bowtie executables in the .tar.gz from
depot being owner/group executable only. Other users cannot run bowtie2:

 
 -rwxrwx---. 1 galaxy galaxy 18K Oct 22 2014 bowtie2 
 
 If I
download the tar.gz from depot manually and extract then I get: 
 

-rwxrwx--- 1 dtrudgian biohpc_admin 18K Oct 22 2014 bowtie2 
 
 Could
the .tar.gz be fixed, so that the executables will work in a run as real
user configuration? Is there any sensible way Galaxy could enforce
permissions to guard against this issue? 
 
 Cheers, 
 
 -- 
 

David Trudgian Ph.D. 
 
 Computational Scientist, BioHPC 
 
 UT
Southwestern Medical Center 
 
 Dallas, TX 75390-9039 
 
 Tel: (214)
648-4833 
 
 Please contact biohpc-help@utsouthwestern with general
BioHPC inquries. 
 
 -
 
 UT Southwestern 


 Medical Center 
 
 The future of medicine, today.
 



Connetti gratis il mondo con la nuova indoona:  hai la chat, le chiamate, le 
video chiamate e persino le chiamate di gruppo.
E chiami gratis anche i numeri fissi e mobili nel mondo!
Scarica subito l’app Vai su https://www.indoona.com/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] BAM datatype - bam bai

2015-06-19 Thread Alexander Vowinkel
Hi,

thanks! Let me clarify myself on the multiple generation question.

When I do command line, I do this:
Bam,Bai - Prog1 - Bam,Bai(v2) - Prog2 - Bam,Bai(v3)

What I understand, what galaxy is doing (always overwrite [1]):

Bam,Bai - Prog1 - Bam,Bai(v2) - Bam(v2) - Bam,Bai(v2) -
Prog2 - Bam,Bai(v3) - Bam(v3) - Bam,Bai(v3)

Best,
Alexander

[1]
https://github.com/galaxyproject/galaxy/blob/dev/lib/galaxy/datatypes/binary.py#L314


2015-06-19 1:26 GMT-05:00 Björn Grüning bjoern.gruen...@gmail.com:

 Hi Alexander,

 Am 19.06.2015 um 01:44 schrieb Alexander Vowinkel:
  Hi,
 
  when I 'produce' a bam file in my output, I can download the bam file
 and a
  corresponding bai file.
 
  I am now wondering where does the bai file come from?

 From Galaxy! :)
 Galaxy has a concept of datatypes and metadata. In the case of BAM
 files, Galaxy is producing the index file to every position sorted BAM
 file.

  When is it being generated, what code is used for that?

 As soon as the BAM file is created.

 https://github.com/galaxyproject/galaxy/blob/dev/lib/galaxy/datatypes/binary.py#L312

  When I do multiple actions with my bam file - is the index generated
 every
  time? (overhead)

 No why should it? The BAM files stays the same right? So if you are
 using the BAM multiple time the one index is used multiple times.

  Where is the bai saved on disk (if it is)?

 You can access it with this snipped: $your_bam_file.metadata.bam_index

  I found that in the IUC gatk, a new bai is always generated when a gatk
  tool is started. This creates quite an overhead. Isn't there a better
  solution?

 Do you have an idea?
 Without Galaxy you also create an index once and reuse it, this is was
 Galaxy is doing. This concept is also used for other datatypes and
 improves the usability dramatically.


 If this was helpful, please consider to ask this question on Biostar
 again. I think this might be useful for others as well.

 Cheers,
 Bjoern

  @ref https://github.com/galaxyproject/tools-iuc/issues/194
 
  Best,
  Alexander
 
 
 
  ___
  Please keep all replies on the list by using reply all
  in your mail client.  To manage your subscriptions to this
  and other Galaxy lists, please use the interface at:
https://lists.galaxyproject.org/
 
  To search Galaxy mailing lists use the unified search at:
http://galaxyproject.org/search/mailinglists/
 

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] cloudman error somewhere

2015-06-19 Thread Enis Afgan
Hi Alexander,
When you build the image, did you run the image cleanup step? RabbitMQ's
database needs to be cleaned up before building the image otherwise the
application won't start properly and the AMQP connection will never be
established.

You can also post the entire CloudMan's log somewhere and I can take a look
there for other clues.



On Tue, Jun 16, 2015 at 4:07 PM, Alexander Vowinkel 
vowinkel.alexan...@gmail.com wrote:

 Maybe connected. this error (happening multiple times):

 
 Exception happened during processing of request from ('127.0.0.1', 37201)
 Traceback (most recent call last):
   File
 /home/ubuntu/.virtualenvs/CM/local/lib/python2.7/site-packages/paste/httpserver.py,
 line 1068, in process_request_in_thread
 self.finish_request(request, client_address)
   File /usr/lib/python2.7/SocketServer.py, line 334, in finish_request
 self.RequestHandlerClass(request, client_address, self)
   File /usr/lib/python2.7/SocketServer.py, line 651, in __init__
 self.finish()
   File /usr/lib/python2.7/SocketServer.py, line 710, in finish
 self.wfile.close()
   File /usr/lib/python2.7/socket.py, line 279, in close
 self.flush()
   File /usr/lib/python2.7/socket.py, line 303, in flush
 self._sock.sendall(view[write_offset:write_offset+buffer_size])
 error: [Errno 32] Broken pipe
 



 2015-06-16 13:59 GMT-05:00 Alexander Vowinkel 
 vowinkel.alexan...@gmail.com:

 Hi,

 I have the following problem:
 First, the job scheduling didn't work.

 I tried to upload a file, but the job was waiting.
 Then I checked squeue, which timed out.
 The slurm service was running.

 Then I restarted the cluster.
 Now I even have problems access the cloudman admin page.
 Meaning, the page is loading endless.
 The machine is not busy (CPU and mem @ htop)
 In the paster.log it tries this every 4 seconds:

 2015-06-16 18:35:56,547 DEBUG master:2579 Trying to setup AMQP
 connection; conn = 'cm.util.comm.CMMasterComm object at 0x7f0aa26d72d0'
 2015-06-16 18:35:56,547 DEBUG   comm:30   Setting up a new AMQP
 connection
 2015-06-16 18:35:56,548 DEBUG   comm:49   AMQP Connection
 Failure:  [Errno 111] Connection refused


 Any idea what this might be?
 Something seems to be really weird.

 I am using the next version of CloudMan (dev).
 I created an own instance with the Cloud Man playbook.

 Best,
 Alexander



 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   https://lists.galaxyproject.org/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Custom Tool Dev Questions

2015-06-19 Thread John Chilton
The option in the GUI for selecting multiple datasets for a single
data parameter - causes multiple jobs to be created as you noted. If
you just place multiple=true on your data input parameter  - that
button will go away and all the datasets will be used for a single
job.

If your tool consumes pairs of datasets - you will want to expose an
option where you are using a parameter like:

input name=input_pairs type=data_collection
collection_type=list:paired format=fastq /

and then your users will have to use the data collection creator to
pair off the datasets into a big list before using your tool - but the
creator is actually really nice - I think it makes it very easy to
build large lists of pairs.

Hope this helps - happy to provide more details,
-John

On Fri, Jun 19, 2015 at 11:24 AM, Ryan G ngsbioinformat...@gmail.com wrote:
 Hi all - I have a tool which is essentially an entire NGS pipeline bundled
 into a perl script and associated modules.  I'd like to offer this tool to
 users via Galaxy.

 I'm trying to pass a bunch of fastq files to the tool, but noticed that
 operating on multiple datasets creates a separate job for each dataset.  I
 want to pass all the datasets to the tool so only one galaxy job is actually
 created.  Is this possible, and if so, how do I go about doing this?

 Ryan


 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   https://lists.galaxyproject.org/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] [spam?] Cloudman cluster not starting workers

2015-06-19 Thread Enis Afgan
Hi Nick,
Sorry to hear you're having trouble. I just tried a couple of scenarios and
they all worked as expected (e.g., with and without elastic IPs, different
instance types).

The main CloudMan log is located in /mnt/cm/paster.log, on both master and
worker instances (if you didn't download the ssh key from cloudlaunch, you
can ssh with ubuntu username and the same password as provided on the
cloudlaunch form). The log is also available from the UI if you go to Admin
page and then click 'Show CloudMan log' under 'System controls'. If you can
share that, we can hopefully figure out what's going on.

Best,
Enis

On Wed, Jun 17, 2015 at 11:47 AM, Nicholas Dickens 
nick.dick...@glasgow.ac.uk wrote:

   Hi All,

  First time post, as a quick intro, I’m have some reasonable experience
 with EC2  AWS for developing our own pipelines, I’m comfortable in python
 and *nix flavours and I am developing a completely custom galaxy in AWS for
 members of the WTCMP, Glasgow.  In the meantime, I have thrown up a quick
 cloudman galaxy using the cloudstart and the cloudman 2.3 ami
 (ami-a7dbf6ce) in us-east-1.  Auto–scaling didn’t seem to work so I’ve
 switched it off and added nodes manually, I tried various sizes including
 ‘same as master’ instances but they just don’t start – in the EC2 console I
 can see them and see them running.  But they’re constantly pending in the
 /cloud interface and in the log they reboot 4 times and then terminate –
 apparently not responding 10:16:56 - Instance i-xx not responding
 after 4 reboots. Terminating instance.

  It’s out of the box, I editted the universe_wsgi.ini… file to disallow
 user registration and allow me to impersonate users but didn’t change
 anything else.  The only other configuration I’ve done is associate an
 elastic IP with the master instance so I can have a more static url for a
 couple of test users (if I need to destroy it and start again, etc).

  I’m new to the system so don’t know which logs are best to check…and am
 I missing something obvious?  It there a known bug when using elastic IPs?
 I’ve googled but with no joy.

  Thanks for your help and best wishes,

  Nick
  --
  Nick Dickens
 DPhil BSc ARCS

  Bioinformatics Team Leader
 Wellcome Trust Centre for Molecular Parasitology
 B6-21 SGDB
 120 University Place
 Glasgow
 G12 8TA

  Tel: +44 141 330 8282

  http://fb.me/WTCMPbix
 @WTCMPbix
 http://www.gla.ac.uk/researchinstitutes/iii/staff/nickdickens/

 http://www.gla.ac.uk/researchinstitutes/iii/staff/jeremymottram/comparativegenomicsofleishmania/


 - Upozorenje -

 Automatskom detekcijom utvrdjeno je da tekst ove poruke
 podsjeca na tzv. phishing poruku.

  AKO SE U PORUCI TRAZI DA POSALJETE VASU IRB LOZINKU ILI
 DA UNESETE IRB PODATKE NA NAVEDENOM LINKU, RADI SE O
 NAPADU S CILJEM KRADJE I ZLOUPOTREBE PODATAKA.

  Centar za informatiku i racunarstvo,
 Institut Rudjer Boskovic

 - Upozorenje -

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   https://lists.galaxyproject.org/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Custom Tool Dev Questions

2015-06-19 Thread Ryan G
Ok, Great.  I think I have part of this figured out.  I have a conditional
input as specified:

conditional name=input_type
  param name=input_type_selector type=select label=Select input
type help=Select between single and paired fastq data
option value=singleSingle-End/option
option value=pairedPaired-End/option
option value=paired_collectionPaired Collection/option
  /param
  when value=single
param name=fastq_input1 type=data format=fastq label=Select
fastq dataset multiple=true help=Specify dataset with single end reads
/
  /when
  when value=paired
param name=fastq_input1 type=data format=fastq label=Select
fastq dataset help=Specify dataset with paired-end reads /
param name=fastq_input2 type=data format=fastq label=Select
fastq dataset help=Specify dataset with paired-end reads /
  /when
  when value=paired_collection
param name=fastq_collection type=data_collection
collection_type=list:paired label=Select a paired collection help=See
help section for an explanation of dataset collections/
  /when
/conditional


In my command section, I want to test how this is passed to the tool, so
I have:

  command

#if str( $input_type.input_type_selector ) == single:

  echo Input Type: Single
  echo Input File: $input_type.fastq_input1

#elif str( $input_type.input_type_selector ) == paired:

  echo Input Type: Paired
  echo FastQ1: $input_type.fastq_input1\nFastQ2:
$input_type.fastq_input2

#elif str( $input_type.input_type_selector ) == paired_collection:

  print Input Type: Collection\n
  #for $element in $input_type.$fastq_collection:

echo FastQ1: $element.fastq_input1

  #end for

#end if

  /command

The single and paired works as expected, however I can select multiple
input for paired which I don't want to allow.  A user should only be able
to select a single pair of fastq files.  To select multiple pairs, I have
paired_collection.

The second problem is in the command section, I'm trying to iterated
through the pair of inputs, however my echo statement is causing a Cheetah
error.  I tried different incarnations of it as well.  My latest (and what
I thought correct) is shown above.  It generates the error:

Traceback (most recent call last):
  File 
/apps/sys/galaxy/galaxy-v15.05-staging/lib/galaxy/jobs/runners/__init__.py,
line 163, in prepare_job
job_wrapper.prepare()
  File /apps/sys/galaxy/galaxy-v15.05-staging/lib/galaxy/jobs/__init__.py,
line 858, in prepare
self.command_line, self.extra_filenames = tool_evaluator.build()
  File /apps/sys/galaxy/galaxy-v15.05-staging/lib/galaxy/tools/evaluation.py,
line 411, in build
self.__build_command_line( )
  File /apps/sys/galaxy/galaxy-v15.05-staging/lib/galaxy/tools/evaluation.py,
line 427, in __build_command_line
command_line = fill_template( command, context=param_dict )
  File /apps/sys/galaxy/galaxy-v15.05-staging/lib/galaxy/util/template.py,
line 9, in fill_template
return str( Template( source=template_text, searchList=[context] ) )
  File 
/apps/sys/galaxy/galaxy-v15.05-staging/eggs/Cheetah-2.2.2-py2.7-linux-x86_64-ucs2.egg/Cheetah/Template.py,
line 1004, in __str__
return getattr(self, mainMethName)()
  File cheetah_DynamicallyCompiledCheetahTemplate_1434732603_97_56713.py,
line 107, in respond
AttributeError: 'dict' object has no attribute 'VFFSL'

How do I iterate through the fastq inputs when they are passed as a list:paired?



On Fri, Jun 19, 2015 at 11:31 AM, John Chilton jmchil...@gmail.com wrote:

 The option in the GUI for selecting multiple datasets for a single
 data parameter - causes multiple jobs to be created as you noted. If
 you just place multiple=true on your data input parameter  - that
 button will go away and all the datasets will be used for a single
 job.

 If your tool consumes pairs of datasets - you will want to expose an
 option where you are using a parameter like:

 input name=input_pairs type=data_collection
 collection_type=list:paired format=fastq /

 and then your users will have to use the data collection creator to
 pair off the datasets into a big list before using your tool - but the
 creator is actually really nice - I think it makes it very easy to
 build large lists of pairs.

 Hope this helps - happy to provide more details,
 -John

 On Fri, Jun 19, 2015 at 11:24 AM, Ryan G ngsbioinformat...@gmail.com
 wrote:
  Hi all - I have a tool which is essentially an entire NGS pipeline
 bundled
  into a perl script and associated modules.  I'd like to offer this tool
 to
  users via Galaxy.
 
  I'm trying to pass a bunch of fastq files to the tool, but noticed that
  operating on multiple datasets creates a separate job for each dataset.
 I
  want to pass all the datasets to the tool so only one galaxy job is
 actually
  created.  Is this possible, and if so, how do I go about doing this?
 
  Ryan