[galaxy-dev] Bug with workflow

2014-07-03 Thread julie dubois
Hi all,
I've a bug with the use of workflow. I've created my own workflow and
since the latest update the workflow can be launched correctly but not
completely!!!

The first steps works with outputs on history, a job runs in
job_working_directory for each of these steps. But for my final step :
no job begin in job_working_directory so no data in history because
the jobs don't exist.

In my workflow the outputs are selected to be not hidden and the final
step is connected with other steps.

Any other one have this problem ?

Thanks.

Julie
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] ImportError: No module named galaxy_utils.sequence.fastq

2014-07-03 Thread John Chilton
Hello Julie,

  This is odd - sorry.

  I might be worth uninstalling and re-installing this in the tool
shed admin options - in particular ensure you install tool
dependencies and that this installation process succeeds.

  Otherwise -  it might be worth trying to put some debug statements
in the fastq groomer tool to try to track down the problem. Here
(https://gist.github.com/jmchilton/2d61f7ab81d99a0f02fc) is a diff to
have that tool write out the run-time value  PYTHONPATH to Galaxy's
home directory - it might be worth running a job, finding the output,
setting PYTHONPATH to that value on the compute node and attempting to
just import it directly.

cd $HOME; python -c import galaxy.sequences.fastq

  Out of curiosity - what job runner are you using (local, DRMAA, etc...)?

-John

On Tue, Jul 1, 2014 at 4:50 AM, julie dubois dubju...@gmail.com wrote:
 Hi,
 I've this error message when I run groomer.

 My local instance of galaxy runs on a python virtualenv (python 2.7)
 and I have make the latest update with : hg pull -u.

 I've verified and this library is present in my
 galaxy-dist/lib/galaxy_utils/sequence directory.

 I've found no other idea than the update on this forum to run groomer
 correctly, any other idea ?

 Thanks.
 Julie
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] repeat tag enhancement?

2014-07-03 Thread John Chilton
Hello Jens,

  A high priority for the team is to replace the tool form with one
that is more javascript driven - right now it repeatedly POSTs to the
web server and has a lot of complicated server state handling. Moving
this client side and doing these things more dynamically should enable
enhancements like the ones outlined in this card
(https://trello.com/c/RllVlsfg). These issues came up at the
conference and they will hopefully be addressed soon-ish.

  This addresses the repeat tag limitations specifically - but
workflows with variable numbers of input parameters isn't a problem
that is just solved with UI enhancements. To broadly tackle the
problem the team has been working on datasets collections and I just
presented the work at the community conference
bit.ly/gcc2014workflows. A lot of this is in -stable, but -central
contains some important UI work and bug fixes that will be released
with the next version of Galaxy.

  There are certainly times when it makes sense to place input dataset
parameters inside of repeat blocks - but I think more tools are
switching over to us multiple=true on a single data parameter input.
It allows for more rapid selection of many inputs and work more
seamlessly with dataset collections.

Hope this helps,
-John

On Wed, Jul 2, 2014 at 2:03 AM, Keilwagen, Jens
jens.keilwa...@jki.bund.de wrote:
 Hi guys,

 this threads has been inactive for quite a long while. However, I'd like to 
 restart the discussion.

 There are two alternatives for improving the usability of the repeat tag.
 1) If the repeat tag arguments could be populated using white pages and a 
 filter similar to the possibility for workflow, the user would be able to add 
 several inputs at once.
 2) If workflows would allow for using multiple (but not fixed number of) 
 inputs obtained via white pages as input of a repeat tag argument of a tool, 
 the user could use the functionality of multiple inputs (white pages) in 
 workflows.

 I would vote for the second proposal as it allows for building workflows with 
 variable number of inputs (e.g. read mapping and combined SNP calling for a 
 variable number of samples). Workflows with variable number of inputs have 
 been brought up from time to time at several posts or meetings. Hence, 
 solving this could be hitting 2 birds with one stone.

 However, proposal 2 has the drawback that using repeat tag arguments outside 
 of workflows is still a lot of manual work.

 This could be circumvented by tiny workflows consisting only of the tool of 
 interest (containing the repeat tag argument).

 best regards, Jens

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] Galaxy/ODoSE - Output accessory orthologous genes ERROR

2014-07-03 Thread Sérgio Miguel Marcelino dos Santos
Dear Galaxy team,

I am having a problem running Galaxy/ODoSE. In the step Extract  categorize 
orthologs, I selected the option Output accessory orthologous genes but it 
didn't work and didn't output any file. The stdout contains multiple warnings:


...

WARNING 15:18:06 select_taxa.select_genomes_by_ids:33   Could not find genome 
with BioProject ID 11 in complete genomes table
WARNING 15:18:06 select_taxa.select_genomes_by_ids:33   Could not find genome 
with BioProject ID 10 in complete genomes table
WARNING 15:18:06 select_taxa.select_genomes_by_ids:33   Could not find genome 
with BioProject ID 14 in complete genomes table
WARNING 15:18:08 __init__.get_most_recent_gene_name:186 Could not retrieve gene 
name annotation based on date; returning first gene name annotation instead
WARNING 15:18:08 __init__.get_most_recent_gene_name:186 Could not retrieve gene 
name annotation based on date; returning first gene name annotation instead
WARNING 15:18:08 __init__.get_most_recent_gene_name:186 Could not retrieve gene 
name annotation based on date; returning first gene name annotation instead

...


How can I solve this problem? Thank you in advance!

Best Regards,

Sérgio
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Job execution order mixed-up

2014-07-03 Thread John Chilton
To close this loop -  Ilya Sytchev was experiencing this problem as well
and did a bunch of tests at the recent GCC Hackathon that pretty clearly
demonstrated that this problem occurs when all of the following three
conditions hold - 1) Galaxy targets MySQL 2) MyISAM tables are used and 3)
Galaxy runs separate web and handler processes (even with 1 and 1).

Ilya's work has been documented here: https://trello.com/c/uVYR3IHc and he
kindly updated the wiki to reflect these problems. The Galaxy team already
strongly recommends Postgres over MySQL - but if you have to use MySQL
please use InnoDB tables (the new default) instead of MyISAM.

-John



On Sat, Dec 7, 2013 at 9:36 AM, John Chilton chil...@msi.umn.edu wrote:

 Hello Yves,

 duplicates the history the same amount of times as the number of
 threadpool_workers we have configured for the job handlers

 Well that shouldn't be happening. Do each of these multiple histories have
 all of the datasets for the workflow run populated as well?

 I assume then that Send results to a new history are checked when
 running the workflow. Does this problem happen when that is not checked?

 I have couple of related questions - but perhaps if you could send a
 version of your universe_wsgi.ini to galaxy-b...@bx.psu.edu
 with sensitive information redacted (database passwords, id_secret,
 admin_users) it would answer all of them.

 It would be great to rule out tool related problems (though they are
 seeming more and more unlikely as you describe the symptoms) - can you
 recreate this with a big workflow that just uses standard Galaxy tools
 bundled with the distribution? Eitherway, if you could send us a workflow
 example that demonstrates the issue along with tool xml files for any
 custom tools that would help. Again - you can send that to
 galaxy-b...@bx.psu.edu if it is sensitive.

 Also, this workflow is being run through the GUI right - I want to rule
 out this being an API related problem.

 ... as you can probably tell I am still flummoxed. Sorry I don't have
 answers and thanks for your patience :(.

 -John



 On Wed, Dec 4, 2013 at 10:09 AM, Yves Gagnon yves.gag...@dnalandmarks.ca
 wrote:

 Dear Galaxy mailing list,

 I will be taking over this issue here now.

 Here is an update on this issue and our observations.  I would like to
 know your insight on the matter if you can think of anything!

 We tried to revert back to a Galaxy instance running only one job
 handler.  With that configuration, we observed that the out of order
 execution problem still occured, but MUCH less frequently than when using
 three handlers.  On multiple workflow runs, only one job started while its
 input was not ready in only one run of the workflow.  When using three job
 handlers, it occured on all workflow runs and on multiple jobs inside each
 workflow run.

 Our poweruser also noticed that his workflow, when taking too much time
 to prepare (which is always the case I think since it's a huge workflow),
 duplicates the history the same amount of times as the number of
 threadpool_workers we have configured for the job handlers.  Now I am not
 sure both are related at all since I do not know much what is the effect of
 running a handler on multiple thread_workers.

 In any case, as of now we are staying with a single handler configuration
 since it in part fix the out of order execution problems, but since we have
 many regular users and some powerusers, that is not the ideal solution.

 Hope somebody can shed some light on this!

 Cheers!

 *Yves Gagnon*
 LIMS/ELN Developer

 Phone: +1 450 357-3370 Fax: +1 450 358-1154 E-Mail:
 yves.gag...@dnalandmarks.ca
 Postal Address: DNA LandMarks Inc., 84 Rue Richelieu,
 Saint-Jean-Sur-Richelieu, Quebec, CANADA, J3B 6X3

 * DNA LandMarks - une compagnie de BASF Plant Science / a BASF Plant
 Science company*

 Confidentiality notice: The information contained in this e-mail is
 confidential and may be the subject of legal professional privilege. It is
 intended for the authorized use of the individual or entity addressed. If
 the receiver or reader of this message is not the intended recipient, you
 are hereby notified that any disclosure, copying, distribution or use of
 the contents of this message is prohibited. If this email is received in
 error, please accept our apologies, delete all copies from your system, and
 notify us at supp...@dnalandmarks.ca.Confidentialité: L'information
 contenue dans ce courriel est confidentielle et peut être assujettie aux
 règles concernant le secret professionel. L'information contenue dans ce
 courriel est autorisée uniquement pour l'individu ou l'entité légale
 adressée. Si le récipiendaire ou le lecteur de ce message n'est pas celui
 ou celle prévue, vous êtes tenu de ne pas présenter, copier, distribuer ou
 utiliser le contenu de ce message. Si ce courriel est reçu par erreur,
 veuillez nous en excuser, veuillez détruire toutes copies de votre système
 nous informer à