[galaxy-dev] Hiding a tool via tool_conf.xml

2011-02-10 Thread Peter Cock
Hi all,

I'm aware that in an individual tool's XML wrapper you can add
hidden=true to the tool element to hide the tool from the
listing shown in Galaxy's left hand panel (but the tool is still
loaded and can be called by old workflows etc).

Can this be done via the tool_conf.xml file as well? This would
seem useful from a system configuration point of view where
it seems wrong to edit the tool wrappers themselves.

In fact, it might also be nice to add hidden=true support to
section as well as tool in tool_conf.xml - although I'm not
so sure how useful that would be.

Peter
___
galaxy-dev mailing list
galaxy-dev@lists.bx.psu.edu
http://lists.bx.psu.edu/listinfo/galaxy-dev


Re: [galaxy-dev] unable to login user to local instance

2011-02-18 Thread Peter Cock
On Fri, Feb 18, 2011 at 5:01 PM, Ryan Golhar golha...@umdnj.edu wrote:
 Ok, I don't know what happened with this...I reinstalled from scratch and ...

 Traceback (most recent call last):
  File /home/galaxy/galaxy-dist/tools/new_operations/get_flanks.py, line
 16, in ?
 ...
  File
 /home/galaxy/galaxy-dist/lib/galaxy/web/framework/helpers/__init__.py,
 line 29
    return (content if len(content) = length else content[:length].rsplit('
 ', 1)[0]+suffix)
                     ^
 SyntaxError: invalid syntax


I deduce you're using Python 2.4, since that line of code
is using the ternary operator added in Python 2.5
http://www.python.org/dev/peps/pep-0308/


Now that it has been reported I think it will be fixed to work with
Python 2.4. However, the Galaxy team have indicated they will
be dropping Python 2.4 (probably this year) so if I were you
since this is a new setup I'd install Python 2.6 on this machine
(Galaxy doesn't yet support Python 2.7).

Peter

___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Hiding a tool via tool_conf.xml

2011-03-02 Thread Peter Cock
On Thu, Feb 10, 2011 at 5:58 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi all,

 I'm aware that in an individual tool's XML wrapper you can add
 hidden=true to the tool element to hide the tool from the
 listing shown in Galaxy's left hand panel (but the tool is still
 loaded and can be called by old workflows etc).

 Can this be done via the tool_conf.xml file as well? This would
 seem useful from a system configuration point of view where
 it seems wrong to edit the tool wrappers themselves.

 In fact, it might also be nice to add hidden=true support to
 section as well as tool in tool_conf.xml - although I'm not
 so sure how useful that would be.

 Peter


I've filed an enhancement request for this:
https://bitbucket.org/galaxy/galaxy-central/issue/481/

Peter
___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Shared a value for multiple set-at-runtime workflow parameters

2011-03-02 Thread Peter Cock
Hi all,

I have a multi-step workflow where I've marked some parameters to be
set at run time (in this case, a minimum read length, used in two
different workflow steps). I want to use the same parameter value in
multiple tools, rather than give the user one prompt for each tool. Is
this possible?

Related to this, it would be nice to specify a default in the workflow
(overriding any default in the tool wrapper), and give some caption
text to the workflow level parameter. I guess some of this
functionality would be related to any enhancement to make a workflow
act like a tool (e.g. to combine subworkflows into larger workflows).

Thanks,

Peter
___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Shared a value for multiple set-at-runtime workflow parameters

2011-03-02 Thread Peter Cock
On Wed, Mar 2, 2011 at 3:42 PM, Peter Cock p.j.a.c...@googlemail.com wrote:

 P.S. I do like the color highlighting and live substitution of
 the workflow parameters into each tool step that uses them.
 Very slick!


That was the good news. Now the bad news - two bug reports.

Firstly if you clear the workflow parameter (leaving it empty), then
the caption is used for the highlighted placeholders BUT only the
first word is used, e.g.

${Minimum read length (after any clipping)}

Initially the the placeholders show Minimum read length (after any
clipping), and if you enter a value like 30 then 30 is shown. If
you then edit the field to become empty, the placeholder now
shows just Minimum.

Furthermore, if I leave my workflow parameters empty, and
press execute, I get a traceback ending with:

AttributeError: 'WorkflowStep' object has no attribute 'module'

Since an empty value is valid for some tools for some parameters
(e.g. optional text fields) I would have expected any error later -
from the tools themselves if they don't like the empty value.

Peter
___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] QUAL file format(s)

2011-03-03 Thread Peter Cock
Hi all,

Is there any documentation on what Galaxy means by the different qual formats?

In particular why is there a qual454 and not a qualsanger (to match fastq)?

Historically QUAL files were used back in Sanger/capillary sequencing
by PHRED, and just hold PHRED scores as integers. This was followed
by the Roche 454 output.

Peter
___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Fwd: [Bosc] Bioinformatics Open Source Conference (BOSC 2011)--Call for Abstracts

2011-03-03 Thread Peter Cock
Hopefully this will be of interest to some of you...

-- Forwarded message --
From: Nomi Harris nlhar...@lbl.gov
Date: Thu, Mar 3, 2011 at 7:37 PM
Subject: [Bosc] Bioinformatics Open Source Conference (BOSC
2011)--Call for Abstracts
To: bosc-annou...@lists.open-bio.org, memb...@open-bio.org, GMOD
Announcements List gmod-annou...@lists.sourceforge.net, GMOD
Developers List gmod-de...@lists.sourceforge.net
Cc: Nomi Harris nlhar...@lbl.gov


We invite you to submit an abstract to BOSC 2011!  Please forward this
message as appropriate, and forgive multiple postings.

Call for Abstracts for the 12th Annual Bioinformatics Open Source
Conference (BOSC 2011)
An ISMB 2011 Special Interest Group (SIG)

Dates: July 15-16, 2011
Location: Vienna, Austria
Web site: http://www.open-bio.org/wiki/BOSC_2011
Email: b...@open-bio.org
BOSC announcements mailing list:
http://lists.open-bio.org/mailman/listinfo/bosc-announce

Important Dates:
April 18, 2011: Deadline for submitting abstracts to BOSC 2011
May 9, 2011: Notifications of accepted abstracts emailed to
corresponding authors
July 13-14, 2011: Codefest 2011 programming session (see
http://www.open-bio.org/wiki/Codefest_2011 for details)
July 15-16, 2011: BOSC 2011
July 17-19, 2011: ISMB 2011

The Bioinformatics Open Source Conference (BOSC) is sponsored by the
Open Bioinformatics Foundation (O|B|F), a non-profit group dedicated
to promoting the practice and philosophy of Open Source software
development within the biological research community. To be considered
for acceptance, software systems representing the central topic in a
presentation submitted to BOSC must be licensed with a recognized Open
Source License, and be freely available for download in source code
form.

We invite you to submit abstracts for talks and posters.  Sessions include:
- Approaches to parallel processing
- Cloud-based approaches to improving software and data accessibility
- The Semantic Web in open source bioinformatics
- Data visualization
- Tools for next-generation sequencing
- Other Open Source software

In addition to the above sessions, there will be a panel discussion
about Meeting the challenges of inter-institutional collaboration.
We are also working to arrange a joint session with one of the other
ISMB SIGs.

Thanks to generous sponsorship from Eagle Genomics and an anonymous
donor, we are pleased to announce a competition for three Student
Travel Awards for BOSC 2011. Each winner will be awarded $250 to
defray the costs of travel to BOSC 2011.

For instructions on submitting your abstract, please visit
http://www.open-bio.org/wiki/BOSC_2011#Abstract_Submission_Information

BOSC 2011 Organizing Committee:
Nomi Harris and Peter Rice (co-chairs); Brad Chapman, Peter Cock,
Erwin Frise, Darin London, Ron Taylor


___
BOSC mailing list
b...@lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/bosc

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] missing setup.sh in galaxy source code

2011-03-06 Thread Peter Cock
On Sun, Mar 6, 2011 at 6:36 AM, Zhi-Wei Lu z...@ucdavis.edu wrote:
 Hi Galaxy developers,

 I have download the galaxy code via mercurial

 hg clone https://bitbucket.org/galaxy/galaxy-central

 The downloaded code, however, does not contain the setup.sh file.

 Could you let me know how do I run setup now?

 Thank you.

There is no setup.sh file any more, see for example:
http://lists.bx.psu.edu/pipermail/galaxy-dev/2011-February/004346.html

If there is some out of date documentation that needs fixing, please report it

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] possible bug - user log in / browser cache

2011-03-07 Thread Peter Cock
On Mon, Mar 7, 2011 at 5:56 PM, Nate Coraor n...@bx.psu.edu wrote:
 Ryan Golhar wrote:
 I have a local instance running.  When I'm logged in as user1, log
 out, then try to log in as user2, I keep getting the You have been
 logged out message.

 Its as if the browser's cache is retaining some information.  To get
 around this, after I log out as user1, I have to click on Analyze
 Data, then click on Log in to log in as user2.

 I'm running Safari on Mac, but have noticed this with Firefox as well.

 Hi Ryan,

 The login page will redirect you back to whatever page you to whatever
 page you were on before clicking log in.  If this page is the logout
 page, the behavior you reported will be what happens.  If you log in
 from any other page, this should go away.

Bug confirmed on Firefox on Mac, and Chrome on Windows using
http://test.g2.bx.psu.edu/

I guess you just need to say if the previous page was the logout
page, go to the main page instead, rather than a blind redirect.

Issue filed:
https://bitbucket.org/galaxy/galaxy-central/issue/486

I expect many people to be caught out by this (back when I was
debugging a proxy problem I was probably hitting this bug too).

Regards,

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Upload files from filesystem paths - backwards checkboxes

2011-03-11 Thread Peter Cock
On Thu, Mar 10, 2011 at 5:53 PM, Glen Beane glen.be...@jax.org wrote:

 On Mar 10, 2011, at 12:24 PM, Peter Cock wrote:

 Hi all,

 Via the Galaxy admin interface, I'm currently importing some BAM
 files into a library from the local file system. Since they are big, I
 don't want Galaxy to copy them.

 So I looked at the Copy data into Galaxy? option and made
 sure it wasn't ticked. Galaxy did copy the file :(

 I looked at the interface again, and see you have:

 Copy data into Galaxy?
 [check box] No
 Normally data uploaded with this tool is copied into Galaxy's files
 directory so any later changes to the data will not affect Galaxy.
 However, this may not be desired (especially for large NGS
 datasets), so use of this option will force Galaxy to always
 read the data from its original path.

 You've got a tick box labelled No so it does the opposite to the
 question above it. That is a HORRIBLE interface. I can't be the
 first person to be tricked by it, can I?

 I had a co-worker mess this up too.  I thought this was a poor design
 decision.  I thought maybe yes/no radio options would work.
  Copy data into galaxy? yes(*) no( )

 I would urge you to remove the work No and flip the meaning
 (and default) to match. But that could confuse existing admin
 users used to clicking from previous habit.

 How about making it a select parameter instead, with options
 Copy files (default) and Link to files which is very explicit?

 I think this would be better than the current no check box,  and
 more explicit than my yes(*) no () radio buttons


You may be right, but switching from a checkbox to a select
isn't as easy as it could be... at least not with my familiarity
with the code base.

 Likewise for the Preserve directory structure? setting, I'd
 remove the No caption and flip the setting, or just replace
 the backwards tick box with a select parameter offering
 Preserve directory structure and Place in selected folder
 (or something like that).

 Pretty please? How about if I write and test the patch?


Patch for flipping the  Preserve directory structure? setting,
which I hope you can test then merge/transplant to the trunk:

https://bitbucket.org/peterjc/galaxy-central/changeset/669dcc2803ba

I'm still working on the Copy data into Galaxy? setting...

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Upload files from filesystem paths - backwards checkboxes

2011-03-11 Thread Peter Cock
On Fri, Mar 11, 2011 at 3:04 PM, Greg Von Kuster g...@bx.psu.edu wrote:
 Hello Peter,

 Thanks for the patch - I've applied a corrected version of it in change
 set 5215:74c4dd43485a.  Let  me know if you'd rather have me flip
 the Copy data into Galaxy? feature.  I'd be happy to, but I don't
 want to reinvent the wheel if you're close.

 Thanks!

 Greg Von Kuster

Using util.string_as_bool looks good - I didn't know about that
and it looks far more rhobust.

Would you might doing Copy data into Galaxy? as well? I got
most of the way but got dragged off to a meeting in the middle.

Thanks!

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] History information popup broken on library entries?

2011-03-18 Thread Peter Cock
Hi all,

I've been exploring how to use Galaxy's shared library functionality,
and have noticed that once imported into a working history, that the
little 'i' information icon's black popup window isn't working for these
datasets. All it says is 'Could not find the job for this dataset.'

This appears to affect multiple file types (FASTA and BAM tested)
and both files copied into Galaxy or linked to at import.

I'm using the current latest code from galaxy-central,
https://bitbucket.org/galaxy/galaxy-central/changeset/e39495db6071

This also seems to affect http://test.g2.bx.psu.edu/ and also
http://main.g2.bx.psu.edu/

Can anyone confirm this behaviour? I'm using Firefox on the Mac,
but I doubt that is relevant.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Metadata recursion error with empty tabular file (with patch)

2011-03-18 Thread Peter Cock
On Fri, Mar 18, 2011 at 6:12 PM, Kanwei Li kan...@gmail.com wrote:
 Thanks for reporting, committed fix


https://bitbucket.org/galaxy/galaxy-central/changeset/3070455afbdd

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Alternative bowtie tools

2011-03-29 Thread Peter Cock
On Tue, Mar 29, 2011 at 1:31 AM, Assaf Gordon gor...@cshl.edu wrote:
 Hello all,

 We're developing alternative bowtie tools that more closely suit our
 needs, are we're happy to share (and get comments).

 The main differences are:
 1. separate tools for paired-end and single-end

Sounds sensible to me.

 2. the tools accepts FASTA, FASTQ in both Sanger and Illumina
 format (no more need for grooming). Illumina is the default for
 newly uploaded FASTQ files.

I think that's a bad idea - use Sanger FASTQ as the default to be
consistent with the rest of Galaxy, and also with CASAVA 1.8
Illumina machines will produce that too, see:
http://seqanswers.com/forums/showthread.php?t=8895

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Alternative bowtie tools

2011-03-29 Thread Peter Cock
On Tue, Mar 29, 2011 at 4:46 PM, Assaf Gordon gor...@cshl.edu wrote:
 Hi Dan,

 Daniel Blankenberg wrote, On 03/29/2011 10:55 AM:
 When files are added to Galaxy, the datatype can be directly set to
 any of the fastq variants (e.g. fastqillumina), which removes the
 requirement of grooming (but should only be done when users know what
 they are doing).

 I'm not using the get data tool, we have our own import tools (uploading
 huge files with HTTP is not stable enough for me). You're right that I should
 change the format of this tool from 'fastq' to 'fastqillumina' (but the tool
 pre-dated all those built-in formats in galaxy, so I never bothered to update
 it...).

Why not do the Illumina to Sanger conversion as part of your pipeline
that gets the data into Galaxy (and mark the files as fastqsanger)?
As Glen said, with a C tool that isn't really so slow. That future proofs
you for the pending Illumina CASAVA 1.8 release, and means you don't
need to maintain divergent Bowtie wrappers for Galaxy.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Updating BLAST+ wrapper

2011-03-31 Thread Peter Cock
Hi all,

The NCBI have just released BLAST 2.2.25+ which includes some
interesting new stuff of interest to the tabular output, i.e.

Added support for query and subject length to tabular output

I would therefore like to update my BLAST+ wrappers in Galaxy to add
these two columns to the 'extended tabular' output option. They are going
to be very helpful as you can now calculate things like the percentage
identity (or similarity) compared to the query or sequence.

If there are no objections I propose to add these as columns 23 and 24
(i.e. at the end), so minimise disruption to anyone already using the
tabular output. I would hope to have a branch ready for merging next
week...

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Default value for data_column not working?

2011-04-01 Thread Peter Cock
On Fri, Apr 1, 2011 at 3:55 PM, Kanwei Li kan...@gmail.com wrote:
 Hi Peter,

 I don't see your attempt to use default in the example given. A bit more 
 detail?

 -K

Hi Kanwei,

My example tool is now here:
https://bitbucket.org/peterjc/galaxy-central/changeset/198bf927ca30

As per the original email, I have three parameters which are column
numbers, and I want the defaults to be columns 1, 2 and 12. However,
on my Galaxy installation they all default to the first column.

I have tried giving the default values as 1, 2, and 12, and also as
c1, c2 and c12 - neither works:
https://bitbucket.org/peterjc/galaxy-central/changeset/374143be8e45

If you need more information, let me know.

Thanks,

Peter

P.S. Bug filed here:
https://bitbucket.org/galaxy/galaxy-central/issue/507/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Updating BLAST+ wrapper

2011-04-04 Thread Peter Cock
On Mon, Apr 4, 2011 at 12:19 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 On Fri, Apr 1, 2011 at 12:01 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 On Thu, Mar 31, 2011 at 6:28 PM, Peter Cock p.j.a.c...@googlemail.com 
 wrote:
 Hi all,

 The NCBI have just released BLAST 2.2.25+ which includes some
 interesting new stuff of interest to the tabular output, ...

 One reason why I want to move to BLAST 2.2.25+ on our local
 machine is they fixed the subject IDs in tabular output, ...

 Hi Kanwei,

 Could you transplant or merge this commit please?
 https://bitbucket.org/peterjc/galaxy-central/changeset/ab40f95393ec

 It updates the test output files to work with BLAST 2.2.25+, which I
 will be using locally due to several important bug fixes. As you don't
 offer BLAST+ on the Penn State server I don't expect this to cause
 you any problems.

 Also (and unrelated to this change), do you see this from the test suite?

 $ ./run_functional_tests.sh -id ncbi_blastp_wrapper
 ...
 ==
 ERROR: NCBI BLAST+ blastp ( ncbi_blastp_wrapper )  Test-1
 --
 Traceback (most recent call last):
  File 
 /home/pjcock/repositories/galaxy-central/test/functional/test_toolbox.py,
 line 155, in test_tool
    self.do_it( td )
  File 
 /home/pjcock/repositories/galaxy-central/test/functional/test_toolbox.py,
 line 71, in do_it
    page_inputs =
 self.__expand_grouping(testdef.tool.inputs_by_page[0], all_inputs)
  File 
 /home/pjcock/repositories/galaxy-central/test/functional/test_toolbox.py,
 line 122, in __expand_grouping
    elif isinstance(declared_inputs[value.name], str):
 KeyError: 'blast_type'

Solved, it seems I can't omit any parameters in the test and expect
the defaults to be used. This doesn't seem desirable (new bug?), but the
quick fix is to add a few lines to the test. Could you commit this too:

https://bitbucket.org/peterjc/galaxy-central/changeset/124e4556f3c5

Thanks,

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Updating BLAST+ wrapper

2011-04-04 Thread Peter Cock
On Mon, Apr 4, 2011 at 4:03 PM, Kanwei Li kan...@gmail.com wrote:
 I tried both ways, no dice (you can try yourself with a clean
 galaxy-central base)

I'm a bit puzzled and not an hg expert. Was there an error message?

And for plan (B), what went wrong with using patch and the raw
changes? Surely that should work?
https://bitbucket.org/peterjc/galaxy-central/changeset/ab40f95393ec/raw/galaxy-central-ab40f95393ec.diff
https://bitbucket.org/peterjc/galaxy-central/changeset/124e4556f3c5/raw/galaxy-central-124e4556f3c5.diff

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Updating BLAST+ wrapper

2011-04-04 Thread Peter Cock
On Mon, Apr 4, 2011 at 4:36 PM, Kanwei Li kan...@gmail.com wrote:
 dhcp243253:galaxy-central kanwei$ hg transplant -s
 ~/peter/galaxy-central/ -b blast25
 searching for changes
 changeset:   5585:ab40f95393ec
 branch:      blast25
 parent:      5583:086c9c2c52b9
 user:        peterjc p.j.a.c...@googlemail.com
 date:        Mon Apr 04 12:12:11 2011 +0100
 summary:     Update tests for NCBI BLAST 2.2.25+ which fixed subject
 ID in tabular output

 apply changeset? [ynmpcq?]: y
 changeset:   5586:124e4556f3c5
 branch:      blast25
 user:        peterjc p.j.a.c...@googlemail.com
 date:        Mon Apr 04 14:44:52 2011 +0100
 summary:     Include optional parameters in BLASTP test to avoid KeyError

 apply changeset? [ynmpcq?]: y
 searching for changes
 adding changesets
 adding manifests
 adding file changes
 added 2 changesets with 4 changes to 4 files
 dhcp243253:galaxy-central kanwei$ hg tip
 changeset:   5338:124e4556f3c5
 branch:      blast25
 tag:         tip
 user:        peterjc p.j.a.c...@googlemail.com
 date:        Mon Apr 04 14:44:52 2011 +0100
 summary:     Include optional parameters in BLASTP test to avoid KeyError

The above is strange - why have you got a blast25 branch?
Maybe the transplant isn't working as expected...

 dhcp243253:galaxy-central kanwei$ hg push
 pushing to https://kan...@bitbucket.org/galaxy/galaxy-central/
 searching for changes
 abort: push creates new remote branches: blast25!
 (use 'hg push --new-branch' to create new remote branches)
 dhcp243253:galaxy-central kanwei$ hg merge blast25
 abort: merging with a working directory ancestor has no effect

It looks like you are on the blast25 branch, try checking out
default then merging the new blast25 branch. It should be
a fast-forward merge (just the two commits). Then you
can push default back to galaxy-central on bitbucket. I think.


 Importing the diffs gives merge error


Are you using a hg command here?

What I meant for the Plan B was just using the Unix command
patch to apply my changes by hand to the default branch.
Then commit it under your name. It won't be associated with my
bitbucket account, but it should just work.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Ignoring white space differences in test output

2011-04-05 Thread Peter Cock
On Tue, Apr 5, 2011 at 10:58 AM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi all,

 I'd like to write a tool test where the sample output and the
 generated output are the same except for white space.
 i.e. running diff with the -w switch reports no differences.

 Is this possible at the moment with the Galaxy test framework?
 I'm hoping for a test output attribute. I don't see anything
 like this mentioned here:
 https://bitbucket.org/galaxy/galaxy-central/wiki/WritingTests

 Thanks,

 Peter

I've filed an enhancement request for this,
https://bitbucket.org/galaxy/galaxy-central/issue/510

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Updating BLAST+ wrapper

2011-04-05 Thread Peter Cock
On Mon, Apr 4, 2011 at 6:07 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 On Mon, Apr 4, 2011 at 5:55 PM, Kanwei Li kan...@gmail.com wrote:
 Done the manual way


 Thanks.

 I was hoping to have finished this today, but it turns out there are
 some subtle but annoying changes to how BLAST records the hit
 id/name/description in their XML file which make the script to
 convert from XML to tabular unhappy.


Hi Kanwei,

I think I'm nearly done now, but I am still working on the blast25
branch you had trouble merging/transplanting from. I guess I will
need to make a new clean branch from the galaxy tip for submission.

Do you prefer several small atomic commits (which I understand
is the recommend practice in git), or a few large commits (which
seems quite common on the Galaxy hg history)?

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Ratings on Tool Shed broken?

2011-04-05 Thread Peter Cock
Hi all,

I had wondered why none of the tools on the Galaxy Tool Shed had a
rating, and had assumed it was just that so far no-one had bothered. I
have a new theory, it doesn't work?

I just viewed Brad's BAM to BigWig tool, clicked on Tool Actions
then Rate Tool, and got:

Server Error
An error occurred. See the error logs for more information. (Turn
debug on to display exception reports here)

Can anyone reproduce this? I've tried a couple of other tools and the
same thing happens.

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Relative file path in Galaxy

2011-04-06 Thread Peter Cock
On Wed, Apr 6, 2011 at 12:44 AM, Zhe Chen z...@lanl.gov wrote:
 Hi,

 My tool has a script reads a file in the same directory as the script.
 When I try to use relative path to read the file, it works by directly run
 it, but does not work when calling from Galaxy. Absolute path will work,
 but I want to know is there a way to do it using relative path.

 Can you give me some suggestion?



 myTool.pl use the following code the read the file in the same directory

 my $genusfiles = genus.txt;
 open (GENUSGP, $genusfiles)  or die $genusfiles: $! \n;
 close(GENUSGP);


 Thanks

I would expect that you can look at the first entry in argv which
should give you the path to the script being run, myTool.pl, then
use that to construct the path to genus.txt

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] another update problem

2011-04-11 Thread Peter Cock
On Mon, Apr 11, 2011 at 7:26 PM, Ryan Golhar golha...@umdnj.edu wrote:
 I just updated my local galaxy instance using hg pull, and the history items
 are no longer expandable.  What happened?

Which browser(s) are affected? I recall reporting some issues
with Internet Explorer only and the history peep view not working.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Getting tool dirpath in a Python code file

2011-04-14 Thread Peter Cock
On Thu, Apr 14, 2011 at 2:08 PM, Leandro Hermida
soft...@leandrohermida.com wrote:
 Hi,

 I have a tool with a code file=my_script.py/ tag and in that code file
 I'm trying to get the tool dirpath where that script and the tool XML
 exist.  I've tried:

 os.path.abspath(os.path.dirname(sys.argv[0]))
 os.path.abspath(os.path.dirname(__file__))

 And both don't work as expected. Is there a galaxy class I could import
 which will have the tool directory path?

 regards,
 Leandro

For standard Python tools in Galaxy, I'm using os.path.split(sys.argv[0])[0]
to get the path, which on reflection probably should be written as
os.path.dirname(sys.argv[0]) as you suggest.

What do __file__ and sys.argv[0] give you? The simplest way to debug
this is to add a print statement, since Galaxy will show the stdout.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] NLStradamus wrapper (for nuclear localization signals)

2011-04-14 Thread Peter Cock
Hi all,

I just wanted to mention (to avoid duplicate work) that I have a working
Galaxy wrapper for NLStradamus v1.6, a tool for prediction of nuclear
localization signals (NLSs) from a protein FASTA file. The wrapper
does a little reformatting of the output to get a clean tabular file for use
in Galaxy.

I would release this to the Galaxy Tool Shed now, but the tool's author
Alex Nguyen Ba tells me he hopes to have an update released fairly
soon which should include a native tabular output - probably with a
richer column structure. So rather than having to change the tool
output and potentially break workflows, I plan to wait till then.

However, if anyone wants the wrapper before then, drop me an email.

Regards,

Peter

References:

A. N. Nguyen Ba, A. Pogoutse, N. Provart, A. M. Moses. NLStradamus:
a simple Hidden Markov Model for nuclear localization signal prediction.
BMC Bioinformatics. 2009 Jun 29;10(1):202.

http://www.moseslab.csb.utoronto.ca/NLStradamus
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Getting tool dirpath in a Python code file

2011-04-15 Thread Peter Cock
On Thu, Apr 14, 2011 at 2:56 PM, Leandro Hermida
soft...@leandrohermida.com wrote:
 On Thu, Apr 14, 2011 at 3:17 PM, Peter Cock p.j.a.c...@googlemail.com
 wrote:

 For standard Python tools in Galaxy, I'm using
 os.path.split(sys.argv[0])[0]
 to get the path, which on reflection probably should be written as
 os.path.dirname(sys.argv[0]) as you suggest.

 What do __file__ and sys.argv[0] give you? The simplest way to debug
 this is to add a print statement, since Galaxy will show the stdout.


 Hi Peter,

 __file__ throws an error: global name '__file__' is not defined

I guess the script is being loaded as a string, and run with eval(...)
or something like that. It would also explain why sys.argv[0] would
be one of the Galaxy script files.

 os.path.abspath(os.path.dirname(sys.argv[0])) gives me
 /path/to/galaxy/scripts directory which is two levels up from what the tool
 directory I want for example /path/to/galaxy/tools/mytool

So combine that with ../tools/mytool/ and you're done? OK, you have
to know the name of the folder your tool *should* be in... so not a
perfect solution.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Setting min/max limit on repeat tag in tool XML file

2011-04-19 Thread Peter Cock
Hi all,

I'm working on a tool where the repeat tag looks like an elegant
solution - however I want to limit the number of repeats to be 1, 2 or 3.
i.e. I want to set min=1 and max=3 attributes for the repeat,
based on analogy to the param tag.

Is this possible, but not documented here?
https://bitbucket.org/galaxy/galaxy-central/wiki/ToolConfigSyntax

In terms of the GUI, I'd like the page to preload with as many
entries as the min setting (zero by default), and disable the
add button once the max setting is reach (default no limit).

(Alternatively you could validate the max/min when the form is
submitted by clicking the execute button, but that seems less
user friendly.)

If this isn't already possible, would it be generally useful to add this?

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Setting min/max limit on repeat tag in tool XML file

2011-04-19 Thread Peter Cock
On Tue, Apr 19, 2011 at 1:46 PM, Daniel Blankenberg d...@bx.psu.edu wrote:
 Hi Peter,

 Yes, setting min and max values in the repeat tag should work as you need.
 As you point out, it is probably not well documented, so please let us know
 if it isn't working as you imagine it should.

 Thanks for using Galaxy,

 Dan

Hi Dan,

Excellent - that does pretty much what I was expecting. Thanks for updating
the wiki page.

It would be a nice refinement to hide the Remove buttons when at the
minimum, and hide the Add button when at the maximum - but not essential.

What I need to work out now is how to use repeat parameters in the unit
tests. There seems to be examples in tools/mutation/visualize.xml and
tools/plotting/xy_plot.xml which only use the first repeat...

Also, can I access the repeat number for default values - e.g. I'd
like to have a caption parameter defaulting to Set 1, Set 2 etc.

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] BLAST+ HTML output broken

2011-04-19 Thread Peter Cock
Hi Kanwei,

I noticed that the BLAST+ output options for HTML are not working,
a side effect of my recent refactoring. Could you transplant/merge this
fix and test please:

https://bitbucket.org/peterjc/galaxy-central/changeset/e1e6f92e07bf
https://bitbucket.org/peterjc/galaxy-central/changeset/c60551f7d13c

These are the only commits on this new branch from the default:

https://bitbucket.org/peterjc/galaxy-central/src/blast_2011_04_19

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Plotting tools - best output format? PDF, PNG, ...

2011-04-20 Thread Peter Cock
Hi all,

I'm working on a simple plotting tool, and currently I am using PDF
output since this is vector based (so it can be rescaled without loss
of quality) and can be edited easily enough for use in publications.

However, at least on my OS/Browser, Galaxy does not show the
PDF files in situ, the eye button acts like a download link. Is this
a bug?

If I switch to making a PNG file, then clicking on the eye button
shows the image directly in the center panel (nice), but PNG is
not a great format for detailed figures or printing.

Looking over the tools which come with Galaxy under
tools/plotting/*.xml all use PDF or PNG.

How about SVG? Don't most of the mainline browsers have
reasonably good SVG support built in (possibly via a plugin)?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Plotting tools - best output format? PDF, PNG, ...

2011-04-20 Thread Peter Cock
On Wed, Apr 20, 2011 at 10:07 AM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi all,

 I'm working on a simple plotting tool, and currently I am using PDF
 output since this is vector based (so it can be rescaled without loss
 of quality) and can be edited easily enough for use in publications.

 However, at least on my OS/Browser, Galaxy does not show the
 PDF files in situ, the eye button acts like a download link. Is this
 a bug?

Looks like a Firefox issue/setting - false alarm:

On our local Galaxy, with a PDF entry in the history, and as a control
http://nar.oxfordjournals.org/content/39/7/2492.full.pdf+html
which should show a PDF embedded on the left:

Windows XP, Internet Explorer 7.05730.13
- NAR shows PDF on left, good
- Clicking on the eye, shows the PDF in the central panel, good.
- Clicking on the disk, saves the PDF, good.

Windows XP, Chrome 10.0.648.204
- NAR shows PDF on left, good
- Clicking on the eye, shows the PDF in the central panel, good.
- Clicking on the disk, saves the PDF, good.

Windows XP, Firefox 3.6.13
- NAR shows PDF on left, good
- Clicking on the eye, shows the PDF in the central panel, good.
- Clicking on the disk, saves the PDF, good.

Linux CentOS, Firefox 3.6.13
- NAR downloads PDF, shows left pane empty, usable.
- Clicking on the eye, downloads the PDF, usable.
- Clicking on the disk, saves the PDF, good.

Mac OS X 10.6, Firefox 4.0
- NAR downloads PDF, shows left pane empty, usable.
- Clicking on the eye, downloads the PDF, usable.
- Clicking on the disk, saves the PDF, good.

Mac OS X 10.6, Safari 5.0.5
- NAR shows PDF on left, good
- Clicking on the eye, shows the PDF in the central panel, good.
- Clicking on the disk, saves the PDF, good.

So it looks like Firefox can (probably dependent on the user's
settings) download PDF files rather than show them in situ.
That's fine, and not specific to Galaxy.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Plotting tools - best output format? PDF, PNG, ... SVG!

2011-04-20 Thread Peter Cock
On Wed, Apr 20, 2011 at 4:30 PM, Pieter Neerincx
pieter.neeri...@gmail.com wrote:
 Hi Peter,

 On Apr 20, 2011, at 11:07 AM, Peter Cock wrote:

 Hi all,

 ...cut...

 How about SVG? Don't most of the mainline browsers have
 reasonably good SVG support built in (possibly via a plugin)?

 Yep, SVG support is pretty good nowadays especially in FireFox
 and Opera. Safari, IE and Chrome have only partial support, but
 for simple charts without 3D filters it works fine. I've added SVG
 as a datatype to my Galaxy and this works great. You won't need
 a plugin; in fact the old Adobe SVG plugin is depreciated already
 for a few years now and no longer compatible with recent browser
 versions.

I looked at SVG about a year ago, and was pretty impressed.
I did run into some issues, particularly with links, and opted to
use PNG files in the end. Since then we've finally dropped IE6,
so hopefully SVG would be safe now.

Are your changes to add SVG as a datatype to Galaxy public?
I'd like to suggest that be merged to the trunk.

 The nice benefit of SVG in addition to not needing a plugin is
 that the image can scale with the surrounding text if you zoom
 in your browser (without loosing resolution).

 The disadvantage is that although my users know what a PDF
 file is and can process them for posters/manuscripts, most of
 them never heard of SVG and get stuck when it doesn't import
 into PowerPoint et al. :(

Do they cope with PDF?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Plotting tools - best output format? PDF, PNG, ... SVG!

2011-04-21 Thread Peter Cock
On Thu, Apr 21, 2011 at 10:50 AM, Pieter Neerincx
pieter.neeri...@gmail.com wrote:
 Hi Peter,

 On Apr 20, 2011, at 5:47 PM, Peter Cock wrote:

 On Wed, Apr 20, 2011 at 4:30 PM, Pieter Neerincx
 pieter.neeri...@gmail.com wrote:
 Hi Peter,

 On Apr 20, 2011, at 11:07 AM, Peter Cock wrote:

 Hi all,

 ...cut...

 How about SVG? Don't most of the mainline browsers have
 reasonably good SVG support built in (possibly via a plugin)?

 Yep, SVG support is pretty good nowadays especially in FireFox
 and Opera. Safari, IE and Chrome have only partial support, but
 for simple charts without 3D filters it works fine. I've added SVG
 as a datatype to my Galaxy and this works great. You won't need
 a plugin; in fact the old Adobe SVG plugin is depreciated already
 for a few years now and no longer compatible with recent browser
 versions.

 I looked at SVG about a year ago, and was pretty impressed.
 I did run into some issues, particularly with links, and opted to
 use PNG files in the end. Since then we've finally dropped IE6,
 so hopefully SVG would be safe now.

 Are your changes to add SVG as a datatype to Galaxy public?
 I'd like to suggest that be merged to the trunk.

 I simply added this line to my datatypes_conf.xml:

        datatype extension=svg type=galaxy.datatypes.images:Image 
 mimetype=image/svg+xml/

 I didn't make a sniffer as I don't have a use case for users
 uploading SVGs, so I didn't have to create a Python class and
 SVG is just another instance of galaxy.datatypes.images:Image
 from a plain vanilla Galaxy.

That makes sense if the sniffer is only used on upload.

 To make sure the client displays the SVG you may have to add
 the SVG mimetype to your web server's config too. Where this
 is defined may differ per linux distro. I use apache as a proxy
 and had to add

        image/svg+xml           svg svgz

 to /etc/mime.types.

Thanks for the details - it looks simple enough to add here too.
If/when any SVG producing tools are added to Galaxy or the
Galaxy Tool Shed, then it would be nice to have this done in
the main repository.

 The nice benefit of SVG in addition to not needing a plugin is
 that the image can scale with the surrounding text if you zoom
 in your browser (without loosing resolution).

 The disadvantage is that although my users know what a PDF
 file is and can process them for posters/manuscripts, most of
 them never heard of SVG and get stuck when it doesn't import
 into PowerPoint et al. :(

 Do they cope with PDF?

 Well, they know how to view and print one :). Importing them
 directly into PowerPoint and Word is just as problematic though,
 so that usually involves opening the PDF in a different program
 and copy/paste or exporting to another format. Still a hassle, but
 they manage...

Getting a PDF into PowerPoint or Word is easy on a Mac ;)
But yes, point taken - even though a PDF is easier to work with
than an SVG (or a postscript file), it can still be a hassle.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Galaxy tools' priority level (nice)

2011-04-21 Thread Peter Cock
Hi all,

Our Galaxy is running on a shared server which will sometimes be running
other computationally demanding jobs (outside of Galaxy). In some cases
I'd like these to have priority, perhaps by having Galaxy run the tool child
processes at a nice level of 10 (say).

Is there any built in way to control the Unix priority level (e.g.
nice or ionice)
used to run tasks? I don't see anything on here, but perhaps I'm looking
in the wrong place:
https://bitbucket.org/galaxy/galaxy-central/wiki/Config/WebApplicationScaling

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Galaxy tools' priority level (nice)

2011-04-21 Thread Peter Cock
On Thu, Apr 21, 2011 at 5:43 PM, Assaf Gordon gor...@cshl.edu wrote:
 Hi Peter,

 Peter Cock wrote, On 04/21/2011 11:00 AM:
 Is there any built in way to control the Unix priority level (e.g.
 nice or ionice) used to run tasks? I don't see anything on here,
 but perhaps I'm looking in the wrong place:
 https://bitbucket.org/galaxy/galaxy-central/wiki/Config/WebApplicationScaling


 I've been experimenting with 'nice' as well (for other servers), and it
 seems the consensus is that 'nice' is broken.

 Linus puts it in a colorful way here: https://lwn.net/Articles/418739/
 [...] Seriously. Nobody _ever_ does nice make, unless they are seriously
 repressed beta-males (eg MIS people who get shouted at when they do
 system maintenance unless they hide in dark corners and don't get
 discovered). It just doesn't happen.

I know nice isn't perfect, but in the case of the sys admin setting
up Galaxy, we don't have the human laziness problem to overcome:
We could ensure all job tasks get run with nice 10 automatically,
without the Galaxy users having to do anything special.

 One recommended solution is to use cgroups, which can control
 CPU, Disk I/O, Memory usage and network load, as explained here:
 http://broadcast.oreilly.com/2009/06/manage-your-performance-with-cgroups-and-projects.html

 With cgroups there will be no need to change anything in galaxy
 (just apply a cgroup to the galaxy user, or something similar).

Yes, but I want to treat the Galaxy webserver differently from
the compute jobs it launches.

 Unfortunately, cgroups requires a recent kernel, so it might not
 be applicable in your case.

 -gordon

Are you using cgroups on your Galaxy?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Could anyone body tell me why our C program do not work in galaxy

2011-04-22 Thread Peter Cock
2011/4/22 liyanhui607 liyanhui...@163.com:
 Dear Sir,
   We have a program writen in C lanuage. It can run nomally in linux
 environment.
    We plan to add it to galaxy, but it did not work.
    The C file after complie named shortest-path
  The shortest-path.xml is as follow:
   Could anybody tell me what is wrong ? why it does not work?
    Thank you very much!

 Best Wishes!

 Yan-Hui Li

What error message do you get?

Did you try the two changes I suggested (the same things Dan just
suggested as well): https://bitbucket.org/galaxy/galaxy-central/issue/525

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Could anyone body tell me why our C program do not work in galaxy

2011-04-25 Thread Peter Cock
2011/4/25 liyanhui607 liyanhui...@163.com:
 Thank you  very much!
 After I have added it to the path, it can work now.
 Best Wishes!
 Yan-Hui Li

Oh good.

Peter

P.S. Please keep the mailing list on replies.

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Staged Method for cluster running SGE?

2011-04-26 Thread Peter Cock
Hi all,

So far we've been running our local Galaxy instance on
a single machine, but I would like to be able to offload
(some) jobs onto our local SGE cluster. I've been reading
https://bitbucket.org/galaxy/galaxy-central/wiki/Config/Cluster

Unfortunately in our setup the SGE cluster head node is
a different machine to the Galaxy server, and they do not
(currently) have a shared file system. Once on the cluster,
the head node and the compute nodes do have a shared
file system.

Therefore we will need some way of copying input data
from the Galaxy server to the cluster, running the job,
and once the job is done, copying the results back to the
Galaxy server.

The Staged Method on the wiki sounds relevant, but
appears to be for TORQUE only (via pbs_python), not
any of the other back ends (via DRMAA).

Have I overlooked anything on the Cluster wiki page?

Has anyone attempted anything similar, and could you
offer any guidance or tips?

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Staged Method for cluster running SGE?

2011-04-26 Thread Peter Cock
On Tue, Apr 26, 2011 at 12:10 PM, Sean Davis sdav...@mail.nih.gov wrote:
 On Tue, Apr 26, 2011 at 5:11 AM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi all,

 So far we've been running our local Galaxy instance on
 a single machine, but I would like to be able to offload
 (some) jobs onto our local SGE cluster. I've been reading
 https://bitbucket.org/galaxy/galaxy-central/wiki/Config/Cluster

 Unfortunately in our setup the SGE cluster head node is
 a different machine to the Galaxy server, and they do not
 (currently) have a shared file system. Once on the cluster,
 the head node and the compute nodes do have a shared
 file system.

 Therefore we will need some way of copying input data
 from the Galaxy server to the cluster, running the job,
 and once the job is done, copying the results back to the
 Galaxy server.

 The Staged Method on the wiki sounds relevant, but
 appears to be for TORQUE only (via pbs_python), not
 any of the other back ends (via DRMAA).

 Have I overlooked anything on the Cluster wiki page?

 Has anyone attempted anything similar, and could you
 offer any guidance or tips?

 Hi, Peter.

 You might consider setting up a separate queue for SGE jobs. Then,
 you could specify a prolog and epilog script that will copy files from
 the galaxy machine into the cluster (in the prolog) and back to galaxy
 (in the epilog).

I see - the prolog and epilog scripts are per SGE queue, so
in order to have Galaxy specific scripts, the simplest solution
is a Galaxy specific queue.

 This assumes that there is a way to map from one
 file system to the other, but for Galaxy, that is probably the case
 (galaxy files on the galaxy server are under the galaxy instance and
 galaxy files on the cluster will probably be run as a single user in
 that home directory).

We have a user account galaxy on the Galaxy Server, and it would
make sense to have a matching user account galaxy on the Cluster
which would submit the SGE jobs and own their data files.

 I have not done this myself, but the advantage
 to using prolog and epilog scripts is that galaxy jobs then do not
 need any special configuration--all the work is done transparently by
 SGE.

 Sean

Thanks for the pointers - I have some reading ahead of me...

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Reciprocal Best Hits (RBH) from BLAST tabular output

2011-05-04 Thread Peter Cock
Hi all,

I mentioned just over a month ago that I had written a Galaxy tool
to do find Reciprocal Best Hits (RBH) from BLAST tabular output
or similar (e.g. Bill Pearson's FASTA tabular output).

http://lists.bx.psu.edu/pipermail/galaxy-dev/2011-March/004799.html
http://lists.bx.psu.edu/pipermail/galaxy-dev/2011-April/004825.html
http://lists.bx.psu.edu/pipermail/galaxy-dev/2011-April/004829.html

The only problem I had was setting default columns, which I
consider to be a minor bug in Galaxy:

https://bitbucket.org/galaxy/galaxy-central/issue/507

Regardless of this small usability issue, would the Galaxy team
like to add this tool to the main distribution (to sit along with my
BLAST+ wrappers), or should I submit it to the tool shed?

If you want to merge it, the following change set should suffice:
https://bitbucket.org/peterjc/galaxy-central/changeset/198bf927ca30

I made a subsequent change set to test giving column defaults
as value=c2 rather than value=c2 which makes no difference,
but if this is how default column values are meant to be given,
take this change too:
https://bitbucket.org/peterjc/galaxy-central/changeset/198bf927ca30

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] quick question: how can i supply the user's email address to a tool?

2011-05-06 Thread Peter Cock
On Fri, May 6, 2011 at 8:18 AM, Hans-Rudolf Hotz h...@fmi.ch wrote:
 Hi Ed

 You can use the variable $userEmail in the 'command' tag of the tool
 definition file

 Regards, Hans

That should probably be listed on the wiki,
https://bitbucket.org/galaxy/galaxy-central/wiki/ToolConfigSyntax

Are there any other similar variables worth knowing about?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] options from file

2011-05-06 Thread Peter Cock
On Fri, May 6, 2011 at 10:17 AM, SHAUN WEBB swe...@staffmail.ed.ac.uk wrote:

 Hi, I have created a tool that will fetch sequences for selected IDs from a
 tabular file containing multiple IDs and additional info.

 I want the tool config to scan the first column of the tab file for IDs and
 provide the user with a selection box where they can select a single ID or
 multiple IDs and get output for all selected.

 The following method does this:


 param name=tabfile type=data format=tabular label=ID File/
 param name=selection type=select multiple=true accept_default=true
 label=ID 
              options from_dataset=tabfile
                              column name=name index=0/
                              column name=value index=0/
              /options
  /param



 The issue is, if the top file in my history is a SAM file containing ~30,000
 IDs in the first column the tool initially attempts to load these all in to
 the selection box and effectively crashes my local instance.

 I only want to use this on tab files that ill have ~100 IDs at most.

Maybe the options from_dataset=tabfile tag could have a max
setting? e.g. options from_dataset=tabfile max=100 could
load just the first 100 entries in the tabular file. That seems much
more general than the new filetype idea.

It could have a default max value which should useful.

Thinking of the example of a tabular file of gene IDs for an organism,
you might well want 20 to 30 thousand entries. Since Galaxy puts
a search function on the selection, the UI should be OK. Its just the
performance we need to worry about.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Megablast GIs

2011-05-07 Thread Peter Cock
On Saturday, May 7, 2011,  drho...@uark.edu wrote:
 Thanks Peter,  I debated which to post to.
 This is just using the Megablast tool in Galaxy.  My IT people are not 
 Biologists.  They had
 this working fine and some tweak has screwed up the GI column in the results 
 file.  It should
 be the same NR and HTGS datafiles that we had before.  So you recommend that 
 I should
 just have them reinstall the databases and make sure to use the correct 
 switches?


You don't know what tweak broke things? You've got at least three factors,

(1) Galaxy and the wrapper, but I don't see how an update of these
would have affected the IDs.

(2) The BLAST command line tool (blast all in this case), perhaps it
was updated and this has altered the output?

(3) The BLAST database(s) were updated (or rebuilt if you have local
databases, in which case it could have been done with a newer version
of the formatdb tool). If you are using NR does you have it regularly
updated from the NCBI FTP site?

Hopefully your admin team will have some sort of log of system updates
that might help work out what happened.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Galaxy administrators - admin_users setting

2011-05-09 Thread Peter Cock
Hi all,

To make someone an administrator on a local Galaxy install, do I just
need to add their email (login) to the comma separated setting
admin_users in universe_wsgi.ini?
https://bitbucket.org/galaxy/galaxy-central/wiki/Admin/AdminInterface

I have this working on one server, but it doesn't seem to have any
effect on a second server. Both are now running the current release
(changeset 50e249442c5a).

Is there some other setting needed to enable the admin interface?

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Galaxy administrators - admin_users setting

2011-05-09 Thread Peter Cock
On Mon, May 9, 2011 at 12:48 PM, Hans-Rudolf Hotz h...@fmi.ch wrote:
 Hi Peter

 First,the obvious question: Have you restarted galaxy?

Yes, and I tried hard reloads in two different browsers (both of which
are working on the primary server).

 Second: I vaguely remember a similar problem, and if I remember correctly,
 the problem we had was the space between the comma and the next e-mail
 address. Hence, make sure you write the line like this:

 admin_users = f...@bar.com,f...@bar.com,f...@bar.com

 (ie: without any blank spaces)

 I hope this helps, Hans

I'll double check that.

As an aside, it would be interesting to check if the comparison is done
case insensitively or not (emails and domain names are case insensitive).

Fingers crossed... thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] param type=float optional=true doesn't work

2011-05-12 Thread Peter Cock
On Thu, May 12, 2011 at 6:03 PM, Leandro Hermida wrote:
 Hi, how do you make float parameter types fully optional? optional=true
 doesn't work.

 -Leandro

Sadly this is a known bug I reported some time ago:
https://bitbucket.org/galaxy/galaxy-central/issue/403

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Reload .loc files without restarting Galaxy system?

2011-05-13 Thread Peter Cock
On Wed, Apr 6, 2011 at 10:30 PM, Luobin Yang yangl...@isu.edu wrote:
 Hi,
 The reload a tool's configuration menu avoids restarting Galaxy system
 when a tool's web interface changed, but it doesn't seem to reload the
 .loc files that a tool's XML file uses. I am wondering if it's possible to
 reload the .loc file also?
 Thanks,
 Luobin

I'd like to be able to do this too. I filed an enhancement bug:

https://bitbucket.org/galaxy/galaxy-central/issue/538

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Wrapper generator.

2011-05-13 Thread Peter Cock
On Fri, May 13, 2011 at 2:12 PM, J. F. J. Laros j.f.j.la...@lumc.nl wrote:
 Hi Pieter,

 Thanks for the reply, but from the looks of it, this is a way to incorporate
 webservice clients into galaxy. What I want is slightly different.

 I'm talking about adding a normal command line tool (an aligner for example)
 but I'm not willing to write a wrapper each time (the wrapper for BWA for
 example is 327 lines). If a computer readable description of the interface
 exists, this description can be used to generate a python wrapper. I mentioned
 WSDL because this also gives a description of an interface, but any equivalent
 description would of course be fine.

 Ideally, such a description is maintained by the developer of the tool in
 question.

 With kind regards,
 Jeroen.

How many tools can you name that come with such a machine readable
file describing their command line interface?

I can think of only one - the EMBOSS tool suite and their ACD files
(used internally by EMBOSS to generate the command line help,
documentation, and do the command line option parsing).

Unfortunately I don't think there is an easy answer - even if wrapping
a tool where all the input and output formats are already supported in
Galaxy.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Select first/last N rows from grouped tabular files (e.g. top BLAST hits)

2011-05-17 Thread Peter Cock
Hi all,

I'm wondering if the following task can be done in Galaxy with the
standard tools. The specific example is selecting the top (e.g. 3)
match sequences for each blast query, but I see this problem as much
more general than a  Select top BLAST hits tool.

I want to select the first few (e.g. 1) rows of each group in a
tabular file, where the group criteria is having certain columns equal
(e.g. the first 2).

e.g. Tabular BLAST output has columns of query ID, match ID, etc.

queryA match1 ...
queryA match2 ...
queryA match2 ...
queryA match3 ...
queryA match4 ...
queryA match4 ...
queryA match4 ...
queryB match5 ...
queryB match5 ...
queryC match6 ...
queryC match7 ...

In this example, some of my queries have more than one HSP per match
(more than one line with the same first two columns). If I group on
the first two columns, the groups are:


queryA match1 ...

queryA match2 ...
queryA match2 ...

queryA match3 ...

queryA match4 ...
queryA match4 ...
queryA match4 ...

queryB match5 ...
queryB match5 ...

queryC match6 ...

queryC match7 ...


If I then take the first row in each group, that gives me just the
first HSP for each query+match combination.

queryA match1 ...
queryA match2 ...
queryA match3 ...
queryA match4 ...
queryB match5 ...
queryC match6 ...
queryC match7 ...

If for example I wanted only the top 3 matches for each query, I could
repeat the proposed tool one more time but with different settings -
this time grouping on the first column only:

queryA match1 ...
queryA match2 ...
queryA match3 ...
queryB match5 ...
queryC match6 ...
queryC match7 ...

I hope I've conveyed the idea here. The existing tools Select first
lines from a dataset and Select last lines from a dataset are
related, but do this at the file level.

Does this make sense? Does it seem like a useful tool to write if
there isn't anything like this already present? Or might it be simpler
to just write a Select top BLAST hits tool?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Published Workflows menu location

2011-05-18 Thread Peter Cock
 Hi all,

I recently started trying out the Share workflow functionality on
our local Galaxy, so that I could make available some general interest
workflows to all our users, rather than sharing them with individuals
one by one. However, once published, it was not easy to find them! I
eventually found them under the Shared Data drop down menu entry
Published Workflows (yet Shared workflows are not on this menu).
The captions Shared and Published seem to be used inconsistently.

Maybe the top level Shared Data menu should be renamed to Shared
Resources or Published Resources?

To me, workflows are not data, and so should not be under Shared
Data. Rather, I expected them to be under the Workflow menu. Is the
separation because in order to run or edit a  Published Workflows,
you must first import it?

Perhaps a simple solution would be a note and link at the bottom of
the main Workflow page directing people to the  Published
Workflows page under the Shared Data drop down menu.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Reciprocal Best Hits (RBH) from BLAST tabular output

2011-05-18 Thread Peter Cock
On Wed, May 18, 2011 at 3:54 AM, Kanwei Li kan...@gmail.com wrote:
 Hi Peter,
 I think the tool shed would be appropriate for this...
 -K

OK - I'll do that.

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Select first/last N rows from grouped tabular files (e.g. top BLAST hits)

2011-05-18 Thread Peter Cock
On Tue, May 17, 2011 at 5:30 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi all,

 I'm wondering if the following task can be done in Galaxy with the
 standard tools. The specific example is selecting the top (e.g. 3)
 match sequences for each blast query, but I see this problem as much
 more general than a  Select top BLAST hits tool.

 ...

 Does this make sense? Does it seem like a useful tool to write if
 there isn't anything like this already present? Or might it be simpler
 to just write a Select top BLAST hits tool?

While I still think the above task could be useful in general, I am
now considering a general BLAST filter tool to offer this and some
other commonly used filters like a minimum coverage threshold
(which is possible with a filter on the extended tabular output, but
not trivial).

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Select first/last N rows from grouped tabular files (e.g. top BLAST hits)

2011-05-19 Thread Peter Cock
On Thu, May 19, 2011 at 7:33 PM, madduri gal...@ci.uchicago.edu wrote:
 I wonder if somebody can give me more context around this issue..


On 3rd May I emailed IBX about their Galaxy install and one of
the (in house) tools mentioned on the workflow image here:
https://ibi.uchicago.edu/resources/galaxy/index.html

I recognised the NCBI BLAST+ tools but the Filter Top Blast
Results tool was new to me, and asked what it did and if it or
any the other IBX tools would be available at the Galaxy Tool Shed:
http://community.g2.bx.psu.edu/

I had a reply from Alex Rodriguez (iBi/CI University of Chicago)
that they haven't put any of the wrappers on the Galaxy tool
shed yet as they are still being worked on. The IBI system
assigned the number [Galaxy #13918].

This thread Select first/last N rows from grouped tabular
files (e.g. top BLAST hits) could have similarities to the
IBI Filter Top Blast Results tool, so I forwarded the email
to the IBI galaxy email address to encourage you (e.g. Alex)
to comment on the thread. The IBI system assigned the
number [Galaxy #14246].

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Tools: dynamic check of file type?

2011-05-20 Thread Peter Cock
2011/5/20 Louise-Amélie Schmitt louise-amelie.schm...@embl.de:
 Well I'm still missing something...

 Is there a way to use the conditional tag sets in the tool's xml with it? To
 suppress the unnecessary options for a given file type? or more than one?


It sounds like you want to use the conditional and param tags,
but rather than responding to another parameter directly (e.g. a select),
you want to look at an input file parameter's file type?

I don't know if that is possible, but I can thing of a couple of examples
where I could use that functionality.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] egg install failure: docutils, GeneTrack

2011-05-23 Thread Peter Cock
On Mon, May 16, 2011 at 8:59 PM, Spunde, Alex aspu...@illumina.com wrote:
 Same thing:

 [aspunde@ilmnhw-biolin10 galaxy-dist]$ python ./scripts/fetch_eggs.py
 Fetched http://eggs.g2.bx.psu.edu/Babel/Babel-0.9.4-py2.6.egg
 Fetched http://eggs.g2.bx.psu.edu/docutils/docutils-0.7-py2.6.egg
 Fetched
 http://eggs.g2.bx.psu.edu/pysqlite/pysqlite-2.5.6_3.6.17_static-py2.6-linux-x86_64-ucs2.egg
 Fetched
 http://eggs.g2.bx.psu.edu/GeneTrack/GeneTrack-2.0.0_beta_1_dev_48da9e998f0caf01c5be731e926f4b0481f658f0-py2.6.egg
 One or more of the python eggs necessary to run Galaxy couldn't be
 downloaded automatically.  You can try building them by hand (all
 at once) with:
   python scripts/scramble.py
 Or individually:
   python scripts/scramble.py Babel
   python scripts/scramble.py docutils
   python scripts/scramble.py GeneTrack

I had a similar issue which was down to the file permissions - sudo fixed it,
and I redid the chown afterwards.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Filter data and cut column bugs

2011-05-23 Thread Peter Cock
On Mon, May 23, 2011 at 3:09 PM, Anton Nekrutenko an...@bx.psu.edu wrote:
 Dear Peter:

 Yes, that would help.

One possibility would be to have all new bugs CC'd to the dev
mailing list? Not sure if everyone here would like that or not...

 But, you patches are definitely not unnoticed. We'll apply them
 (likely at the conference) and we are also in the process of
 devising a strategy for doing so quickly and consistently.

That sounds good :)


 See you on Tuesday.

 a.

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Display column headers in data_column parameter

2011-05-24 Thread Peter Cock
On Tue, May 24, 2011 at 3:12 PM, Johannes Eichner
johannes.eich...@uni-tuebingen.de wrote:
 Dear Galaxy-Dev Team,

 is it possible to display the headers of the columns which can be
 selected in a data_column parameter? Currently, only the column index
 (e.g., c1, c2, etc.) is displayed in Galaxy. If it would be possible to
 additionally display the column headers, the selection of the desired
 columns would be easier for the user. Is there a way to do this using
 the current version of Galaxy?

 Kind regards,
 Johannes

I completely agree that in general the UI for column parameter isn't
as user friendly as it could be. The filter data UI is even worse for
non-programmers, but due to its complexity and flexibility, that is
harder to improve too.

This was one of the things I wanted to bring up with the Galaxy team
at the conference (starting tonight). Unfortunately due to the Grimsvotn
Volcano's cloud of ash I won't be there.

In some cases yes, Galaxy knows the column meaning. In many of my
data files there is a # header line with tab separated column names,
which could be detected and shown.

Given there may or not be suitable column meta data, what I was
picturing was an expanded column picking dialogue that could show
the first few rows of data in each column (not using a simple HTML
widget anymore), or simply the first value (less drastic change).

So, rather than having as now something like: Pick column(s):
[ ] c1
[ ] c2
[ ] c3
[ ] c4
[ ] c5

If Galaxy has names in the column meta data you might have
Pick columns(s):
[ ] c1 - Identifier
[ ] c2 - Start
[ ] c3 - End
[ ] c4 - Strand
[ ] c5 - Sequence

Or, showing the first data entry: Pick columns(s):
[ ] c1, e.g. gi|12345678
[ ] c2, e.g. 123
[ ] c3, e.g. 456
[ ] c4, e.g. +
[ ] c4, e.g. ACGTACTGT...

In the final example I envision truncating very long values with
an ellipsis.

Both the above proposals don't require drastic changes to the
HTML widget and layout.

I guess we should file an enhancement issue on bitbucket:
https://bitbucket.org/galaxy/galaxy-central/issues/new

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Galaxy BLAST XML to tabular fix

2011-05-25 Thread Peter Cock
On Wed, May 18, 2011 at 6:40 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi Kanwei,

 Could you merge this branch or transplant this fix please?

 https://bitbucket.org/peterjc/galaxy-central/changeset/3c0b903c86a9

 https://bitbucket.org/peterjc/galaxy-central/src/blast_2011_05_18

 New branch, blast_2011_05_18

 There is no change to the logic, just to the assert statement where
 the sanity test didn't consider doubly masked sequences.

 Should I also update the XML wrapper's version number as well?

 Thanks,

 Peter

Hi Kanwei,

Did this email reach you? Let me know if you want a new unit
test to go with it and/or to bump the tool's version number.

Regards,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Galaxy BLAST XML to tabular fix

2011-05-25 Thread Peter Cock
On Wed, May 25, 2011 at 1:10 PM, Kanwei Li kan...@gmail.com wrote:
 Sorry, it slipped by. Committed!


No problem - thank you,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Is dynamic associated information per dataset possible?

2011-05-26 Thread Peter Cock
On Thu, May 26, 2011 at 10:34 AM, Roman Valls brainst...@nopcode.org wrote:
 Nate, is it just me or all the .png links on that wikipage are
 broken/missing ? :-S

I've noticed this on other pages, I think some stuff got moved a while back.
They just need the folder name prefixed...

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] stdout and stderr

2011-05-26 Thread Peter Cock
On Thu, May 26, 2011 at 1:36 PM, shashi shekhar meshash...@gmail.com wrote:
 Hi all,
 I am using one tool which gives stdout and stderr . it is showing red color
 (job failed) in browser.How can i resolve such type of problem.

 Regards
 shashi shekhar

See issue 325,
https://bitbucket.org/galaxy/galaxy-central/issue/325/

Until that is fixed, you'll probably need a wrapper script, there
is an example of this in tools/ncbi_blast_plus/hide_stderr.py

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] The powers of Cheetah, how?

2011-05-26 Thread Peter Cock
On Thu, May 26, 2011 at 1:22 PM, Kempenaar, M (med)
m.kempen...@med.umcg.nl wrote:
 Hi list,
 I've been working on a single Galaxy tool for a while now, and I'm running
 into all sorts of difficulties that can probably all be fixed using Cheetah.
 However I've been trying several ways of utilizing the (enormous) powers of
 Cheetah but so far I'm not successful. I'll try to ask my questions as clear
 as possible, please let me know if anything is unclear.
 In my tool I use the following input parameter:
 ---
   param name=report_options type=select multiple=True
 display=checkboxes
 option value=overviewphyla distribution/option
 option value=phyladendphyla dendograms/option
 option value=heatmapheatmap/option
 option value=dendogramdendogram/option
 option value=pcaprincipal component analysis/option
 option value=mdsmulti dimensional scaling/option
   /param
 ---
 Each of these options corresponds to a function in my R script, thus I want
 to do an if-else if-else to execute these functions that the user selected
 (note that multiple=True).
 Now the problem is, I also use R Sweave (sweaving R output (text/graphics)
 into LaTeX) to generate a nice looking PDF report, and since that's not a
 'real' programming/ scripting language, I can't for instance split the
 string (as I used to do in R) and check for the boxes a user selected, so my
 questions are:
 I planned on using Cheetah to include sections of Sweave LaTeX code for each
 box checked by the user, perfectly doable with an #if-#elif you'd think,
 however since the $report_options contains a comma-separated single string,
 I would need to split (or use regular expressions) to be able to do such an
 #if-#elif like:
 #if heatmap in $report_options.split(,)
 results in: NotFound: cannot find 'split' while searching for 'split'

Try changing this:

#if heatmap in $report_options.split(,)

to:

#if heatmap in str($report_options).split(,)

Although $report_options acts a bit like a Python string, it isn't one,
and lacks lots of useful string methods. We can turn it into a string
using the Python str function.

 Q: Followup from previous question, where within the sections of my tool XML
 file can I actually use Cheetah? This of course has to do with the order in
 which a tool is executed.

I believe only within the command.../command tag.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] stdout and stderr

2011-05-26 Thread Peter Cock
On Thu, May 26, 2011 at 2:42 PM, Bossers, Alex alex.boss...@wur.nl wrote:
 PS: 2 is STDERR and 1 is STDOUT. One could do a redirect of STDERR to STDOUT
 and STDOUT to a logfile. You won’t get a red box though when errors occur!

Right - if you need to detect an error state via the return code,
currently you have to do this with a wrapper script.

If you don't ever expect any errors, then you can treat stderr
as an output, or just redirect to to /dev/nul - but I don't like that
idea - silent failures are bad.

I regard fixing issue 325 as one of the top priorities in Galaxy, but
in the short term I'd settle for treating any output on stderr OR
a non-zero return code as a failure. i.e. Something like this patch,
even if more work is needed for PBS jobs:
https://bitbucket.org/galaxy/galaxy-central/issue/325/#comment-331776

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Display column headers in data_column parameter

2011-05-26 Thread Peter Cock
On Thu, May 26, 2011 at 3:45 PM, Johannes Eichner
johannes.eich...@uni-tuebingen.de wrote:
 Hi Peter,

 I defined an additional metadata element for the column names
 analogously to the column_types element in the module
 galaxy_basedir/lib/galaxy/tools/datatypes/tabular.py:
 MetadataElement( name=column_names, default=[], desc=Column names,
 param=metadata.ColumnTypesParameter, readonly=True, visible=False,
 no_value=[] )

 Then I read the column names in the function set.meta and saved them
 in the new metadata element:
 dataset.metadata.column_names = column_names

Do you look for a tab separated # header line at the start of the file
for these names?

 Finally, I changed a line of code in the function get.column.list in
 galaxy_basedir/lib/galaxy/tools/parameters/basic.py, such that
 Galaxy displays the column names in the data_column parameter:
 original line: column_list.append( str( i + 1 ) )
 changed line: column_list.append( str( i + 1 ) + :  +
 dataset.metadata.column_names[i]) )

 Kind regards,
 Johannes

From a Python style point of view, I'd use string formatting here:

column_list.append(%i: %s % (i+1, dataset.metadata.column_names[i]))

Do you have a branch on bitbucket for this, or a patch? I'd like to try it.
You could attach a patch to issue 554:
https://bitbucket.org/galaxy/galaxy-central/issue/554

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Filter data and cut column bugs

2011-05-27 Thread Peter Cock
On Mon, May 23, 2011 at 3:34 PM, Peter Cock p.j.a.c...@googlemail.com wrote:
 On Mon, May 23, 2011 at 3:09 PM, Anton Nekrutenko an...@bx.psu.edu wrote:
 Dear Peter:

 Yes, that would help.

 One possibility would be to have all new bugs CC'd to the dev
 mailing list? Not sure if everyone here would like that or not...

 But, you patches are definitely not unnoticed. We'll apply them
 (likely at the conference) ...

Kanwei has just applied the fix for issues 535 and 537, thanks!

https://bitbucket.org/galaxy/galaxy-central/issue/535/
Filter data on any column tool complains about hash comment lines

https://bitbucket.org/galaxy/galaxy-central/issue/537/
Filter data on any column tool casts unused columns

That leaves these two pending,

https://bitbucket.org/galaxy/galaxy-central/issue/534/
Cut column tool messes up # header lines

https://bitbucket.org/galaxy/galaxy-central/issue/536/
Filter data on any column tools shows nonsense % vs total on stdout

While I'm looking over minor bugs where I submitted a
patch, the following should also be fairly quick to review:

FASTQ to Tabular tool doesn't accept plain FASTQ
https://bitbucket.org/galaxy/galaxy-central/issue/436/

Conditional does not work with scripts in different path
https://bitbucket.org/galaxy/galaxy-central/issue/159/

Regards,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Implemented adding tools via admin console

2011-05-28 Thread Peter Cock
On Fri, May 27, 2011 at 11:21 AM, Vossen, Bodo
bodo.vos...@mpi-bn.mpg.de wrote:

 Hi all,

 I've implemented functionality for adding tools via the admin menu.

That sounds very useful in principle, although perhaps
the less flexible alternative of reloading tool_conf.xml
would achieve the same aim?

 ...
 Then it is checked if script and XML are compatible by
 checking if they have the same number of arguments.

This seems very fragile, and having looked at the code for
detecting the number of arguments in a Python script it
looks like it will get it wrong in many situations. Even
for something quite simple like this:

#sys.argv[0] is the script file itself
assert len(sys,argv)==4, Expect 3 arguments
arg1, arg2, arg3 = sys.argv[1:]

Also, what about conditional code like this?

assert len(sys.argv)==4, Expect 3 arguments
if sys.argv[1] == db:
database = sys.argv[2]
filename = None
else:
database = None
filename = sys.argv[2]
out_filename = sys.argv[3]

If I have read your get_parameter_number code right,
it would claim there are four arguments since there
are four lines of the basic form variable = sys.argv[i]

Also, it cannot possibly work when there are a varying
number of arguments, for example a repeat argument.
If you want a test case, my Venn Diagram tool on the
Tool Shed should be relevant (its a python script).

 If you have any comments or question please do not
 hesitate to contact me, I'm open for every kind of critics.

I would remove the attempt to check the number of arguments.
It is practically impossible to get this right in all cases,
so I don't think it is worth doing. Having valid script/xml
combinations rejected would be quite annoying.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Implemented adding tools via admin console

2011-05-30 Thread Peter Cock
On Mon, May 30, 2011 at 10:42 PM, Bossers, Alex wrote:
 Peter,
 if you were just interested in having a dynamic refresh
 it was posted long time ago and we have it operational. It
 comes as a tool in your tool list and refreshes all from the
 tool_conf.xml.

I'd like to see something like this functionality built into Galaxy,
on the admin page next to reload a single tool.

Having an extra standard tool (if world visible) is not an ideal
solution. If that is what you meant? I'll have to try it and see.

 Not the loc filtes though!

Reloading loc files is also on my wish list :)

 As far as i remember it doesn't kill running jobs.

Good - because otherwise you might as well restart Galaxy.

 It requires two lines of code in galaxy source and the addition of a
 tool to the tools section. I pasted it from my archive below. But it
 is quite self explanatory.
 Alex

Thanks, now I have two solutions to play with ;)

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Dynamic default values

2011-06-01 Thread Peter Cock
Dear all,

I was thinking about how to improve some of my tools and
wondered if there is any way to have dynamic default values
for parameters (depending on other parameters).

Obviously this won't make sense in all cases, but as a specific
example, consider my Venn Diagram tool:

https://bitbucket.org/peterjc/galaxy-central/src/9f66c5d1fca8/tools/plotting/venn_list.xml

I have separate parameters for each dataset (file input) and its
caption (text input). Like so:

  param name=set type=data format=tabular,fasta,fastq,sff
label=Members of set help=Tabular file (uses column one), FASTA,
FASTQ or SFF file./
  param name=lab size=30 type=text value=Group
label=Caption for set/

Is something like this possible (or generally useful?)

  param name=set type=data format=tabular,fasta,fastq,sff
label=Members of set help=Tabular file (uses column one), FASTA,
FASTQ or SFF file./
  param name=lab size=30 type=text value=$set.name
label=Caption for set/

i.e. The default value of the label text parameter lab would be the
human friendly name of the associated data file parameter set. I'd
expect changing the file to also change the text box.

I have thought of a workaround, but not yet tried it: Use the empty
string as the default, and use an if statement in the Cheetah
template for constructing the command line string.

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] The new hg based Galaxy Tool Shed

2011-06-01 Thread Peter Cock
On Wed, Jun 1, 2011 at 3:22 PM, Nate Coraor n...@bx.psu.edu wrote:
 Hi Peter,

 Greg will probably reply, but I'll throw in my $0.02 as well.

Great - but with your answers you've triggered more questions ;)

 Peter Cock wrote:
 Hi Greg et al,

 I've just been looking over your slides from last week about the new
 'Galaxy Tool Shed', which are posted online here:

 http://wiki.g2.bx.psu.edu/GCC2011

 http://wiki.g2.bx.psu.edu/GCC2011?action=AttachFiledo=gettarget=GalaxyToolShed.pdf

 They talk about how you will be tracking individual tools in hg repositories.

 I can see two ways this might work:

 (1) Each of these tool specific repositories (or branches if you just make 
 one
 repository for each tool owner) would be a full fork of the Galaxy code base.
 This allows in principle tools to include changes to core functionality (but
 that seems dangerous due to potential merge clashes), and any existing
 tool contributor's pre-existing hg forks on bitbucket might be reused.

 The tool shed isn't really intended for framework changes - I would
 suggest keeping these as bitbucket forks, although it would certainly be
 good if we had a way to locate the list of such forks centrally.

Well, as long as the repository is created by forking on bitbucket, then
the link existing in the bitbucket web interface.
https://bitbucket.org/galaxy/galaxy-central/descendants

 (2) Each of these tool specific repositories would ONLY track the tool 
 specific
 files you'd add to Galaxy to install the tool. So, typically there would be 
 an
 XML file, perhaps a wrapper script, maybe a sample loc file, and a plain
 text readme file.

 I'm guessing you've gone for something along the lines of idea (2), but I

 Yep.

It did seem the most likely route.

 would love to hear more about how this will all work. e.g. Where would
 the tool shed repositories be hosted, and would tool authors use hg to
 work with them, or something like the current web based tool upload?

 They're hosted here, and you can check them out and work with them
 locally as you do the Galaxy source itself, or use the new web-based
 upload to upload individual files or tarballs.

 Have a look at the test instance of the next-gen toolshed here if you'd
 like to see how it works:

  http://testtoolshed.g2.bx.psu.edu/

 Please feel free to use this as a sandbox and report any issues you find.

I see the existing usernames and passwords from the old Tool Shed were
transferred - that makes life easier. And it lists the hg information, e.g.

hg clone http://pete...@testtoolshed.g2.bx.psu.edu/repos/peterjc/venn_list
hg clone 
http://pete...@testtoolshed.g2.bx.psu.edu/repos/peterjc/tmhmm_and_signalp

What happens with branches? Would the Tool Shed just show the
default branch? That seems best for a simple UI.

I have a query regarding the way the tools are shown in tables and the
version column, which shows a changeset and revision number. According
to Greg's slides (slide #10, titled Simpler tool versioning which seems ironic
to me), the old numerical version is still there in the XML - and I'd prefer to
see that. How about having both shown (two columns, perhaps call them
Public version and hg version or hg revision).

With regards to the planned installation functionality, what happens when
a tool repository (aka Tool Suite in the old model) contains several XML
wrappers - would you be able to choose which are wanted? The use case
I have here is when several tools share some common dependency (which
should be tracked in a single repository), and were therefore useful to
bundle together as a suite, but where not all the tools will be of global
interest (e.g. My TMHMM, SignalP, etc suite).

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] The new hg based Galaxy Tool Shed

2011-06-01 Thread Peter Cock
On Wed, Jun 1, 2011 at 4:22 PM, Greg Von Kuster g...@bx.psu.edu wrote:
 Hello Peter - I finally got a chance to jump in - see my inline comments...

Hi :)

 What happens with branches? Would the Tool Shed just show the
 default branch? That seems best for a simple UI.

 Some of the branching details are yet to be worked out, but forks are easy
 because repository urls include the unique username of the Galaxy user.

Well, yes and no - as long as there are competing versions of a Galaxy tool
(e.g. from an original author and a fork by a second author), and they use
the same ID in their XML, you have a clash. This will have to be considered
in the (automated) install interface. i.e. In general, when installing
or updating
any tool, there may be existing versions of some components already present.
In fact two completely unrelated tools could even have the same XML ID by
accident.

 I have a query regarding the way the tools are shown in tables and the
 version column, which shows a changeset and revision number. According
 to Greg's slides (slide #10, titled Simpler tool versioning which seems 
 ironic
 to me), the old numerical version is still there in the XML - and I'd prefer 
 to
 see that. How about having both shown (two columns, perhaps call them
 Public version and hg version or hg revision).

 We can certainly do this, but what would you like to see for tool suites and
 other tool types?  The old Galaxy tool shed strictly required a 
 suite_config.xml
 file that included the overall version of the suite.  To make tool development
 easier, we're no longer requiring the inclusion of a suite_config.xml file ( 
 we
 don't even differentiate types of tools since everything is a repository ).  
 The
 definition of a tool in the next gen tool shed, is fairly loose.  A tool 
 could be
 data, it could be an exported workflow, it could be a suite of tools, a single
 tool, or just a set of files.  So we'll need to define an easy way to provide 
 a
 version of the tool if it will be different than the version of the 
 repository tip.

I see what you mean for the suite case. Maybe on the view details page
each constituent tool could be shown with its classical version number
from the XML file?


 Here's the future big picture highlights.  Many of the details are yet to
 be defined and fleshed out...

 We're hoping that in the near future there will be many local tool sheds
 ( just like Galaxy instances ).  I'm thinking that there will be a central 
 tool
 shed broker of sorts that is hosted by the Galaxy team.  This broker will
 provide 2 basic functions.  It will enable local tool sheds ( including the
 current tool shed hosted by the Galaxy team ) to advertise their tools,
 and it will allow local Galaxy instances to use those advertisements to
 find tools that the local Galaxy instance's users are interested in.  This
 specific point has not yet been discussed to any depth, so consider it
 fluid for now.

I'm not immediately sold on this plan. To me one of the big plus points
of having a single Official Tool Shed looked after by the Galaxy team
is the convenience factor (a one stop shop), which requires critical mass,
plus whatever QA happens as part of the current approval process. I
would regard it as a step backwards if in order to hunt for a wrapper for
a given tool, I had to resort to Google in order to find all the individual
Galaxy Tool Sheds.

 When a Galaxy instance's admin locates tools within a specific tool shed
 that they want to install, they will be able to install them via a Galaxy tool
 installation control panel.  Think of a UI that provides a check-boxed list
 of tools that have been found in some tool shed or sheds. The Galaxy
 admin will check those tools he wants to install, and the tools, along with
 all dependencies will automatically be installed in the local Galaxy instance.
 Dependencies could include 3rd party binaries, maybe some form of data,
 and other forms of dependencies.  This is another good reason to keep
 tools separated in their own repositories.

If you mean by dependencies the small task of installing the tool XML
and associated scripts and data files currently bundled in the tar balls
on the current Tool Shed, that seems fine. Anything beyond that seems
difficult and likely to impose a significant extra load on tool wrapper
authors.

 The installation will be virtually automatic, requiring little or no manual
 intervention via a package manage of sorts.  This will be done using
 a combination of fabric scripts, and other components.  All of the
 underlying mercurial stuff will be handled beneath the UI layer.

This larger aim of installing the underlying dependencies is impossible
in general - but that seems to be what you want to aim for. Consider
obvious use case of closed source (non-redistributable) 3rd party binaries.
I can think of several examples from the current Tool Shed wrappers,
including the Roche Newbler off instrument applications, TMHMM
and SignalP.


Re: [galaxy-dev] The new hg based Galaxy Tool Shed

2011-06-01 Thread Peter Cock
On Wed, Jun 1, 2011 at 5:25 PM, Nate Coraor n...@bx.psu.edu wrote:
 Peter Cock wrote:

 Well, yes and no - as long as there are competing versions of a Galaxy tool
 (e.g. from an original author and a fork by a second author), and they use
 the same ID in their XML, you have a clash. This will have to be considered
 in the (automated) install interface. i.e. In general, when installing or
 updating any tool, there may be existing versions of some components
  already present. In fact two completely unrelated tools could even have
 the same XML ID by accident.

 I agree there could be a problem with tool ID uniqueness.  We've talked
 about suggesting that people namespace their tool IDs to prevent this,
 but nothing formal has materialized at this point.

That sounds sensible, and the sooner the better.

 I'm not immediately sold on this plan. To me one of the big plus points
 of having a single Official Tool Shed looked after by the Galaxy team
 is the convenience factor (a one stop shop), which requires critical mass,
 plus whatever QA happens as part of the current approval process. I
 would regard it as a step backwards if in order to hunt for a wrapper for
 a given tool, I had to resort to Google in order to find all the individual
 Galaxy Tool Sheds.

 It'll be possible for people to run their own Tool Sheds if they'd like,
 for whatever purpose - and this may be necessary for sharing extremely
 large data which we can't possibly host at the main Shed, but there
 should be an aggregator somewhere which lists all of the available
 public Sheds and makes it easy to add them as new sources to your Galaxy
 install.  Like a slightly more organized Debian APT system.

If there is an official meta tool shed aggregator, that would address
my main concern about fragmenting things.

 If you mean by dependencies the small task of installing the tool XML
 and associated scripts and data files currently bundled in the tar balls
 on the current Tool Shed, that seems fine. Anything beyond that seems
 difficult and likely to impose a significant extra load on tool wrapper
 authors.

 It'll be up to the authors to decide what level of complexity they care
 to handle,

Good - that silences a lot of my worries.

 ... but we want to move away from the situation where someone
 installs a tool but finds that it's unusable because the actual
 underlying dependency doesn't exist and is non-trivial to install.

Improving the documentation shown on the tool shed could help here -
make it easier for the tool wrapper to tell the Tool Shed user what will
be required.

Currently we get a short plain text box as part of the upload (no markup),
and can include a (plain text) readme file which is easily viewable from
the tool shed. I've just filed an enhancement request on a related idea:

https://bitbucket.org/galaxy/galaxy-central/issue/565/
Show mockup of tool GUI in Galaxy Tool Shed

 This larger aim of installing the underlying dependencies is impossible
 in general - but that seems to be what you want to aim for. Consider
 obvious use case of closed source (non-redistributable) 3rd party binaries.
 I can think of several examples from the current Tool Shed wrappers,
 including the Roche Newbler off instrument applications, TMHMM
 and SignalP.

 Agreed, thankfully, the current dependency system (tool_dependency_dir
 in the config file (not in the sample config, sorry, I'll rememdy that
 shortly!)) only requires that you have an environment file that
 configures whatever is necessary (generally just $PATH) to find a
 dependency.  So the tools in the Tool Shed would provide the XML,
 wrapper script (if necessary), and then instructions or perhaps an
 interface to configure the env file.

I'd hope the common case where all that is required is the tool binary
to be on the path, would not require any extra configuration files. See
also: https://bitbucket.org/galaxy/galaxy-central/issue/82

 [cut]

 Are you biting off more than you can chew? I hope I am misinterpreting
 your plans.

 Hopefully not!  We're trying to think this through pretty thoroughly
 before we get started, thanks for joining in the discussion. =)

I've been reassured :)

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Blast2GO in Galaxy

2011-06-02 Thread Peter Cock
On Tue, May 31, 2011 at 11:26 AM, Peter Cock p.j.a.c...@googlemail.com wrote:
 Hi all,

 As I mentioned on Twitter, at the end of last week I wrapped Blast2GO
 for Galaxy, using the b2g4pipe program (Blast2GO for pipelines). See
 http://blast2go.org/

 Currently current code is on bitbucket under my tools branch,
 https://bitbucket.org/peterjc/galaxy-central/src/tools/

 Specifically files tools/ncbi_blast_plus/blast2go.* viewable here:
 https://bitbucket.org/peterjc/galaxy-central/src/tools/tools/ncbi_blast_plus/

 I've using a Galaxy location file, tool-data/blast2go.loc, to offer
 one or more Blast2GO configurations (properties files), mapping this
 to the -prop argument. This way you could have for example the Spanish
 Blast2GO server with its current database (May 2010), and a local
 Blast2GO database. I want to setup a local database and try this
 before submitting the wrapper to the Tool Shed.

I've done that using the current latest data from May 2011, so one
year newer than the current default public Blast2GO database provided
by the Blast2GO developers (dated May 2011). This requires downloaded
some large data files and importing them into a MySQL database, and
that took about 24 hours to process in all. The last step is done via
their Java tool and took most of the time. For more details, see:
http://blast2go.org/localgodb

However, the end result is worth the effort as running Blast2GO is now
much much faster. I need to try it on some larger files, but we're
talking at least an order of magnitude quicker, maybe two.

 The input to the tool is a BLAST XML file, specifically blasting
 against a protein database like NR (so blastp or blastx, not blastn
 etc). I want to try some very large BLAST XML files to confirm
 b2g4pipe copes with the current BLAST+ output files - I gather there
 were some problems in this area in the past, so having the wrapper
 script fragment the XML might be a workaround. Currently the only real
 function of the wrapper script is to rename the output file - b2g4pipe
 insists on using the *.annot extension.

Now that I have a local Blast2GO database, I'll be able to try out
b2g4pipe on some bigger XML files (without having to wait ages).

 Right now the only output is a tabular three column *.annot file,
 which can be loaded into the Blast2GO GUI. For analysis within Galaxy,
 I'm wondering about an option to split the first column (which holds
 the original FASTA query's identifier and any description) in two.
 i.e. Split at the first white space to give the FASTA identifier, and
 any optional description as a separate column. That would make
 linking/joining/filtering on the ID much easier.

 If anyone has any comments or feedback now, that would be welcome.

I've submitted the initial version to the Galaxy Tool Shed now, but
would still welcome feedback and would even consider non-backwards
compatible changes in the short term.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Updates to Galaxy Community Conference website

2011-06-02 Thread Peter Cock
Hi all,

I have a couple of minor suggestions for improving the website
http://galaxy.psu.edu/gcc2011/Home.html

First, please fill in a meaningful HTML titles since that will be used
as the default caption in a bookmark, tab or window title.

Second, please add a link to the presentation slides and videos,
e.g. from the programme page and main page?

http://wiki.g2.bx.psu.edu/GCC2011

I realise this isn't complete yet, but it is already a useful resource.
Also once you have all the slides, could the links in the wiki table
be made to point straight to the file?

Thanks,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Sharing tool definitions between XML files

2011-06-02 Thread Peter Cock
On Wed, Oct 6, 2010 at 11:16 AM, Peter pe...@maubp.freeserve.co.uk wrote:
 On Tue, Sep 28, 2010 at 12:04 PM, Peter pe...@maubp.freeserve.co.uk wrote:
 Hi all,

 I'm wondering if there is any established way to share parameter
 definitions between tool wrapper XML files?

 For example, I am currently working on NCBI BLAST+ wrappers, and these
 tools have a lot in common. For example, the output format option will
 be a select parameter, and it will be the same for blastn, blastp,
 tblastn, etc. I can just cut and paste the definition but this seems
 inelegant and will be a long term maintenance burden if it ever needs
 updating (lots of places would need to be updated the same way). I'd
 like to have a shared XML snippet defining this parameter which I
 could then reference/include from each tool wrapper than needs it.

 I'm thinking of something like XML's !ENTITY ... might work. Is this
 possible?

 Peter


 Alex Bossers replied on the BLAST+ thread to express interest in
 sharing definitions between wrapper XML files:

 http://lists.bx.psu.edu/pipermail/galaxy-dev/2010-September/003376.html

 As I add more parameters to my BLAST+ wrappers, keeping all
 the files in sync is getting a bit tedious...

 Peter


Hi all,

I'm replying to this old thread after finding this old (fixed) issue
on the bug tracker,
https://bitbucket.org/galaxy/galaxy-central/issue/131/allow-xml-includes-in-all-xml-files

Does this offer a solution to sharing definitions in tool XML files?
Has anyone got an example using this?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Defining new file formats in Galaxy (for new tool wrappers)

2011-06-02 Thread Peter Cock
Hi all,

Something I've not needed to do until now is define a new file format
in Galaxy. I understand the basic principle and defining a subclass in
Python... however, how does this work with new tools on the Tool Shed?
In particular, if an output format is likely to be used by more than
one tool, can we get it added to the Galaxy core?

As an example, the basic functionality of the Blast2GO for pipelines
tool (b2g4pipe) takes a BLAST XML input file, and gives a tab
separated annotation output file. Galaxy already has 'blastxml' and
'tabular' file formats defined, so I didn't need to do anything extra.
However, the tool can also take (a directory of) InterProScan XML
files as input, so here a new 'interproscanxml' format would useful.
Then any wrapper using or producing InterProScan XML could take
advantage of this. e.g. Konrad's InterProScan wrapper could then offer
the XML output as an option in addition to or instead of the tabular
output.

Related to this example, why isn't there a generic base class for XML
formats in general?
https://bitbucket.org/galaxy/galaxy-central/issue/568/missing-xml-datatype-base-class

Regards,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Defining new file formats in Galaxy (for new tool wrappers)

2011-06-06 Thread Peter Cock
On Thu, Jun 2, 2011 at 6:39 PM, Greg Von Kuster g...@bx.psu.edu wrote:

 We will certainly include support for new data formats into the Galaxy core.
  In case you haven't seen it, details for adding new formats is available in
 our wiki at https://bitbucket.org/galaxy/galaxy-central/wiki/AddingDatatypes.

Hi Greg,

Should that page talk about lib/galaxy/datatypes/registry.py as well?
That seems to be where mime types are specified, and for some
reason (a historical fall back?), there is another sniffer listing here too
(as well as in datatypes_conf.xml).

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Components

2011-06-06 Thread Peter Cock
On Mon, Jun 6, 2011 at 2:45 PM, Mariette jmari...@toulouse.inra.fr wrote:
 Hi everyone,

 I'm new in galaxy and I want to create a component, so I have multiple
 questions.
 The component executes a script which takes different formats :
 fasta|fastq|sff. As output it takes a folder, to do so I used the
 extra_files_path attribute of an output data
 (--out='$output.extra_files_path'). The result file is stored in this folder
 with the log file and should be in the same format as the input one.
 I wrote the xml file and it works just fine, however as it's a folder there
 is no way to use the result file(s) for further analysis. How can I manage
 this ?

I use a wrapper script which moves the output file from the sub-directory
to the location Galaxy requested.

Regarding the file format, if you only have one input file you can use
format=input, otherwise use the change_format tag in the XML.

 Also it can happen that there is more than 1 file outputted, how can I
 manage multiple outputs ?

 An other question about the input format, I have an option to the script to
 specify the format. I saw there is auto-format detection routines in galaxy,
 there is a way so I can call them to automatically complete this option ?

Normally the wrapper specifies the output format explicitly.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Color blindness and Galaxy UI

2011-06-06 Thread Peter Cock
 On Mon, Jun 6, 2011 at 11:18 AM, James Taylor ja...@jamestaylor.org wrote:

 Peter, most of the current UI theme was designed by someone who is
 red-green colorblind (me), and we avoid using color alone for encoding
 information. The change has been reverted, thanks for brining it to our
 attention.

I expect you would have spotted this commit soon anyway then :)

On Mon, Jun 6, 2011 at 4:30 PM, Kanwei Li kan...@gmail.com wrote:
 Maybe the background color for the error can be changed as well? I just got
 a colorblind tester and #FA for example provides a much better contrast
 than the current color. (ss attached)

That sounds sensible - see what James thinks ;)

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Components

2011-06-07 Thread Peter Cock
On Tue, Jun 7, 2011 at 9:24 AM, Mariette jmari...@toulouse.inra.fr wrote:

 Thanks for the answer,

 so I tryed to write a wrapper, but the main script is not recognize.
 When I'm calling my wrapper in the xml file, I don't have to specify
 the full path but then from my wrapper if I want to call my main script
 I can't if this one is not on the PATH ? the wrapper current directory
 is the job_working_directory, there is anyway to access my main script
 just like it's done for the wrapper ?

Generally put the wrapper script next to your tool XML file, and don't
prefix it with any path in the command tag in the XML, e.g.

/opt/galaxy/tools/mariette/my_tool.xml
/opt/galaxy/tools/mariette/my_tool.sh

(or my_tool.py or my_tool.pl etc depending on what language you used)

Your wrapper script then has to call the real executable, and the
normal way to do this on Unix is to have it on the PATH. You could
also hard code an absolute path.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Components

2011-06-07 Thread Peter Cock
On Tue, Jun 7, 2011 at 10:56 AM, Mariette jmari...@toulouse.inra.fr wrote:

 Thanks for the answer,
 Still considering the number of outputed files problem. In my script I know
 that if
 a cleaning option is set 2 other files will be created, so I check if the
 option is asked by the
 user and try this :

  outputs
   data name=log format=txt /
   #if $clean_pairends.clean_pairends_select==y:
       data name=output format=input/
   #end if
  /outputs

 but this is not working ... there is a way to do what I want ?
 thx
 Jerome

The #if stuff is Cheetah syntax, and only applies within the command
tag - not to the XML file in general.

You probably want a conditional output using when, see:
https://bitbucket.org/galaxy/galaxy-central/wiki/ToolConfigSyntax

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] galaxy-dev] format=input

2011-06-08 Thread Peter Cock
On Wed, Jun 8, 2011 at 9:26 PM, Jim Johnson johns...@umn.edu wrote:
 I added an attribute format_source  (analogous to metadata_source)
 to the data tag to deal with that kind of issue:
 see:   https://bitbucket.org/galaxy/galaxy-central/wiki/ToolConfigSyntax


Where is the code to handle this? Maybe I missed the commit
with all the recent activity.

What happens if both format and format_source are given? I
would hope an error - which means the wiki is misleading in
that format is listed as required.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] User list and disk space?

2011-06-09 Thread Peter Cock
On Tue, May 17, 2011 at 2:21 PM, Nate Coraor n...@bx.psu.edu wrote:
 Peter Cock wrote:

 That sounds really useful - it is something that you plan on adding
 to Galaxy officially?

 Yes, I'm working on adding a lot of user- and admin- side access to
 various numbers about disk usage (used by histories, used by deleted
 data, etc. and disk quotas.  I hope to have this done shortly after the
 GCC.

 Thanks,
 --nate

Hi Nate,

Since you're looking at this kind of thing, would the Saved Histories
page be worth updating to show the size on disk of each history? As
a Galaxy user that seems moderately useful - although perhaps more
of interest to Galaxy admins?

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Setting up Galaxy with SGE

2011-06-15 Thread Peter Cock
On Wed, Jun 15, 2011 at 2:30 AM, Ka Ming Nip km...@bcgsc.ca wrote:
 Hello,

 I have been trying to set up Galaxy to run its jobs on my SGE
 cluster using the Unified Method, based on the steps described
 in the wiki:

 https://bitbucket.org/galaxy/galaxy-central/wiki/Config/Cluster

 My local instance of Galaxy is installed under my home directory,
 but my cluster is on a different file system. I need to ssh to the
 head node before I can submit jobs.

 Is there any way to set up Galaxy with a SGE cluster on a different
 file system?

I haven't looked into it in great detail, but we currently have a
similar network setup, and would also like to link Galaxy to SGE.

The main problem is the lack of a shared file system - in order
to run a job on the cluster we (Galaxy) has to get the data onto
the cluster, run the job, and get the results back.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Conditional parameters regression: KeyError: '__current_case__'

2011-06-15 Thread Peter Cock
Hi Dan,

I just started work on updating my MIRA wrapper, and fell over a
regression present of 6079:5e6254f2b9e3, steps to reproduce on Linux:-

1. Install latest Galaxy
2. Install my MIRA wrapper v0.0.1 from the tool shed (don't need MIRA for this)
3. Run Galaxy
4. Select the MIRA tool
5. Change Backbones/reference chromosomes? or
   Sanger/Capillary reads? or 454 reads? or
   Solexa/Illumina reads? to yes
6. Instead of new file parameter appearing, get a KeyError:
'__current_case__' exception (details below).

Shorter example without a 3rd party tool:

1. Install latest Galaxy
2. Run Galaxy
3. Select the Convert SOLiD output to fastq tool
4. Change Is this a mate-pair run? to yes
5. Instead of new file parameters appearing, get a KeyError:
'__current_case__' exception.

By going through recent commits and retesting, I found the commit
which caused this regression:

changeset:   6078:894112e0b9d5
user:Daniel Blankenberg d...@bx.psu.edu
date:Thu Jun 09 11:26:41 2011 -0400
summary: Some fixes for interplay between DataToolParameter,
implicit datatype conversion and dynamic select
lists. Closes #491.

I guess there are downsides to running my development environment off
galaxy-central rather than galaxy-dist ...

Peter

-

Exception details,

KeyError: '__current_case__'
clear this
clear this
URL: http://127.0.0.1:8081/tool_runner/index
Module weberror.evalexception.middleware:364 in respond view
  try:
__traceback_supplement__ = errormiddleware.Supplement,
self, environ
app_iter = self.application(environ, detect_start_response)
try:
return_iter = list(app_iter)  app_iter =
self.application(environ, detect_start_response)
Module paste.debug.prints:98 in __call__ view
  try:
status, headers, body = wsgilib.intercept_output(
environ, self.app)
if status is None:
# Some error occurred  environ, self.app)
Module paste.wsgilib:539 in intercept_output view
  data.append(headers)
return output.write
app_iter = application(environ, replacement_start_response)
if data[0] is None:
return (None, None, app_iter)  app_iter =
application(environ, replacement_start_response)
Module paste.recursive:80 in __call__ view
  environ['paste.recursive.script_name'] = my_script_name
try:
return self.application(environ, start_response)
except ForwardRequestException, e:
middleware = CheckForRecursionMiddleware(  return
self.application(environ, start_response)
Module paste.httpexceptions:632 in __call__ view
 []).append(HTTPException)
try:
return self.application(environ, start_response)
except HTTPException, exc:
return exc(environ, start_response)  return
self.application(environ, start_response)
Module galaxy.web.framework.base:145 in __call__ view
  kwargs.pop( '_', None )
try:
body = method( trans, **kwargs )
except Exception, e:
body = self.handle_controller_exception( e, trans,
**kwargs )  body = method( trans, **kwargs )
Module galaxy.web.controllers.tool_runner:68 in index view
  # so make sure to create a new history if we've never had
one before.
history = trans.get_history( create=True )
template, vars = tool.handle_input( trans, params.__dict__ )
if len(params)  0:
trans.log_event( Tool params: %s % (str(params)),
tool_id=tool_id )  template, vars = tool.handle_input( trans,
params.__dict__ )
Module galaxy.tools:966 in handle_input view
  # Update state for all inputs on the current page taking new
# values from `incoming`.
errors = self.update_state( trans,
self.inputs_by_page[state.page], state.inputs, incoming )
# If the tool provides a `validate_input` hook, call it.
validate_input = self.get_hook( 'validate_input' )
errors = self.update_state( trans, self.inputs_by_page[state.page],
state.inputs, incoming )
Module galaxy.tools:1159 in update_state view
  group_state = state[input.name] = {}
# TODO: we should try to preserve values if we can
self.fill_in_new_state( trans,
input.cases[current_case].inputs, group_state, context )
group_errors = dict()
group_old_errors = dict()
self.fill_in_new_state( trans, input.cases[current_case].inputs,
group_state, context )
Module galaxy.tools:876 in fill_in_new_state view
  context = ExpressionContext( state, context )
for input in inputs.itervalues():
state[ input.name ] = 

Re: [galaxy-dev] User list and disk space?

2011-06-16 Thread Peter Cock
On Thu, Jun 9, 2011 at 4:20 PM, Nate Coraor n...@bx.psu.edu wrote:
 Peter Cock wrote:
 Hi Nate,

 Since you're looking at this kind of thing, would the Saved Histories
 page be worth updating to show the size on disk of each history? As
 a Galaxy user that seems moderately useful - although perhaps more
 of interest to Galaxy admins?

 This is already done in development, I was going to finish up the code
 and commit it within the next couple of weeks.


This has now been committed to galaxy-central, and looks very useful.

I notice the Saved Histories page now has buttons Rename, Delete,
Delete and remove datasets from disk (new) and Undelete. That
distinction between hide it away and really delete should make sense
to users from the Recycle Bin/Trash idea for files on Windows/MacOSX.

Thanks,

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] The new hg based Galaxy Tool Shed

2011-06-16 Thread Peter Cock
On Thu, Jun 16, 2011 at 3:00 AM, Ravi Madduri madd...@mcs.anl.gov wrote:
 I apologize for jumping on to this thread a bit late. I read below that
 there is a plan to pull tools into a galaxy installation automagically. I
 wonder if you plan on providing some kind of API to query the tool registry
 and discover the tools and install them into an existing galaxy
 installation.

Yes, have a look at Greg's slides from the Galaxy Community Conference
http://wiki.g2.bx.psu.edu/GCC2011

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Updating tools on new hg based Galaxy Tool Shed

2011-06-16 Thread Peter Cock
Hi all,

I just tried updating one of my tools on the new hg based Galaxy Tool
Shed and ran into some issues. I guess I'm the first person to try
this...

First of all, there is no clear update action like there used to me.
Could that be restored please?
https://bitbucket.org/galaxy/galaxy-central/issue/585/hg-tool-shed-lacks-update-tool-action

I tried using the upload files to repository option, and picked my
tarball. I did not pick a location since I wanted the tar ball
unpacked at the repository root, but this isn't what happened:
https://bitbucket.org/galaxy/galaxy-central/issue/586/hg-tool-shed-upload-files-to-repository

Finally, it seems there is currently no delete files action, so I
can't fix this (delete the misplaced files) via the web interface.
https://bitbucket.org/galaxy/galaxy-central/issue/584/cant-delete-files-in-new-hg-tool-shed

Are you expecting tool authors to work primarily at the hg level?

Regards,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Conditional parameters regression: KeyError: '__current_case__'

2011-06-16 Thread Peter Cock
On Thu, Jun 16, 2011 at 2:23 PM, Daniel Blankenberg d...@bx.psu.edu wrote:
 Hi Peter,

 If you were curious it was fixed about 7 hours later in 5681:0886ed0a8c9f

 Thanks for using Galaxy,

 Dan

Thanks - I know this kind of unexpected breakage is inevitable sometimes,
but would recommend tool developers in general run with galaxy-dist or
galaxy-central?

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Conditional parameters regression: KeyError: '__current_case__'

2011-06-16 Thread Peter Cock
On Thu, Jun 16, 2011 at 3:01 PM, Daniel Blankenberg d...@bx.psu.edu wrote:
 Hi Peter,
 Production servers, or any server where you have users, should always track
 Galaxy-dist.

Yes, that's very clear.

 Galaxy-central is a development repo (you get access to the latest and
 greatest, but also should Expect more bugs to pop up at any given time),
 from the Galaxy-central header on bitbucket: Main development repository
 for Galaxy. Active development happens here, and this repository is thus
 intended for those working on Galaxy development. See
 http://bitbucket.org/galaxy/galaxy-dist/ for a more stable repository
 intended for end-users.
 Its really a personal choice that each (tool) developer will have to make
 based upon whether they consider themselves to be a pure end-user (just
 adding tools or running a Galaxy server) who only wants to work on a
 stable branch; or if they want to contribute to Core development or gain
 early access to development features (and bugs).  I would say that finalized
 tools and e.g. tools submitted to the tool shed should be vetted against
 galaxy-dist.

OK, that's basically how I've been working - quite often during tool development
I've hit bugs in Galaxy and wanted to make branches with fixes etc, or needed
new functionality for a tool that isn't yet in galaxy-dist.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Updating tools on new hg based Galaxy Tool Shed

2011-06-16 Thread Peter Cock
On Thu, Jun 16, 2011 at 3:26 PM, Greg Von Kuster g...@bx.psu.edu wrote:
 Hi Peter,

 I understand, and that is why I am working feverishly to add features
 that do not force the use of hg form the command line (I've got several
 features functional, but not everything yet).  The current tool shed requires
 some use of hg if you want to delete files from the repo, but other than that,
 hg is not really required.

That's great, but I do feel the new hg went live a bit prematurely.

 The current tool shed also provides some very useful features (e.g.,
 browsing tool code files) that were not available in the old tool shed.

Really? I find the browsing the tool code files has regressed.

In old tool shed had a simple non-interactive tree of the files
visible on the main page for a tool or suite, where the individual files
could be clicked on to view/download. It was simple and effective
(at least when there weren't many files which seems typical).

The new tool shed makes you click on tool actions, browse repo,
and then gives an interactive tree which is fully collapsed by default,
forcing lots more clicking to drill down to the files. All that clicking
makes it slow and fiddly to use.

 In addition, tool developers are no longer forced to to the extra work
 that was necessary in the old tool shed to add a tool (i.e., there is no
 longer a requirement for tool suite config definitions ).

For tool suites you have a point, that was a bit of a hassle.

 My hope is that the current features make everyone happy
 enough that they will be able to wait for those messing featuers
 that are wanted, using the current features in the meantime.

It'll get there :)

Another big regression I would prioritise is the complete lack of
any user provided text on a tool's main page. The old tool shed
had the minimal text box where you could summarise the tool,
its requirements, recent changes etc. The new tool shed has
nothing to replace this as far as I can see.

A simple wiki like, ReST, or just plain text details box would do.

Automatic detection of a readme text file in the repo, to be
displayed on the tools' main page would be an elegant solution
(are you familiar with how github.com does this?). It also keeps
the description in the hg repo which is a plus point.

The mock up idea would be another (more complex) way to
tackle this, https://bitbucket.org/galaxy/galaxy-central/issue/565

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Cluster Usage

2011-06-16 Thread Peter Cock
On Thu, Jun 16, 2011 at 4:31 PM, Robert Jackson rj...@utpa.edu wrote:
 Does anyone have practical information for installation and set up for the
 galaxy distribution on a cluster??


Did you not get these replies?

http://lists.bx.psu.edu/pipermail/galaxy-user/2011-June/002715.html
http://lists.bx.psu.edu/pipermail/galaxy-user/2011-June/002715.html
http://lists.bx.psu.edu/pipermail/galaxy-user/2011-June/002716.html

On the plus side, the galaxy-dev list is more appropriate for this
question than galaxy-user list.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Updating tools on new hg based Galaxy Tool Shed

2011-06-16 Thread Peter Cock
On Thu, Jun 16, 2011 at 5:40 PM, Chris Fields cjfie...@illinois.edu wrote:

 I'm of the opposite mind; I'm not sure there is an absolute need to
 have one's tools be a branch or fork of the main tool shed, though
 I agree there is also utility in allowing that (having a customized
 'main' repo, or optimizing tools already present).

To clarify, I have been using branches on an hg fork of the main
Galaxy repository up until now, then preparing tarballs for upload
to the Galaxy Tool Shed. I could equally have tracked my local
changes under git, svn or cvs. The point is under this model, the
Tool Shed has no connection to whatever repository (or repositories)
the tool author uses on their machine.

Barring teething issues like Bug 584/585/586 tool authors could
continue to use this development model, where the fact that
the Galaxy Tool Shed uses an hg repository internally is an
irrelevant internal implementation detail.

As far as I know, there are no plans to make the Tool Shed work
directly with a full Galaxy hg repo - only mini-repositories, one
for each tool or tool suite.

 Having completely
 separate repos for tools seems cleaner, focusing development on
 those tools alone (not any of the others present in a branch) and
 pushes maintenance of the tool back to the tool developer themselves

Yes, and that is the model the new Galaxy Tool Shed is following.
Although I'm a little hazy on the details of tool suites in the new model
(and there are lots of situations where it makes sense to bundle
several tools).

 (e.g. there is no need for the galaxy devs to 'pull' in changes at any
 point into a main repo).

Well, there still is for general bug fixes, adding new file formats, and
merging any tools which they decide to include in the core set.

 This might also allow the galaxy devs to designate a set of
 'blessed' or supported tools in various repositories.

Indeed.

 Re: git: as Peter knows I'm also primarily a git/github user.  It is
 feasible at some future point to allow git/github repos.  For instance,
 one could possibly integrate github usage via this:

 http://hg-git.github.com/

 Of course, I think it's much more important that any additional
 vcs integration wait until the new tool shed interface stabilizes
 somewhat, but (at least from the github perspective) seems like
 it shouldn't be terribly hard to do.

True - if there were sufficient demand from Tool Shed contributors.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Question about installing NCBI BLAST+ onto Galaxy

2011-06-29 Thread Peter Cock
On Tue, Jun 28, 2011 at 9:52 PM, George Michopoulos
gior...@stanford.edu wrote:
 Hey everyone,
 Hope all is well! I was wondering if someone could help me with
 another error I ran into. I recently downloaded the NCBI BLAST+ toolkit and
 it automatically installed itself into Galaxy. I'm just wondering if anyone
 knows where I am supposed to put the directory with the database files it
 needs to run correctly, or if it makes a difference.

It shouldn't matter as long a you use an appropriate path in the
blastdb.loc and blastdb_p.loc files. We use /data/blastdb/ for ours.

 I have configured the
 blastdb.loc file as shown below, and the database now appears in the
 drop-down menu for the NCBI tools, but when I try executing any of the
 BLASTs Galaxy returns the following error, regardless of the path
 permutation I try:

At least Galaxy is finding the loc file :)

 When I tried putting the databases within the galaxy installation (within
 galaxy_test) and I used the whole path:
 BLAST Database error: No alias or index file found for nucleotide
 database [/Users/burtonigenomics/Rosa_Files/bin/fastx_bin/galaxy_test/tool-data/blastdb/refseq_rna] in
 search path
 [/Users/burtonigenomics/Rosa_Files/bin/fastx_bin/galaxy_test/database/job_working_directory/28::]
 Return error code 2 from command: tblastx -query
 /Users/burtonigenomics/Rosa_Files/bin/fastx_bin/galaxy_test/database/files/000/dataset_27.dat -db
 /Users/burtonigenomics/Rosa_Files/bin/fastx_bin/galaxy_test/tool-data/blastdb/refseq_rna
 -evalue 0.001 -out
 /Users/burtonigenomics/Rosa_Files/bin/fastx_bin/galaxy_test/database/files/000/dataset_32.dat -outfmt
 6 -num_threads 8

Does BLAST+ work at the command line?

Does BLAST+ work within Galaxy for FASTA vs FASTA (rather than
FASTA vs database)?

Also what does this give:

ls 
/Users/burtonigenomics/Rosa_Files/bin/fastx_bin/galaxy_test/tool-data/blastdb/refseq_rna.*

My guess is you have tried a valid path, but that the Galaxy user does
not have read permission so BLAST+ can't open the database.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Question about installing NCBI BLAST+ onto Galaxy

2011-06-30 Thread Peter Cock
On Thu, Jun 30, 2011 at 1:26 AM, George Yianni Michopoulos wrote:
 Hey Peter,

 Thanks so much for your help, your comments helped me realize what
 was wrong. The issue was that I wasn't including the name of the
 database within the path, just the folder it was in!

 Best,
 George Michopoulos


Easily done, I'm glad its working now.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] optional integer value in tool config

2011-07-04 Thread Peter Cock
On Mon, Jul 4, 2011 at 1:46 PM, Sarah Diehl di...@immunbio.mpg.de wrote:
 Hello,

 in the config of a tool I want to integrate into Galaxy I would like to have
 a parameter with the following properties:

 param name=var type=integer label=Var value=50 optional=true
 min=10 max=100 /

 So it should be possible to leave this field empty, but when you choose to
 enter a number it should be between 10 and 100.
 The problem now is that Galaxy refuses to accept it when this field is
 empty.


Which version of Galaxy are you running? Optional float and integer
parameters was only recently made possible under issue 403 with an
initial fix committed two weeks ago,

https://bitbucket.org/galaxy/galaxy-central/issue/403
https://bitbucket.org/galaxy/galaxy-central/changeset/adcc600effa4


 After encountering this problem I tried to solve the issue within Cheetah
 and tried the following:

 #if $var.value  10 or $var.value  100
 #set $var.value = 
 #end if

You'd need to try something like this:

#if int($var.value)  10 or int($var.value)  100
#set $var.value = 
#end if

but you also have to first exclude the case where is is already empty
as then int would fail.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] New Tool Shed: Inconsistent field names

2011-07-06 Thread Peter Cock
On Tue, Jul 5, 2011 at 7:24 PM, Greg Von Kuster g...@bx.psu.edu wrote:

 I have previously commented on the problem with showing the
 description on its own without the context of the tool name as
 displayed in Galaxy:

 http://lists.bx.psu.edu/pipermail/galaxy-dev/2011-June/005640.html

 and:

 http://lists.bx.psu.edu/pipermail/galaxy-dev/2010-December/004012.html
 http://lists.bx.psu.edu/pipermail/galaxy-dev/2010-December/004017.html
 http://lists.bx.psu.edu/pipermail/galaxy-dev/2010-December/004019.html

 One idea would be to show just the name in the left hand pane of
 Galaxy, with the description as a tool tip AND show this at the top of
 the tool main page in the central panel. This would encourage tool
 authors to write this field as a self contained synopsis, and make the
 left hand panel less text heavy.


 This may be considered for the Galaxy instance (the decision is in the
 hands of others), but would probably not work in the tool shed.  Again,
 we'll see what we can improve as I make progress on the repository
 metadata implementation.


Sorry if that was unclear, I did mean the main Galaxy interface there.

Peter

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] From NCBI SRA to UCSC viewer pipeline.

2011-07-07 Thread Peter Cock
On Thu, Jul 7, 2011 at 7:14 AM, colin molter colin.mol...@gmail.com wrote:
 Hi all,
 i am trying to use a local instance of my galaxy to pre-format data stored
 at sra-ncbi.
 Does anyone has a working pipeline that (s)he could share.
 Here is the pipeline I am using, with some questions.
 1/ download sra files to my server.
 2/ transform them in fastq using the sra toolbox.
 3/ upload them in galaxy, by using the 'add to data library'
 4/ use the fastq groomer to enable to use the fast q in galaxy.
 Note: i guess that the data at sra are already in the fastq sanger format.
 So it could be nice to be able to skip that point (it took 10 hours to groom
 a fastq of 25Gb).

Just upload load them and set the file format to fastqsanger
(either when you upload, or afterwards via the pencil icon
to edit the attributes - this is fast as it doesn't change the file).

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] New version_string tag in tool XML

2011-07-07 Thread Peter Cock
Hi all,

This new feature looks very exciting for tracking reproducibility -
now in theory both the tool wrapper version (explicit in the XML)
and any underlying tool version will both be recorded:

https://bitbucket.org/galaxy/galaxy-central/changeset/84b20f29dfdf

I have updated the NCBI BLAST+ wrappers to use the new
feature, please merge/transplant this branch/commit:

https://bitbucket.org/peterjc/galaxy-central/src/blast_2011_07_07
https://bitbucket.org/peterjc/galaxy-central/changeset/11f425f4e137

I hope the new tool version string will get its own field in the
database soon (and be viewable via the web interface). Note
that string may well be multi-line, e.g.

$ blastp -version
blastp: 2.2.25+
Package: blast 2.2.25, build Apr  6 2011 13:59:28

Unless that is you go for more fragile solutions like these to
ensure a single line:

$ samtools 21 | grep -i ^Version
Version: 0.1.16 (r963:234)

$ bwa 21 | grep -i ^Version
Version: 0.5.9-r16

Thanks,

Peter

P.S. I would say the name version_string is a bit confusing,
I'd have picked version_cmd or version_command instead.
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


  1   2   3   4   5   6   7   8   9   10   >