Re: [galaxy-user] Merging of illumina paired end files

2011-09-15 Thread Jennifer Jackson

Hello Arun,

In brief, you will want to load your data via FTP, groom the files, and 
run BWA for Illumina with the paired-end data option (for mapping, not 
assembly).


Help to get started can be found at:
http://galaxyproject.org/wiki/Learn

An initial input of 30G would likely not be a problem, but it depends on 
what analysis steps you intend to do after mapping. A Cloudman option 
may be more suitable, please see:

http://galaxyproject.org/wiki/Big%20Picture/Choices

Hopefully this helps,

Best,

Jen
Galaxy team

On 9/14/11 7:06 PM, Arun Khattri wrote:

I have 3 illumina paired end reads of exome capture of the sample. I
want to assemble these reads to genome using tools available in Galaxy
(BWA etc). My concern is the amount of data that I  can analyzed and
when these reads should be merged. The total size of data is +30Gb.

Thanks,
Arun


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] cufflinks version

2011-09-15 Thread Jennifer Jackson

Hello Shantanu,

A version list for tools is at:
http://galaxyproject.org/Admin/Tools/Tool%20Dependencies

We are currently working on a global tool versioning strategy. As a 
result, users will notice more integration with the UI over the upcoming 
months.


Hopefully this helps,

Jen
Galaxy team

On 9/9/11 2:09 PM, Shantanu Pavgi wrote:


What version of cufflinks is available on the galaxy site? Is there any page 
where I can find version of all galaxy tools?

Thanks,
Shantanu.
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] Getting (or setting) physical file name

2011-09-15 Thread Peter Cock
On Thu, Sep 15, 2011 at 6:11 PM, Paul-Michael Agapow
paul-michael.aga...@hpa.org.uk wrote:
 So one of my colleagues has a script he wants to turn into a  Galaxy tool.
 The twist is that script:



 1.   Looks for files with a fixed name (e.g. “params.txt”)

 2.   Accepts other file names as commandline arguments, but the actual
 names of those files has arguments embedded in it  (e.g.
 “nuc_100iter_b.fasta” for nucleotide data in fasta format to be run against
 model b for 100 iterations.)



 I know, awkward and clumsy. But hardly unique for many historical
 bioinformatic tools. Anyway, the challenge for me is to pick the easiest
 path to port this script to a tool. And it seems to be fairly awkward under
 the Galaxy model as I understand it. Possibilities:



 1.   Rewrite the script argument parsing and invocation. Obviously,
 there will be resistance to this and with some justification (“I thought you
 said this could wrap any command line program …”)

If this is your own tool, this is the cleanest solution and helps
beyond just using it within Galaxy.

 2.   Write a script that calls the original script after moving and
 renaming files according to desired arguments. Any problems with a
 two-script/executable tool like this?

That's what I'd go for - a wrapper script which takes command line
arguments like a sane command line tool, and uses them to prepare
the input files for the weird script. Your tool should automatically be
called from a temp working directory so you can probably just make
the specially named files right there, and try using links to alias the
input files rather than copying them (faster as less IO).

 How do I specify the interpreter for both parts of the script?

If your script is marked as executable with a suitable hash bang,
it doesn't even need the Galaxy interpreter in the XML file.

For the internal script it doesn't matter at all - Galaxy doesn't
need to know.

Peter

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] run tophat in galaxy

2011-09-15 Thread Jennifer Jackson

=== Please use Reply All when responding to this email! ===

Hi Tao,

I made an error in my prior reply, it is possible to guide assembly in 
TopHat. To do this, on the TopHat form, change TopHat settings to use: 
from Use Defaults to Full parameter list. In the expanded form:


1 - change Use Own Junctions: to be yes.
2 - change Use Gene Annotation Model: to be yes
3 - in the new pull-down menu, select the GTF file from your history

Great question! Glad that we were able to provide you with the correct 
instruction,


Best,

Jen
Galaxy team

On 9/15/11 1:38 PM, Jennifer Jackson wrote:

=== Please use Reply All when responding to this email! ===

Hello Tao,

Sorry for the delayed reply, your question did not post to the mailing
list since the to was not _only_ to galaxy-user.

Going forward, please leave off any to or cc to team members when
asking a question. Send all questions directly to
galaxy-u...@bx.psu.edu and do not include any Re or Fwd text in
the subject line.

Regarding RNA-seq analysis and reference GTF files, the place to
incorporate the GTF file is in the Cufflinks step, the option to select
the GTF file from your history is on the tool's form. If you have
questions about the tools that are not addressed by these help links:

http://usegalaxy.org/u/jeremy/p/transcriptome-analysis-faq
http://usegalaxy.org/u/jeremy/p/galaxy-rna-seq-analysis-exercise

then contacting the tool authors would be the next step:
email tophat.cuffli...@gmail.com

To visualize the data, the available options will be links associated
with each dataset (expand the dataset box to locate these). The Galaxy
Track Browser (GTB) aka Trackster, UCSC Genome Browser, Ensembl, and
GeneTrack are potential options; the datatype will determine which links
are provided.

Hopefully this helps,

Best,

Jen
Galaxy team


 Original Message 
Subject: run tophat in galaxy
Date: Sun, 28 Aug 2011 08:50:04 -0700
From: Peng, Tao tp...@fhcrc.org
To: Jennifer Jackson j...@bx.psu.edu, galaxy-user
galaxy-user@lists.bx.psu.edu



Hi how can I specify a GTF gene annotation file when running tophat to
guide the alignment to human genome? What is the best way to visualize
the tophat results in the context of annotated human genome, i.e. RefSeq?

Thanks,

tao


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


[galaxy-user] mailing list

2011-09-15 Thread Sinnakaruppan MATHAVAN
Hi,
Include me in your mailing list.

Mathavan
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] run tophat in galaxy

2011-09-15 Thread shamsher jagat
I have related question If I have to use Ensembl mouse GTF file
(Mus_musculus.NCBIM37.64) Do I have to download and reformat it or Galaxy
can take it from the source directly?

Thanks

On Sun, Aug 28, 2011 at 8:50 AM, Peng, Tao tp...@fhcrc.org wrote:

  === Please use Reply All when responding to this email! ===


 **

 Hi how can I specify a GTF gene annotation file when running tophat to
 guide the alignment to human genome? What is the best way to visualize the
 tophat results in the context of annotated human genome, i.e. RefSeq?

 Thanks,

 tao


 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

  http://lists.bx.psu.edu/

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/