[galaxy-user] Text Manipulation: Filter out duplicates (uniq) from an plain text file ?

2011-05-06 Thread Roman Valls
Hey galaxy users,

Thats a fairly good question from one of my colleagues. I've looked
through the menus (mainly Text Manipulation and Filter and
Sort(Select)), googled (on the mailing list archives too), but couldn't
find an answer: How should I remove duplicates on plain text files
without resorting to:

echo file|sort|uniq before uploading the file/text.

or

Putting a regexp together to replace the duplicate occurences as in:

http://www.regular-expressions.info/duplicatelines.html


I'm pretty sure I'm missing some really basic stuff here... is this
basic operation something supposed to be done outside galaxy perhaps ?

Thanks in advance !

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Text Manipulation: Filter out duplicates (uniq) from an plain text file ?

2011-05-06 Thread Peter Cock
On Fri, May 6, 2011 at 3:16 PM, Roman Valls brainst...@nopcode.org wrote:
 Well, having similarly basic tools (in Galaxy) that can be performed on
 the commandline, such as sort or cut I just wondered how come a
 uniq is not there on the tool panel in some form/name.

 Thanks for the feedback Rory !

That's a timely question - I was also looking for something within Galaxy
to take a text file and remove duplicate lines.

Peter
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Text Manipulation: Filter out duplicates (uniq) from an plain text file ?

2011-05-06 Thread Guru Ananda
Hi Peter and Roman,

The Count tool under Statistics section provides uniq-like
functionality. If you run this tool by selecting all columns under Count
occurrences of values in column(s) field, your output will contain one line
per record, with the 1st column containing the number of occurrences of each
record.

Hope this answers your question.
Thanks for using Galaxy,
Guru.


On Fri, May 6, 2011 at 10:22 AM, Peter Cock p.j.a.c...@googlemail.comwrote:

 On Fri, May 6, 2011 at 3:16 PM, Roman Valls brainst...@nopcode.org
 wrote:
  Well, having similarly basic tools (in Galaxy) that can be performed on
  the commandline, such as sort or cut I just wondered how come a
  uniq is not there on the tool panel in some form/name.
 
  Thanks for the feedback Rory !

 That's a timely question - I was also looking for something within Galaxy
 to take a text file and remove duplicate lines.

 Peter
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

  http://lists.bx.psu.edu/




-- 
Graduate student, Bioinformatics and Genomics
Makova lab/Galaxy team
Penn State University
505 Wartik lab
University Park PA 16802
g...@psu.edu
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] RNA seq analysis

2011-05-06 Thread puvan001

Hi

I have a couple of questions regarding RNA seq analysis. My questions are
1.I need to use a viral genome (very small, ~2kb ) as a reference genome 
and it is not available in Galaxy. I guess I can use this data from my 
history. I have a fasta file but I am not sure whether I have to do some 
kind of indexing or not.


2. In Tophat, default for maximum number of alignments to be allowed is 
40. What my understanding is a single read can be aligned maximum 40 
different places. I am wondering why this is 40. Is there any specific 
reason? If I need unique mapping, I have to use 1 instead of 40. Am I 
correct?



Thanks

SP




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread David Matthews
Hi,

I have done exactly the same kind of thing for adenovirus so I can help with 
it. In answer to question 1 you do not need to index it will be done for you 
when tophat is called. Secondly you should leave the 40 multihits as it is and 
post analysis filter out the multihits - this will allow you to determine if 
you do have a multihit problem or not and if so whether it is a big problem and 
where it is on the genome. I have a workflow on Galaxy which you can use called 
Bristol workflow to get sorted unique proper pair mapped reads. If you plug 
in your sam file it should give you files listing only unique hits and those 
which map more than once. This workflow assumes you have paired end data but it 
can be modified to work with single end reads as well.

Hope this helps.


Best Wishes,
David.

__
Dr David A. Matthews

Senior Lecturer in Virology
Room E49
Department of Cellular and Molecular Medicine,
School of Medical Sciences
University Walk,
University of Bristol
Bristol.
BS8 1TD
U.K.

Tel. +44 117 3312058
Fax. +44 117 3312091

d.a.matth...@bristol.ac.uk






On 6 May 2011, at 17:09, puvan...@umn.edu wrote:

 Hi
 
 I have a couple of questions regarding RNA seq analysis. My questions are
 1.I need to use a viral genome (very small, ~2kb ) as a reference genome and 
 it is not available in Galaxy. I guess I can use this data from my history. I 
 have a fasta file but I am not sure whether I have to do some kind of 
 indexing or not.
 
 2. In Tophat, default for maximum number of alignments to be allowed is 40. 
 What my understanding is a single read can be aligned maximum 40 different 
 places. I am wondering why this is 40. Is there any specific reason? If I 
 need unique mapping, I have to use 1 instead of 40. Am I correct?
 
 
 Thanks
 
 SP
 
 
 
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
 http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
 http://lists.bx.psu.edu/

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread puvan001


Hi David,

Thanks!When I tried to run Tophat, it doesn't recognise my FASTA file and 
it says History does not include a dataset of the required format / 
build. Do you have any thoughts about this?


Now it makes more sense about multihits. Thanks for sharing your 
workflow.


With regards

Sumathy

On May 6 2011, David Matthews wrote:


Hi,

I have done exactly the same kind of thing for adenovirus so I can help 
with it. In answer to question 1 you do not need to index it will be done 
for you when tophat is called. Secondly you should leave the 40 multihits 
as it is and post analysis filter out the multihits - this will allow you 
to determine if you do have a multihit problem or not and if so whether it 
is a big problem and where it is on the genome. I have a workflow on Galaxy 
which you can use called Bristol workflow to get sorted unique proper pair 
mapped reads. If you plug in your sam file it should give you files 
listing only unique hits and those which map more than once. This workflow 
assumes you have paired end data but it can be modified to work with single 
end reads as well.


Hope this helps.


Best Wishes,
David.

__
Dr David A. Matthews

Senior Lecturer in Virology
Room E49
Department of Cellular and Molecular Medicine,
School of Medical Sciences
University Walk,
University of Bristol
Bristol.
BS8 1TD
U.K.

Tel. +44 117 3312058
Fax. +44 117 3312091

d.a.matth...@bristol.ac.uk






On 6 May 2011, at 17:09, puvan...@umn.edu wrote:


Hi

I have a couple of questions regarding RNA seq analysis. My questions are
  1.I need to use a viral genome (very small, ~2kb ) as a reference genome 
and it is not available in Galaxy. I guess I can use this data from my 
history. I have a fasta file but I am not sure whether I have to do some 
kind of indexing or not.


  2. In Tophat, default for maximum number of alignments to be allowed 
is 40. What my understanding is a single read can be aligned maximum 40 
different places. I am wondering why this is 40. Is there any specific 
reason? If I need unique mapping, I have to use 1 instead of 40. Am I 
correct?



Thanks

SP



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/





--
Sumathy Puvanendiran
Graduate student



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread Austin Paul
Hi,

You need to run fastq groomer on your rna-seq data.  Your reference is fine
as a fasta.

Austin

On Fri, May 6, 2011 at 10:26 AM, puvan...@umn.edu wrote:


 Hi David,

 Thanks!When I tried to run Tophat, it doesn't recognise my FASTA file and
 it says History does not include a dataset of the required format / build.
 Do you have any thoughts about this?

 Now it makes more sense about multihits. Thanks for sharing your
 workflow.

 With regards

 Sumathy


 On May 6 2011, David Matthews wrote:

 Hi,

 I have done exactly the same kind of thing for adenovirus so I can help
 with it. In answer to question 1 you do not need to index it will be done
 for you when tophat is called. Secondly you should leave the 40 multihits as
 it is and post analysis filter out the multihits - this will allow you to
 determine if you do have a multihit problem or not and if so whether it is a
 big problem and where it is on the genome. I have a workflow on Galaxy which
 you can use called Bristol workflow to get sorted unique proper pair mapped
 reads. If you plug in your sam file it should give you files listing only
 unique hits and those which map more than once. This workflow assumes you
 have paired end data but it can be modified to work with single end reads as
 well.


 Hope this helps.


 Best Wishes,
 David.

 __
 Dr David A. Matthews

 Senior Lecturer in Virology
 Room E49
 Department of Cellular and Molecular Medicine,
 School of Medical Sciences
 University Walk,
 University of Bristol
 Bristol.
 BS8 1TD
 U.K.

 Tel. +44 117 3312058
 Fax. +44 117 3312091

 d.a.matth...@bristol.ac.uk






 On 6 May 2011, at 17:09, puvan...@umn.edu wrote:

 Hi

 I have a couple of questions regarding RNA seq analysis. My questions are

  1.I need to use a viral genome (very small, ~2kb ) as a reference genome
 and it is not available in Galaxy. I guess I can use this data from my
 history. I have a fasta file but I am not sure whether I have to do some
 kind of indexing or not.


  2. In Tophat, default for maximum number of alignments to be allowed
 is 40. What my understanding is a single read can be aligned maximum 40
 different places. I am wondering why this is 40. Is there any specific
 reason? If I need unique mapping, I have to use 1 instead of 40. Am I
 correct?



 Thanks

 SP



 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

 http://lists.bx.psu.edu/




 --
 Sumathy Puvanendiran
 Graduate student




 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

  http://lists.bx.psu.edu/

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread puvan001


Hi Austin

I did all these (grooming and trimming)on rna-seq data and I don't have a 
problem with built in genome . I'll try again!



Thanks

Sumathy




On May 6 2011, Austin Paul wrote:


Hi,

You need to run fastq groomer on your rna-seq data.  Your reference is fine
as a fasta.

Austin

On Fri, May 6, 2011 at 10:26 AM, puvan...@umn.edu wrote:



Hi David,

Thanks!When I tried to run Tophat, it doesn't recognise my FASTA file and
  it says History does not include a dataset of the required format / 
build.

Do you have any thoughts about this?

Now it makes more sense about multihits. Thanks for sharing your
workflow.

With regards

Sumathy


On May 6 2011, David Matthews wrote:

Hi,


I have done exactly the same kind of thing for adenovirus so I can help

with it. In answer to question 1 you do not need to index it will be done
  for you when tophat is called. Secondly you should leave the 40 
multihits as

it is and post analysis filter out the multihits - this will allow you to
  determine if you do have a multihit problem or not and if so whether it 
is a
  big problem and where it is on the genome. I have a workflow on Galaxy 
which
  you can use called Bristol workflow to get sorted unique proper pair 
mapped
  reads. If you plug in your sam file it should give you files listing 
only

unique hits and those which map more than once. This workflow assumes you
  have paired end data but it can be modified to work with single end 
reads as

well.



Hope this helps.


Best Wishes,
David.

__
Dr David A. Matthews

Senior Lecturer in Virology
Room E49
Department of Cellular and Molecular Medicine,
School of Medical Sciences
University Walk,
University of Bristol
Bristol.
BS8 1TD
U.K.

Tel. +44 117 3312058
Fax. +44 117 3312091

d.a.matth...@bristol.ac.uk






On 6 May 2011, at 17:09, puvan...@umn.edu wrote:

Hi


I have a couple of questions regarding RNA seq analysis. My questions 
are


1.I need to use a viral genome (very small, ~2kb ) as a reference 
genome

and it is not available in Galaxy. I guess I can use this data from my
history. I have a fasta file but I am not sure whether I have to do some
kind of indexing or not.




 2. In Tophat, default for maximum number of alignments to be allowed

is 40. What my understanding is a single read can be aligned maximum 40
different places. I am wondering why this is 40. Is there any specific
reason? If I need unique mapping, I have to use 1 instead of 40. Am I
correct?





Thanks

SP



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/






--
Sumathy Puvanendiran
Graduate student




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/





--
Sumathy Puvanendiran
Graduate student


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


[galaxy-user] Composite Datatypes Q.

2011-05-06 Thread Todd Yilk
I have a program I'm trying to galaxify that emits a variable number of 
result files. I would like the output of my Galaxy tool to show up in Galaxy as 
an html file with links to the result files. So when you click on the eye, the 
html file should up in the middle pane ... sorry if I'm not describing this in 
an elegant way.

Creating a composite datatype is the way to go in this situation, correct? I'm 
creating a class that inherits from Html. How do I get the result returned from 
my custom generate_primary_file function to show up as a tool's output ... if 
that's the right way to go about this?

Thanks!
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-user] Megblast GIs

2011-05-06 Thread Douglas Rhoads
We have a local install of Galaxy and are using it for training grad and 
undergrad students 
(using the Windshield Splatter data).  We have a relatively new install and the 
Megablast 
seems to be doing something funny with the output in that column 2 which is 
supposed to be 
the GI of the database hit is now two numbers joined with an underline (see 
below).  I need to 
tell my IT people what is wrong so they can correct the problem.

BL1-1-3 157649053_11175091.30   46  4   0   10  55  
58541   58496   2e-
06  60.0
BL1-1-5 161788633_175843100.00  26  0   0   15  40  
48812   48787   4e-
04  52.0

-- 

Douglas Rhoads, Professor of Biological Sciences
Director of Graduate Program in Cell and Molecular Biology
601 SCEN, Dept. of Biological Sciences,
1 University of Arkansas, Fayetteville, AR 72701-1201
drhoads(at)uark.edu, http://www.uark.edu/ua/drhoads
phone: 479-575-3251  FAX: 479-575-4010


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread Austin Paul
There are many ways.  I typically use IGV.  It needs a sam file, so I first
convert the bam to sam in galaxy, then download the sam file.  In IGV, I
upload the reference and the sam file, then use IGVtools to index the sam
file, then I can visualize the data.

Austin
On Fri, May 6, 2011 at 5:30 PM, puvan...@umn.edu wrote:

 Hello

 I was able to run RNA seq data against a custom build genome. How can I
 visualize the results. I tried via trackster and unfortunately I couldn't.
 Can you help me?


 Thanks

 Sumathy

 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

  http://lists.bx.psu.edu/

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread Austin Paul
Oops.  Good to know.  Thanks.

Austin

On Fri, May 6, 2011 at 6:02 PM, Sean Davis sdav...@mail.nih.gov wrote:

 IGV reads BAM files just fine; no need to convert to SAM.

 Sean

 On Fri, May 6, 2011 at 8:45 PM, Austin Paul austi...@usc.edu wrote:

 There are many ways.  I typically use IGV.  It needs a sam file, so I
 first convert the bam to sam in galaxy, then download the sam file.  In IGV,
 I upload the reference and the sam file, then use IGVtools to index the sam
 file, then I can visualize the data.

 Austin
 On Fri, May 6, 2011 at 5:30 PM, puvan...@umn.edu wrote:

 Hello

 I was able to run RNA seq data against a custom build genome. How can I
 visualize the results. I tried via trackster and unfortunately I couldn't.
 Can you help me?


 Thanks

 Sumathy

 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

  http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread Jim Robinson

Hi Vasu,

I'm going to add the function to index BAM files soon, using Picard.
In the beginning  there was no java BAM reader, only SAM, and I  
added the index then.  Indexed BAMs came along later, but that's  
probably more than you want to know...I think most people will  
still use Galaxy to index as it can take a long time, but I agree with  
you on the convenience factor.


Jim


On May 6, 2011, at 9:36 PM, vasu punj wrote:

One of the problem is IGV dont have option of creating index file so  
one has to create index file in Galaxy first to  view in IGV. Jim I  
have been using IGV 2 beta version it is great work but How hard is  
to include index functionality with in IGV. I know we can use sam  
tools also but just for convinence if it is not that much of work.

Vasu

--- On Fri, 5/6/11, Sean Davis ssdav...@mail.nih.gov wrote:

From: Sean Davis sdav...@mail.nih.gov
Subject: Re: [galaxy-user] RNA seq analysis
To: Austin Paul austi...@usc.edu
Cc: galaxy-user@lists.bx.psu.edu galaxy-user@lists.bx.psu.edu, puvan...@umn.edu 
 puvan...@umn.edu

Date: Friday, May 6, 2011, 8:02 PM

IGV reads BAM files just fine; no need to convert to SAM.
Sean

On Fri, May 6, 2011 at 8:45 PM, Austin Paul austi...@usc.edu wrote:
There are many ways.  I typically use IGV.  It needs a sam file, so  
I first convert the bam to sam in galaxy, then download the sam  
file.  In IGV, I upload the reference and the sam file, then use  
IGVtools to index the sam file, then I can visualize the data.


Austin
On Fri, May 6, 2011 at 5:30 PM, puvan...@umn.edu wrote:
Hello

I was able to run RNA seq data against a custom build genome. How  
can I visualize the results. I tried via trackster and unfortunately  
I couldn't. Can you help me?



Thanks

Sumathy

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/



-Inline Attachment Follows-

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread puvan001



Hi

I may be doing in a wrong way. I clicked trackster and I added the custom 
build genome. Since it is a very small genome (~2kb), I considered this as 
a single contig. Then I cliked add tracks and added my data file. But I 
got a message no data for this contig. Whenever I used built in genomes I 
did not have any problem. I guess I am doing something wrong here.



Sumathy










On May 6 2011, Jeremy Goecks wrote:


Sumathy,

What kind of problems are you having with Trackster?

J.

On May 6, 2011, at 8:30 PM, puvan...@umn.edu wrote:


Hello

  I was able to run RNA seq data against a custom build genome. How can I 
visualize the results. I tried via trackster and unfortunately I couldn't. 
Can you help me?



Thanks

Sumathy
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/





--
Sumathy Puvanendiran
Graduate student


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] RNA seq analysis

2011-05-06 Thread vasu punj
Thanks Jim,
 
Vasu
 
--- On Fri, 5/6/11, Jim Robinson jrobi...@broadinstitute.org wrote:


From: Jim Robinson jrobi...@broadinstitute.org
Subject: Re: [galaxy-user] RNA seq analysis
To: vasu punj pu...@yahoo.com
Cc: Austin Paul austi...@usc.edu, Sean Davis sdav...@mail.nih.gov, 
galaxy-user@lists.bx.psu.edu galaxy-user@lists.bx.psu.edu, 
puvan...@umn.edu puvan...@umn.edu
Date: Friday, May 6, 2011, 9:01 PM


Hi Vasu,

I'm going to add the function to index BAM files soon, using Picard.   In the 
beginning  there was no java BAM reader, only SAM, and I added the index 
then.  Indexed BAMs came along later, but that's probably more than you want to 
know...    I think most people will still use Galaxy to index as it can take a 
long time, but I agree with you on the convenience factor.

Jim


On May 6, 2011, at 9:36 PM, vasu punj wrote:

 One of the problem is IGV dont have option of creating index file so one has 
 to create index file in Galaxy first to  view in IGV. Jim I have been using 
 IGV 2 beta version it is great work but How hard is to include index 
 functionality with in IGV. I know we can use sam tools also but just for 
 convinence if it is not that much of work.
 Vasu
 
 --- On Fri, 5/6/11, Sean Davis ssdav...@mail.nih.gov wrote:
 
 From: Sean Davis sdav...@mail.nih.gov
 Subject: Re: [galaxy-user] RNA seq analysis
 To: Austin Paul austi...@usc.edu
 Cc: galaxy-user@lists.bx.psu.edu galaxy-user@lists.bx.psu.edu, 
 puvan...@umn.edu puvan...@umn.edu
 Date: Friday, May 6, 2011, 8:02 PM
 
 IGV reads BAM files just fine; no need to convert to SAM.
 Sean
 
 On Fri, May 6, 2011 at 8:45 PM, Austin Paul austi...@usc.edu wrote:
 There are many ways.  I typically use IGV.  It needs a sam file, so I first 
 convert the bam to sam in galaxy, then download the sam file.  In IGV, I 
 upload the reference and the sam file, then use IGVtools to index the sam 
 file, then I can visualize the data.
 
 Austin
 On Fri, May 6, 2011 at 5:30 PM, puvan...@umn.edu wrote:
 Hello
 
 I was able to run RNA seq data against a custom build genome. How can I 
 visualize the results. I tried via trackster and unfortunately I couldn't. 
 Can you help me?
 
 
 Thanks
 
 Sumathy
 
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
  http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
  http://lists.bx.psu.edu/
 
 
 
 -Inline Attachment Follows-
 
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
   http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
   http://lists.bx.psu.edu/
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
  http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
  http://lists.bx.psu.edu/

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/