Re: [galaxy-user] Evaluating TopHat's results

2013-07-15 Thread Jennifer Jackson

Hello Thanh,

The tool  NGS: Picard (beta) - SAM/BAM Alignment Summary Metrics may 
be the tool you are looking for. There are others in this tool group 
that added up numbers in BAM or SAM files, and SAMTools has flagstat, 
so you could create you own calculation with one of those, plus a count 
on the fastq inputs, and the Compute tool, if it is not exactly right.


Are you using the public Main Galaxy instance at 
https://main.g2.bx.psu.edu/ (usegalaxy.org) clicking over to connect to 
the Genomic HyperBrowser, via web? Or are you doing something else? Can 
you give this another try this morning and see if it is working?


Hopefully the first part helped, let us know about the second,

Take care,

Jen
Galaxy team

On 7/14/13 1:47 PM, Hoang, Thanh wrote:

Hi,
I ran TopHat on Galaxy for my RNA-seq data. I want to analyze TopHat's 
output files, such as percentage of reads mapped to the genome...but I 
am not sure how to do that.
I am also trying to visualize the BAM file by IGB but the following 
error message appears :  Failed to authenticate to the server.


Anyone can help with these issues?

Thank so much
Thanh


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

   http://galaxyproject.org/search/mailinglists/


--
Jennifer Hillman-Jackson
Galaxy Support and Training
http://galaxyproject.org

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-user] Evaluating TopHat's results

2013-07-15 Thread Hoang, Thanh
Hi Jen,
Thanks so much for your advice.
*The tool  NGS: Picard (beta) - SAM/BAM Alignment Summary Metrics may be
the tool you are looking for. There are others in this tool group that
added up numbers in BAM or SAM files, and SAMTools has flagstat, so you
could create you own calculation with one of those, plus a count on the
fastq inputs, and the Compute tool, if it is not exactly right.*
*
*
I ran the Flagstat on my TopHat 's output BAM file. I am now confusing
about the result:

44066574 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
44066574 + 0 mapped (100.00%:-nan%)
0 + 0 paired in sequencing
0 + 0 read1
0 + 0 read2
0 + 0 properly paired (-nan%:-nan%)
0 + 0 with itself and mate mapped
0 + 0 singletons (-nan%:-nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ=5)

Note that my RNA-seq data is single-ended sequencing. The raw data for
this sample before mapping with TopHat only has 33174286 reads. My
question is why I have more reads mapped in the BAM file from TopHat's
output? and Does this BAM file contains the mapped reads only ( NOT
non-mapped reads)?

I have also tried the *SAM/BAM Alignment Summary Metrics  *tool. This
time I have 25541681 reads from BAM file ( the result seems only show
mapped reads). Is that the number I should expect?

Thank you

Thanh



On Mon, Jul 15, 2013 at 10:41 AM, Jennifer Jackson j...@bx.psu.edu wrote:

  Hello Thanh,

 The tool  NGS: Picard (beta) - SAM/BAM Alignment Summary Metrics may be
 the tool you are looking for. There are others in this tool group that
 added up numbers in BAM or SAM files, and SAMTools has flagstat, so you
 could create you own calculation with one of those, plus a count on the
 fastq inputs, and the Compute tool, if it is not exactly right.

 Are you using the public Main Galaxy instance at
 https://main.g2.bx.psu.edu/ (usegalaxy.org) clicking over to connect to
 the Genomic HyperBrowser, via web? Or are you doing something else? Can you
 give this another try this morning and see if it is working?

 Hopefully the first part helped, let us know about the second,

 Take care,

 Jen
 Galaxy team


 On 7/14/13 1:47 PM, Hoang, Thanh wrote:

 Hi,
 I ran TopHat on Galaxy for my RNA-seq data. I want to analyze TopHat's
 output files, such as percentage of reads mapped to the genome...but I am
 not sure how to do that.
 I am also trying to visualize the BAM file by IGB but the following error
 message appears :  Failed to authenticate to the server.

  Anyone can help with these issues?

  Thank so much
 Thanh


 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:

   http://galaxyproject.org/search/mailinglists/


 --
 Jennifer Hillman-Jackson
 Galaxy Support and Traininghttp://galaxyproject.org


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

[galaxy-user] Evaluating TopHat's results

2013-07-14 Thread Hoang, Thanh
Hi,
I ran TopHat on Galaxy for my RNA-seq data. I want to analyze TopHat's
output files, such as percentage of reads mapped to the genome...but I am
not sure how to do that.
I am also trying to visualize the BAM file by IGB but the following error
message appears :  Failed to authenticate to the server.

Anyone can help with these issues?

Thank so much
Thanh
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/