Hello Davide,
If the data does represent alignments (not just sequence), then there is
one more item to check for. Another team member reminded me that
Cufflinks requires an "XS" custom tag in an input SAM file (or any
compressed BAM file that represents it). Details are here from the
Cufflinks manual:
http://cufflinks.cbcb.umd.edu/manual.html#cufflinks_input :
--
Cufflinks takes a text file of SAM alignments, or a binary SAM (BAM)
file as input. For more details on the SAM format, see the specification
<http://samtools.sourceforge.net/SAM1.pdf>. The RNA-Seq read mapper
TopHat <http://tophat.cbcb.umd.edu/> produces output in this format, and
is recommended for use with Cufflinks. However Cufflinks will accept SAM
alignments generated by any read mapper. Here's an example of an
alignment Cufflinks will accept:
s6.25mer.txt-913508 16 chr1 4482736 255 14M431N11M * 0 0 \
CAAGATGCTAGGCAAGTCTTGGAAG IIIIIIIIIIIIIIIIIIIIIIIII NM:i:0 XS:A:-
Note the use of the custom tag XS. This attribute, which must have a
value of "+" or "-", indicates which strand the RNA that produced this
read came from. While this tag can be applied to any alignment,
including unspliced ones, it *must* be present for all spliced alignment
records (those with a 'N' operation in the CIGAR string).
This should be fairly easy to check for now that the data is in
uncompressed SAM format. Running the pipeline starting with Tophat may
be the best choice. If you using the public Main server and have
continued problems not covered in our tutorial/FAQ, a bug report can be
submitted from error datasets:
http://wiki.galaxyproject.org/Support#Reporting_tool_errors
Hopefully this helps,
Jen
Galaxy team
On 12/17/12 11:46 PM, Davide Degli Esposti wrote:
Dear Jen,
Thank you very much for your help. As you mentioned, it seems likely a
problem of the input. I tried to transform the bam file in sam at your
Galaxy site and then to run cufflinks and I have got the three output
files. Do you think it is an acceptable way to avoid the obstacle? I
am going to perform some controls checking for the results obtained
from other methods.
Thank you again for your help
Best,
davide
---
Davide Degli Esposti, PhD
Epigenetic (EGE) Group
International Agency for Research on Cancer
Tel. +33 4 72738036
Fax. +33 4 72738322
150, cours Albert Thomas
69372 Lyon Cedex 08
France
------------------------------------------------------------------------
*From:* Jennifer Jackson [j...@bx.psu.edu]
*Sent:* Tuesday, December 18, 2012 2:53 AM
*To:* Davide Degli Esposti
*Cc:* galaxy-user@lists.bx.psu.edu
*Subject:* Re: [galaxy-user] cufflinks analysis using .bam files
generated by LifeScope (ABI 5500 Sequencer)
Hello Davide,
The fact that you are not getting any error points to some problem
with the input. Perhaps you are sending just sequence data in BAM
format to Cufflinks, without any alignment performed first? Some sort
of error would be expected for most other cases, but this is not the
Galaxy server our team hosts, it is difficult to state exactly what
the issue may be, just offer suggestions.
Tophat will require fastq files as input, unless this alternate Galaxy
site has a modified wrapper. Then the alignments generated by Tophat
(or another alignment tool, sometimes Bowtie is used) in BAM format
are the input to Cuffinks (along with other optional data).
If your data are aligned BAM, and you continue to have problems with
this alternate Galaxy site, it would be best to contact the group that
runs it - the information is on their home page (middle panel) when
you follow the url.
You could also decide to use the public Galaxy instance run by our
core project team at http://usegalaxy.org, if we have the tool set you
wish to use. A generalized tutorial for RNA-seq analysis is available
here:
http://main.g2.bx.psu.edu/u/jeremy/p/galaxy-rna-seq-analysis-exercise
And some troubleshooting help here:
http://wiki.galaxyproject.org/Support#Tools_on_the_Main_server
The tool author's original documentation would be good to review as well:
http://tophat.cbcb.umd.edu
http://cufflinks.cbcb.umd.edu
Best,
Jen
Galaxy team
On 12/13/12 11:47 AM, Davide Degli Esposti wrote:
Hello,
I am new in using Galaxy and I am working on .bam files generated by
our sequencing platform, using the LifeScope software associated to
ABI 5500 sequencer. I uploaded my files on a galaxy browser (
http://galaxy.raetschlab.org) and I tried to run cufflink assemble
and quantify reads expression levels for each file. However, when I
run cufflinks (using default parameters) the output is an empty file.
What is going wrong? Should I use special parameters? Are the .bam
files generated by LifeScope suitable for cufflink analysis or should
I transform the xsq ABI output in a fastq and then apply TopHat?
I thank you very much for your help
Davide
---
Davide Degli Esposti, PhD
Epigenetic (EGE) Group
International Agency for Research on Cancer
Tel. +33 4 72738036
Fax. +33 4 72738322
150, cours Albert Thomas
69372 Lyon Cedex 08
France
/
------------------------------------------------------------------------------------------------
This message and its attachments are strictly confidential. If you
are not
the intended recipient of this message, please immediately notify the
sender
and delete it. Since its integrity cannot be guaranteed, its content
cannot
involve the sender's responsibility. Any misuse, any disclosure or
publication
of its content, either whole or partial, is prohibited, exception
made of
formally approved use.
------------------------------------------------------------------------------------------------/
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/
--
Jennifer Jackson
http://galaxyproject.org
/
------------------------------------------------------------------------------------------------
This message and its attachments are strictly confidential. If you are not
the intended recipient of this message, please immediately notify the
sender
and delete it. Since its integrity cannot be guaranteed, its content
cannot
involve the sender's responsibility. Any misuse, any disclosure or
publication
of its content, either whole or partial, is prohibited, exception made of
formally approved use.
------------------------------------------------------------------------------------------------/
--
Jennifer Jackson
http://galaxyproject.org
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/