[galaxy-user] SOLID RNA-Seq De Novo Transcriptome Assembly

Jennifer Jackson Wed, 30 Oct 2013 19:57:37 -0700

Hi Oscar,

You most likely want to explore tools that are designed specifically for this purpose, if the reference genome you are talking about is the assembled transcriptome. Trinity is one tool, but there are others in the Tool Shed and on some of the Public Servers.


Links:
http://wiki.galaxyproject.org/Support#Tools_on_the_Main_server
http://wiki.galaxyproject.org/Support#Custom_reference_genome
http://wiki.galaxyproject.org/BigPicture/Choices
http://wiki.galaxyproject.org/Tool%20Shed

Your question is a bit confusing because the 'annotations' may already be what these tools would produce and I am not sure what you are trying to do next. If it is the assignment of putative function, then there are many paths to follow, some better suited for viral genomes. You'll want to find out what others doing this exact work are using right now and consider the same tools. Start by checking out the public Galaxy servers; many have trial tools that you can later include in a local/cloud instance from the Tool Shed:
http://wiki.galaxyproject.org/PublicGalaxyServers

If your question was misunderstood (the reference genome is in fact a DNA genome - and you have RNA sequence to align), then the RNA-seq pipeline can be used as-is with 'Tophat for SOLiD', Cufflinks, CuffMerge, CuffDiff - all on a local/cloud/slipstream with the reference genome as a cluster reference genome. There is no requirement for reference annotation with any of these tools - it helps to gain full functionality, especially with CuffDiff, but is not required. More assistance is at tophat.cuffli...@gmail.com.
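
As a rough illustration of the tool order described above (a sketch, not an official Galaxy or TopHat/Cufflinks recipe), the Python snippet below only builds the command lines for an equivalent command-line run outside Galaxy; it does not execute anything. All file names, directory names, the sample layout, and the helper function names are hypothetical. The colorspace options shown (bowtie-build -C, tophat --color --quals) are, to the best of my knowledge, the ones documented in the Bowtie 1 and TopHat manuals, so check them against the versions you have installed. No reference annotation is passed, in line with the point that it is optional.

```python
import shlex

# Hypothetical placeholder names; substitute your own files.
REFERENCE_FASTA = "viral_genome.fa"
INDEX_BASE = "viral_genome_cs"


def assembly_list_text(samples):
    """Contents of the text file Cuffmerge reads: one transcripts.gtf per line."""
    return "\n".join(f"{name}_cufflinks/transcripts.gtf" for name in samples) + "\n"


def build_pipeline(samples):
    """Return the pipeline as a list of argument lists, in run order.

    `samples` maps a sample name to a (csfasta, qual) pair of file names.
    """
    commands = [
        # Build a colorspace Bowtie 1 index of the custom reference.
        ["bowtie-build", "-C", REFERENCE_FASTA, INDEX_BASE],
    ]
    for name, (csfasta, qual) in samples.items():
        # Spliced alignment of colorspace reads with separate quality files.
        commands.append(["tophat", "--color", "--quals",
                         "-o", f"{name}_tophat", INDEX_BASE, csfasta, qual])
        # Assemble transcripts from the alignments (no annotation supplied).
        commands.append(["cufflinks", "-o", f"{name}_cufflinks",
                         f"{name}_tophat/accepted_hits.bam"])
    # Merge the per-sample assemblies listed in assemblies.txt.
    commands.append(["cuffmerge", "-s", REFERENCE_FASTA,
                     "-o", "merged", "assemblies.txt"])
    # Compare expression between samples using the merged assembly.
    bams = [f"{name}_tophat/accepted_hits.bam" for name in samples]
    commands.append(["cuffdiff", "-o", "cuffdiff_out",
                     "merged/merged.gtf", *bams])
    return commands


if __name__ == "__main__":
    example = {
        "control": ("control.csfasta", "control_QV.qual"),
        "infected": ("infected.csfasta", "infected_QV.qual"),
    }
    for cmd in build_pipeline(example):
        print(shlex.join(cmd))
```

Note that CuffDiff compares conditions, so it only makes sense with at least two samples. Newer TopHat2 releases default to Bowtie 2, which does not handle colorspace, so an extra option to fall back to Bowtie 1 is likely needed there; please confirm this in the manual for your version.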


Hopefully this helps,

Jen
Galaxy team

On 10/24/13 6:06 PM, Oscar Aguilar wrote:

Hi Dr. Jackson,
I'm sorry to bother you but I have been searching for answers but I can't seem to find any and I'm sure that you would be able to answer my question.
So I am trying to find a novel gene using de novo transcriptome assembly and I see that TopHat might just be able to help me out with my dilemma. The viral genome is not available on the Galaxy website, and the other issue is that I am using SOLID data. So my question is, can I use TopHat with SOLID data by converting to nucleotide base fastq? Or do I have to use TopHat2 with a colourspace viral genome? I also have to admit that I am completely new to bioinformatics and my project has led me here so I am trying to tackle it on my own.
For the custom genome, I have managed to load it (in fasta, and annotation in BED) but I am not sure how to assign the annotations to the genome. Also, does TopHat require an annotated genome? I read that it doesn't but I'm not sure... I fear that my gene is a spliced one and I would like to be able to pull it out from output data.
I'm sorry to bother you as I'm sure the answer is out there, I just really can't seem to find it and am now desperate.
Thank you in advance,
Oscar


--
Oscar A. Aguilar, M.Sc
PhD Candidate
Sunnybrook Research Institute
Department of Immunology
University of Toronto
416-480-6100x89492
oscar.agui...@utoronto.ca


--
Jennifer Hillman-Jackson
http://galaxyproject.org
