Re: [galaxy-dev] Help Needed

Jennifer Jackson Fri, 03 Jan 2014 13:00:15 -0800

Hello Elsayed,

Your protocol below seems to be a mix of a variant detection and anRNA-seq workflow. To build a workflow for RNA-seq, you will want tocompare your steps with the protocols in the link that I sent you.

If you want more examples, many more can be found here (for both RNA-seqand variant analysis, these are distinct analysis). Some have workflowsthat you can import and use directly or modify.


   https://wiki.galaxyproject.org/Learn#Other_Tutorials

   Shared Data -> Published Pages

In particular, there is a sample workflow in this tutorial that willhelp you to understand what the different steps are for and what orderthey go in.

https://usegalaxy.org/u/jeremy/p/galaxy-rna-seq-analysis-exercise

Best,

Jen
Galaxy team



On 1/3/14 10:57 AM, Elsayed Hejazy wrote:

*Dear Dr. Jennifer,*
A lot of thanks for your reply its really mean alot for me.
i am still NGS junior i trying to do the following steps kindly giveme the right order for these proceduresData Description: i have to samples each sample consists of 14 FASTQfile (7 forward and 7 reverse ) i think this mean its paired end fromIllumina then i will try the following workflow to got best results1- Drag tow input dataset into workflow one for forward sequences fileand one for reverse to use paired end option in TOPHAT tool later andwhen i run this workflow i will select multiple selection for the 7forward files to analyse all of them at the same time2- Drag FASTQC and link with last step for each to got if these filemay be illmina 1.8 version or older.3- Drag FASTQ Groomer and link with last step if files older than 1.8version to prepare as .FASTQSANGER format.4- Drag Filter FASTQ and link with last step to remove redundancy ofsequences.
5- Drag FASTQ trimmer to remove unwanted ends of sequenced may occur
6- Drag Manipulate FASTQ and link with last step (i dont know why).
the above six steps done twice to generate to files as output to makeas input for the following steps.7- Drag TOPHAT for illumina and make it accept paired end files andlink each file generated from QC to TOPHAT this step used to align andmap with reference genome.8- Drag Cufflinks and link with aligned BAM file generated from TOPHATto create an assembly9- Drag Cuffmerge and link with GTF file from Cufflinks this step tomerge all assemblies generated in Cufflinks10- Drag Cuffcompare and link with last step to got detailed reportsfor accuracy of all generated assemblies.11- Drag MPileup and link with TOPHAT BAM file to generate filecontaining SNPs sites.12- Drag Pileup-to-Interval and link with MPileup step to filter thenumber of output SNPs to successive one or the most accurate. (i dontknow what is the difference between this tool and Filter Pileup).- i dont know what is the tools used to know the copy number variationCNVi need to know how to separate human sequences from sample mayinfected with any other sequences (is this at alignment stage)i need to know the perfect order of steps if this order is notcompletely right.
Is this what i should do to make a good NGS workflow got all possibleinformation form dataset
Really i am so so so sorry for disturbance - waiting your reply
Best Regards,
elsayed
On Thu, Jan 2, 2014 at 6:05 PM, Jennifer Jackson <[email protected]<mailto:[email protected]>> wrote:
    Hello Elsayed,

    Protocol help for RNA-seq analysis can be found here:
    https://wiki.galaxyproject.org/Support#Tools_on_the_Main_server:_RNA-seq

    The QA/QC steps should be done before mapping, on individual
    datasets (such as replicates) or on partial or merged datasets as
    needed (if that is how the data was sequenced, just be
    consistent). Only keep in mind that the larger the dataset, the
    more compute some of these steps can require.

    Hopefully this helps,

    Jen
    Galaxy team


    On 1/1/14 1:32 AM, Elsayed Hejazy wrote:
    i need to know more about the order of steps for RNA Seq data
    analysis
    Is alignment and assembly should done first to combine all FASTQ
    reads into one file and start analysis or i should do Quality
    Control ,filtering  , trimming and manipulation first and do
    alignment and assembly at the end of analysis to use tophat for
    example or any other analysis tools in Galaxy

    Thank you
    elsayed


    ___________________________________________________________
    Please keep all replies on the list by using "reply all"
    in your mail client.  To manage your subscriptions to this
    and other Galaxy lists, please use the interface at:
       http://lists.bx.psu.edu/

    To search Galaxy mailing lists use the unified search at:
       http://galaxyproject.org/search/mailinglists/
--Jennifer Hillman-Jackson
    http://galaxyproject.org


--
Jennifer Hillman-Jackson
http://galaxyproject.org

___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Help Needed

Reply via email to