Course Announcement

NOTE: Please apply as soon as possible; the application period is exceptionally short for operational reasons.

*ARANGS13*
Automated and reproducible analysis of NGS data

  IMPORTANT DATES for ARANGS13
   Deadline for applications: October 8th 2013
   Notification of acceptance: October 15th 2013
   Course dates: October 21st - October 24th 2013

Course Description:
Next generation sequencing (NGS) technologies for DNA have produced an ever larger deluge of data, and researchers are finding that analysing such data sets is becoming the bottleneck in their work. In many cases, several steps in these analyses are fairly generic (e.g. quality control filtering, alignment to reference sequences, typing), so off-the-shelf pipelines can be applied. In other cases, novel research approaches require the development of new analysis pipelines. Either way, all analysis steps should be repeatable, and any changes made to the data (e.g. renaming, annotation, alignment) should be recorded so that the provenance of the results is clear and inferences are reproducible.

In this brief workshop we will establish several best practices for reproducibility and provenance recording in the (comparative) analysis of NGS data. We will work through use cases that illustrate the underlying principles and, in doing so, encounter the commonly used technologies that enable these best practices. Building on this foundation in workflow development, we will also show how custom-built workflows can be manipulated using graphical platforms (e.g. Galaxy, Taverna).

Best practices

    Standardized project organization
    Projects 'runnable' without user intervention
    Versioning to prevent loss of data, metadata, parameters or source code
    Sharing of scripts and workflows

Technologies

    Next generation sequencing platforms
    File formats (e.g. FASTQ, SAM/BAM, GFF3)
    Command-line executables, command-line scripting and batching
    High-level programming with domain-specific toolkits
    Revision control systems
    Workflow environments (both visual and command line)

Use cases

    Phylogenetic placement of metagenomic data
    Typing of pathogens
    Comparative analysis of multicellular genomic data
    Post-assembly: handling richly annotated genomes

More information, including application instructions, is available at:
http://gtpb.igc.gulbenkian.pt/bicourses/ARANGS13/

Thank you
Pedro Fernandes
GTPB coordinator

--
Pedro Fernandes
Instituto Gulbenkian de Ciência
Apartado 14
2781-901 OEIRAS
PORTUGAL
Tel +351 21 4407912
http://gtpb.igc.gulbenkian.pt





