Re: [galaxy-user] Using galaxy for Bacterial RNA-seq

2012-02-17 Thread Ateequr Rehman
Dear Noa and Bomba

i am also in situation like you, hardly any programming experiance but want to 
analyse, bacterial tranmscriptome before and after an stress. 

I am very new to galaxy, would be obliged if any one can give me more hints, i 
have few queries
In Brief;  we did our sequnecing by Hiseq 2000, from our partner i got two 
files (FASTQ) for each treatment, for example R1 and R2


i was sucessful to upload these files (Fastq) and reference genome file (Fasta) 
via FTP, 
 
after upload, i run the Fastq groomer, with my understanding i must quality 
filter my sequences, but i am not sure what shoud be the cutoff. 


after reading Noa email, seems like i should do Bowtie mapping, after groomer , 
and filter is is there any other middle  step also before i go to bowtie
Another question is, should i join both the files (fastq joiner) at any stage 
of analysis 


Bomba could you please send me link as you stated, for RNA seqq analysis on 
Univerity of albama website, although it is for eukaryote, but may be will help 
me to understand the steps

Thanking you all





 From: Noa Sher noa.s...@gmail.com
To: Jeremy Goecks jeremy.goe...@emory.edu 
Cc: galaxy-user@lists.bx.psu.edu (galaxy-user@lists.bx.psu.edu) 
galaxy-user@lists.bx.psu.edu; closetic...@galaxyproject.org; Bomba Dam 
bomba@mpi-marburg.mpg.de 
Sent: Thursday, February 16, 2012 8:31 PM
Subject: Re: [galaxy-user] Using galaxy for Bacterial RNA-seq
 

 
Hi Bomba
I have been recently struggling with the same issue (RNA-Seq on a bacteria; no 
programming experience).
I was in touch with the authors of the tophat-cufflinks suite and they all 
suggested that given that bacteria have little or no introns, you can do bowtie 
followed by  cufflinks and skip the tophat. Alternatively, if you do decide to 
do tophat and then cufflinks, make SURE to change the intron size parameters, 
otherwise you will get a mess. I used min intron distance 10bp, max size 
1000bp.  If you just want to do comparative work you can do bowtie and then 
cuffdiff directly. Look at your data with the Galaxy genome browser after you 
run tophat or bowtie to make sure it looks OK.  (I found it easier to convert 
bowtie data from SAM to BAM file and then to view it). One thing I havent fully 
looked into is what happens at the ends of the mapping (since bowtie assumes 
linear chromosomes and not circularized, so just be aware of that difference as 
not all reads will align properly at the ends).
Feel free to contact me if you need more help - I am definitely not an expert 
but have been struggling through doing RNA-Seq on galaxy for the past month or 
so, so may be able to help with some things. Make sure you use matching gtf 
reference annotation (if you have this) and genome!
Good luck!
noa

On 16/02/2012 20:01, Jeremy Goecks wrote: 
Bomba, I'm not familiar enough with bacterial/prokaryotic transcriptomes to 
suggest a possible workflow. You might try the standard 
Tophat-Cufflinks-Cuffcompare/merge-Cuffdiff workflow and see whether you get 
meaningful results; Tophat runs Bowtie internally, so there's no reason to run 
Bowtie separately unless there are Bowtie-specific parameters that you need to 
modify. I've had very little experience with PALMapper and can't speak to its 
efficacy, either for eukaryotic or prokaryotic transcriptome analyses. Finally, 
I've cc'd the galaxy-user mailing list. Using this list is the best way to 
reach the Galaxy user community and get in touch with someone that has used 
Galaxy to analyze bacterial transcriptomes. Good luck,
J. On Feb 16, 2012, at 9:17 AM, Bomba Dam wrote: 
Dear Dr. Goecks, I am working as a post-doctoral fellow in MPI Marburg, 
Germany. We am trying to understand the differential expression of genes in a 
methanotrophic bacterium under different growth conditions. We are sequencing 
the transcriptome using Illumina Hiseq. As I dont have expertise in 
programming languages, I found the Galaxy interface very user-friendly for 
doing such transcriptome analysis. However, I could not find a step wise 
protocol\workflow for mapping bacterial RNA-seq against the reference genome 
(we have the completely sequenced genome of our model organism). I have found 
a detailed step by step workflow for RNA-seq analysis from the University of 
Alabama web-site (uab.edu). However, it refers to the eukaryotic system.
Most examples provided and used for analysis are from eukaryotic systems. I am 
a bit confused weather the same workflow will also work well for bacterial 
systems as there are no splicing events or I should make some modifications. 
Can you kindly suggest me which workflow should I follow for mapping the 
bacterial reads (Bowtie, Tophat or PALMapper) and subsequent quantification 
steps. I want some guidance in this regard. With kind regards, Bomba Dam
-- 
Dr. BOMBA DAM
Alexander von Humboldt Postdoctoral Research Fellow
Max-Planck-Institut für terrestrische Mikrobiologie

Re: [galaxy-user] Error on custom builds on public galaxy server

2012-02-17 Thread Pengfei Yu
Yeah, Hope we could find the answer from Galaxy team soon :-)

Pengfei

On Thu, Feb 16, 2012 at 11:00 PM, Noa Sher noa.s...@gmail.com wrote:

  I also had exactly the same problem yesterday - glad to see that maybe
 it wasn't just me.

 Happy for help!

 Noa

 On 16/02/2012 22:24, Pengfei Yu wrote:

 Dear sir or madam,

  I am using galaxy public server and I am trying to build a custom genome
 using the function custom Builds in user section.
 I specified the Name, Key and uploaded the genome fasta file also the
 chromosome length file. But when I click submit, galaxy gave an error
 information like that:
 
 *Server Error:*
 An error occurred. See the error logs for more information. (Turn debug on
 to display exception reports here)
 
  I don't know where to find the error logs and how to turn debug on.
 Could you tell me how to fix the problem and upload the custom builds?
 Thanks a lot!

  Pengfei
 --
 Pengfei Yu
 Center of Biophysics and Computational Biology
 University of Illinois Urbana-Champaign
 Illinois, 61820, US
 Cell phone: 217-898-6301



 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

   http://lists.bx.psu.edu/




-- 
Pengfei Yu
Center of Biophysics and Computational Biology
University of Illinois Urbana-Champaign
Illinois, 61820, US
Cell phone: 217-898-6301
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Error on custom builds on public galaxy server

2012-02-17 Thread Jennifer Jackson

Hello Pengfei, Noa,

Galaxy was experiencing intermittent cluster failures yesterday. Please 
run your jobs again. I was able to create a test build a few minutes ago 
without a problem at the public Main instance.


We are very sorry for the inconvenience,

Jen
Galaxy team


On 2/16/12 12:24 PM, Pengfei Yu wrote:

Dear sir or madam,

I am using galaxy public server and I am trying to build a custom genome
using the function custom Builds in user section.
I specified the Name, Key and uploaded the genome fasta file also the
chromosome length file. But when I click submit, galaxy gave an error
information like that:

*Server Error:*
An error occurred. See the error logs for more information. (Turn debug
on to display exception reports here)

I don't know where to find the error logs and how to turn debug on.
Could you tell me how to fix the problem and upload the custom builds?
Thanks a lot!

Pengfei
--
Pengfei Yu
Center of Biophysics and Computational Biology
University of Illinois Urbana-Champaign
Illinois, 61820, US
Cell phone: 217-898-6301 tel:217-898-6301



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/wiki/Support
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] Error on custom builds on public galaxy server

2012-02-17 Thread Jennifer Jackson

Hi Pengfei,

The User - Custom Builds: form is used to set up genomes for use with 
Trackster. To use custom reference genomes with tools from the left tool 
panel, only the fasta dataset in the history is needed. It sounds like 
you already have loaded this, but if not, do this before following the 
steps below, using FTP to load if the genome is large: 
http://wiki.g2.bx.psu.edu/FTPUpload


These are the steps to extract sequences with a custom genome:

1 - starting with the tool Fetch Sequences - Extract Genomic DNA
2 - set Fetch sequences for intervals in: to be the BED file of 
coordinates

3 - set Source for Genomic Data: to be History
4 - the form will refresh to reveal a new menu option, set Using 
reference file: to be the dataset from your history that is the fasta 
version of your reference genome

5 - set Output data type: to be FASTA

It is important that the chromosome/scaffold names in the BED file are 
identical to the identifiers in the FASTA reference genome dataset. If 
there are mismatches, an error will result and corrections to the IDs 
should be made.


Hopefully this helps, but if you continue to have problems, please send 
in a bug report (using the green bug icon) from the red failed Extract 
job and we can provide feedback.


Best,

Jen
Galaxy team

On 2/17/12 8:25 AM, Pengfei Yu wrote:

Hi Jeremy,

Thanks for the help! I can upload fasta data for custom builds
(name:'NCI-H209-hg18'; key:'NCI-H209-V1') now. but when I try to use a
bed file specific for this custom builds to fetch sequences from this
custom genome, it gives error information that no sequences are
available for 'NCI-H209-v1'.
I might do something wrong, do you know what would be the reason?

Thanks!
Pengfei

On Fri, Feb 17, 2012 at 7:40 AM, Jeremy Goecks jeremy.goe...@emory.edu
mailto:jeremy.goe...@emory.edu wrote:

Hi Pengfei and Noa,

I suspect this was a temporary problem related to software issues
that we encountered yesterday. Please try again and let us know if
you're still having problems.

Also, note that either a FASTA dataset or a Len file is needed, but
not both. Using a FASTA dataset enables viewing genome data in
Trackster.

Best,
J.


On Feb 17, 2012, at 12:00 AM, Noa Sher wrote:


I also had exactly the same problem yesterday - glad to see that
maybe it wasn't just me.
Happy for help!
Noa

On 16/02/2012 22:24, Pengfei Yu wrote:

Dear sir or madam,

I am using galaxy public server and I am trying to build a custom
genome using the function custom Builds in user section.
I specified the Name, Key and uploaded the genome fasta file also
the chromosome length file. But when I click submit, galaxy
gave an error information like that:

*Server Error:*
An error occurred. See the error logs for more information. (Turn
debug on to display exception reports here)

I don't know where to find the error logs and how to turn debug
on. Could you tell me how to fix the problem and upload the
custom builds?
Thanks a lot!

Pengfei
--
Pengfei Yu
Center of Biophysics and Computational Biology
University of Illinois Urbana-Champaign
Illinois, 61820, US
Cell phone:217-898-6301 tel:217-898-6301



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
atusegalaxy.org  http://usegalaxy.org/.  Please keep all replies on the 
list by
usingreply all  in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
atusegalaxy.org http://usegalaxy.org/.  Please keep all replies
on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/





--
Pengfei Yu
Center of Biophysics and Computational Biology
University of Illinois Urbana-Champaign
Illinois, 61820, US
Cell phone: 217-898-6301



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source