[galaxy-user] Reference genome , fasta with or without gene feature

2012-06-14 Thread Ateequr Rehman
Dear All

I am analysing bacterial RNA seq (illumina). as my reference genome is not in 
galaxy. so i need to downlaoded the reference genome from  NCBI and upload to 
my history 

Should i use reference genome with  gene features or  single  fasta file with 
out any annotation information. even after download should i need to modify or 
it should be used as it is. 

Thanks in advance
Ateeq___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] Bacterial RNA seq analysis

2012-04-18 Thread Ateequr Rehman
Dear All

I need some (lots) suggestions and help,  first and most important is that i am 
working on bacterial RNA seq (illumina reads)
my analysis steps are as following 

Step 1.  FASTQ sequence data was groomed

Step 2. I did mapping by Bowtie with default parameters. Reference genome fasta 
file i am using from my history, because the reference genome is not vaialble 
on galaxy.

Step3. i sorted the bowtie output file using r workflow (Germy Goecks 
workflow)  , link below 
https://main.g2.bx.psu.edu/u/jeremy/p/transcriptome-analysis-faq#faq2

Step4. this sorting provided me Concatenate files

Step5. Concatenated files were used to run CUFFLINK, this provided me assembled 
trancript file

Step6: Assembled transcript files from step 5 were used for CUFFMERGE

Step 7:  For CUFFDIFF Transcript GTF file generated from  Step 6and  
concatenate files from step 4 were used 

Now my question is if this workflow is acceptable for bacterial transcriptome 
anaylsis, 

Should i filter SAM file, if yes then at which step

Should i convert SAM file to the BAM file, then at which step it should be

Is that Ok to use fasta of reference genome for mapping should it be converted 
to any other format, if yes then what should be the workflow

If any one have experince of bowtie parametes to map bacterial RNA seqquence 
analsis are very much welcomed  

Thanking you all ___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] Cuffdiff

2012-04-17 Thread Ateequr Rehman
Dear All
I have simple and question for cuffdiff
should we run cuffdif on merge transcript file (produced by cuffmerge) and 
concatenate data sets
or directly on cufflink produced files, in the later case, i have two 
transcript files resulting from cufflink on sample 1 and 2 respectively,
result using sample 1 as transcripts are not the same when i am suing sample 2 
as transcript 


i am bit confused what should be the correct way

any help is very much welcomed

Best
ateeq

 
Ateequr Rehman
House No. 2 ground floor
Blauenstr. 10
79115 Freiburg im Breisgau___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] Disk space quota Issue

2012-04-02 Thread Ateequr Rehman


Dear Administartor

My disk space quote is full, is their any way to increase it,or only way is 
delete some files...
Best
Ateeq

 
Ateequr Rehman
House No. 2 ground floor
Blauenstr. 10
79115 Freiburg im Breisgau


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] Cuffdiff result P and q values

2012-03-19 Thread Ateequr Rehman
Dear galaxy user
After running cuffdiff on my two samples (SAM files from bowtie)
i got a list with p and q values, and löast colum is saying abou significance

with P value, it seems like the comparison should be significant, but in Q 
value is 1, and last coumn is saying not significant

any one have an idea, how to interpret it , should we take any comparsion with 
less than 0.05 p value as significant or not 


tables in excel looks like it

Nay help is welcome

best
Ateeq



test_id gene_id gene locus sample_1 sample_2 status value_1 value_2 
log2(fold_change) test_stat p_value q_value significant 
CUFF.428.1 CUFF.428 - gi|190572091|ref|NC_010943.1|:1575813-1577629 q1 q2 
NOTEST 171.773 605.136 -150.518 588.996 3,86E-05 1 no 
CUFF.462.1 CUFF.462 - gi|190572091|ref|NC_010943.1|:1680283-1681214 q1 q2 
NOTEST 696.628 322.149 -111.266 538.062 7,42E-03 1 no 
CUFF.635.1 CUFF.635 - gi|190572091|ref|NC_010943.1|:2343969-2346219 q1 q2 
NOTEST 396.469 223.951 -0.824027 476.902 1,85E-01 1 no 
CUFF.512.1 CUFF.512 - gi|190572091|ref|NC_010943.1|:1840464-1843486 q1 q2 
NOTEST 136.314 70.604 -0.949109 422.322 2,41E-01 1 no 
CUFF.632.1 CUFF.632 - gi|190572091|ref|NC_010943.1|:2346561-2347408 q1 q2 
NOTEST 351.508 167.567 -106.882 415.844 3,20E-01 1 no 
CUFF.941.1 CUFF.941 - gi|190572091|ref|NC_010943.1|:3664426-3665364 q1 q2 
NOTEST 282.247 133.798 -10.769 412.254 3,75E-01 1 no 
CUFF.616.1 CUFF.616 - gi|190572091|ref|NC_010943.1|:2301552-2303180 q1 q2 
NOTEST 169.682 744.885 -118.774 462.107 3,82E-01 1 no 
CUFF.617.1 CUFF.617 - gi|190572091|ref|NC_010943.1|:2295763-2297758 q1 q2 
NOTEST 225.933 112.178 -101.011 454.517 5,49E-01 1 no 
CUFF.9.1 CUFF.9 - gi|190572091|ref|NC_010943.1|:41597-42402 q1 q2 OK 1729.08 
2797.07 0.693913 -4.461 8,16E-01 0.000179474 yes 
CUFF.956.1 CUFF.956 - gi|190572091|ref|NC_010943.1|:3665445-3669232 q1 q2 
NOTEST 518.525 323.653 -0.679966 444.565 8,76E-01 1 no 
CUFF.549.1 CUFF.549 - gi|190572091|ref|NC_010943.1|:2043111-2043664 q1 q2 OK 
7148.23 11816.4 0.725138 -421.443 2,50E+00 0.000275446 yes 
CUFF.872.1 CUFF.872 - gi|190572091|ref|NC_010943.1|:3489557-3490326 q1 q2 
NOTEST 220.274 840.662 -13.897 416.179 3,16E+00 1 no 
CUFF.636.1 CUFF.636 - gi|190572091|ref|NC_010943.1|:2348784-2352394 q1 q2 
NOTEST 114.384 601.415 -0.927447 414.807 3,35E+00 1 no 
CUFF.605.1 CUFF.605 - gi|190572091|ref|NC_010943.1|:2271979-2275960 q1 q2 
NOTEST 217.007 133.837 -0.697264 409.373 4,24E+00 1 no 
CUFF.568.1 CUFF.568 - gi|190572091|ref|NC_010943.1|:2160538-2164415 q1 q2 
NOTEST 74.365 377.013 -0.980011 395.097 7,78E+00 1 no 
CUFF.597.1 CUFF.597 - gi|190572091|ref|NC_010943.1|:2250029-2250918 q1 q2 
NOTEST 229.389 105.246 -112.403 386.937 0.000109116 1 no 

 
Ateequr Rehman
House No. 2 ground floor
Blauenstr. 10
79115 Freiburg im Breisgau___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] ILLumina 1.9 Hiseq

2012-02-29 Thread Ateequr Rehman
Dear Glaxy users and admin

I ran my sequence data on FASTQC tool, 

output says it is 

EncodingSanger / Illumina 1.9

now i want to groom my file, but groomer does not have option for 1.9 in Input 
FASTQ quality scores type

any idea which option i should select to grroom my file, 


later i want to run Bowtie or Tophat, 


Thanks___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Cufflinks related problem

2012-02-22 Thread Ateequr Rehman
Dear Bomba

I am bit confused , Are you running bowtie --- tophat  Cufflink
For bowtie run you are using reference  genome annotation  in gff3 format

 I am curremntly totally unable to figure it out correctly, how i should 
analyse my RNA seq data

Best
Ateeq


 



 From: Noa Sher noa.s...@gmail.com
To: Bomba Dam bomba@visva-bharati.ac.in 
Cc: galaxy-user@lists.bx.psu.edu 
Sent: Wednesday, February 22, 2012 7:28 AM
Subject: Re: [galaxy-user] Cufflinks related problem
 

 
Hi Bomba
I cant know for sure without seeing your files but I originally had the same 
problem and it ended up being because the way the genome was named in the fasta 
genome file was not the same as the way it was named in column 1 of my gtf 
file. I would check that first.  Also- with bacteria you dont want to run 
cufflinks with default parameters- use the galaxy browser to check how your 
data looks after tophat and you will most likely see very strange gene-spanning 
introns. Change the max intron size to 1000-1500 and the min distnace to 
101-5nt and you should get results that make much more sense.
Good luck
Noa

On 22/02/2012 00:12, Bomba Dam wrote: 
Dear Noa,

Can you please  tell me the parameters that I should keep while
  mapping bacterial transcripts using cuffkins. I have kept the
  default parameters as in Cufflinks and used my custom genome
  annotation  in gff3 format. The cufflinks seems to work Ok but all
  the FPKM values in these files are zero. As suggested by other
  users I have checked the correctness of my GFF3 files. The
  corresponding fasta file was used for mapping the transcripts
  using Bowtie. Are there any special trick for mapping bacterial
  transcriptome. 

regards,

Bomba

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Using galaxy for Bacterial RNA-seq

2012-02-17 Thread Ateequr Rehman
Dear Noa and Bomba

i am also in situation like you, hardly any programming experiance but want to 
analyse, bacterial tranmscriptome before and after an stress. 

I am very new to galaxy, would be obliged if any one can give me more hints, i 
have few queries
In Brief;  we did our sequnecing by Hiseq 2000, from our partner i got two 
files (FASTQ) for each treatment, for example R1 and R2


i was sucessful to upload these files (Fastq) and reference genome file (Fasta) 
via FTP, 
 
after upload, i run the Fastq groomer, with my understanding i must quality 
filter my sequences, but i am not sure what shoud be the cutoff. 


after reading Noa email, seems like i should do Bowtie mapping, after groomer , 
and filter is is there any other middle  step also before i go to bowtie
Another question is, should i join both the files (fastq joiner) at any stage 
of analysis 


Bomba could you please send me link as you stated, for RNA seqq analysis on 
Univerity of albama website, although it is for eukaryote, but may be will help 
me to understand the steps

Thanking you all





 From: Noa Sher noa.s...@gmail.com
To: Jeremy Goecks jeremy.goe...@emory.edu 
Cc: galaxy-user@lists.bx.psu.edu (galaxy-user@lists.bx.psu.edu) 
galaxy-user@lists.bx.psu.edu; closetic...@galaxyproject.org; Bomba Dam 
bomba@mpi-marburg.mpg.de 
Sent: Thursday, February 16, 2012 8:31 PM
Subject: Re: [galaxy-user] Using galaxy for Bacterial RNA-seq
 

 
Hi Bomba
I have been recently struggling with the same issue (RNA-Seq on a bacteria; no 
programming experience).
I was in touch with the authors of the tophat-cufflinks suite and they all 
suggested that given that bacteria have little or no introns, you can do bowtie 
followed by  cufflinks and skip the tophat. Alternatively, if you do decide to 
do tophat and then cufflinks, make SURE to change the intron size parameters, 
otherwise you will get a mess. I used min intron distance 10bp, max size 
1000bp.  If you just want to do comparative work you can do bowtie and then 
cuffdiff directly. Look at your data with the Galaxy genome browser after you 
run tophat or bowtie to make sure it looks OK.  (I found it easier to convert 
bowtie data from SAM to BAM file and then to view it). One thing I havent fully 
looked into is what happens at the ends of the mapping (since bowtie assumes 
linear chromosomes and not circularized, so just be aware of that difference as 
not all reads will align properly at the ends).
Feel free to contact me if you need more help - I am definitely not an expert 
but have been struggling through doing RNA-Seq on galaxy for the past month or 
so, so may be able to help with some things. Make sure you use matching gtf 
reference annotation (if you have this) and genome!
Good luck!
noa

On 16/02/2012 20:01, Jeremy Goecks wrote: 
Bomba, I'm not familiar enough with bacterial/prokaryotic transcriptomes to 
suggest a possible workflow. You might try the standard 
Tophat-Cufflinks-Cuffcompare/merge-Cuffdiff workflow and see whether you get 
meaningful results; Tophat runs Bowtie internally, so there's no reason to run 
Bowtie separately unless there are Bowtie-specific parameters that you need to 
modify. I've had very little experience with PALMapper and can't speak to its 
efficacy, either for eukaryotic or prokaryotic transcriptome analyses. Finally, 
I've cc'd the galaxy-user mailing list. Using this list is the best way to 
reach the Galaxy user community and get in touch with someone that has used 
Galaxy to analyze bacterial transcriptomes. Good luck,
J. On Feb 16, 2012, at 9:17 AM, Bomba Dam wrote: 
Dear Dr. Goecks, I am working as a post-doctoral fellow in MPI Marburg, 
Germany. We am trying to understand the differential expression of genes in a 
methanotrophic bacterium under different growth conditions. We are sequencing 
the transcriptome using Illumina Hiseq. As I dont have expertise in 
programming languages, I found the Galaxy interface very user-friendly for 
doing such transcriptome analysis. However, I could not find a step wise 
protocol\workflow for mapping bacterial RNA-seq against the reference genome 
(we have the completely sequenced genome of our model organism). I have found 
a detailed step by step workflow for RNA-seq analysis from the University of 
Alabama web-site (uab.edu). However, it refers to the eukaryotic system.
Most examples provided and used for analysis are from eukaryotic systems. I am 
a bit confused weather the same workflow will also work well for bacterial 
systems as there are no splicing events or I should make some modifications. 
Can you kindly suggest me which workflow should I follow for mapping the 
bacterial reads (Bowtie, Tophat or PALMapper) and subsequent quantification 
steps. I want some guidance in this regard. With kind regards, Bomba Dam
-- 
Dr. BOMBA DAM
Alexander von Humboldt Postdoctoral Research Fellow
Max-Planck-Institut für terrestrische Mikrobiologie

[galaxy-user] genome for Stenotrophomans

2012-01-13 Thread Ateequr Rehman
Dear Galaxy AdministratorI am analysing transcriptome and genome of clinical 
isolatesStenotrophomonasmaltophilia, would be highly obliged if you could 
please upload
Stenotrophomonas maltophilia K279 and Stenotrophomonas maltophilia R551-3 genome
acccesion numbers at NCBI are as following
Stenotrophomonas maltophilia K279   (Accession AM743169)



Stenotrophomonas maltophilia R551-3, complete genome
Accession: CP00
Thanking you all in advance
Ateequr Rehman___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-user] data upload

2012-01-11 Thread Ateequr Rehman
Dear galaxy UsersI am very very new to galaxy, will be highly
obliged if some one could help me to find the way to analyse bacterial
tramnscriptome. in the first step itself , i am having trouble to
upload files...Does any one knows how to generate URL to upload data, my fastq files are about 3 gb each, Is there any specific pipeline to analyse bacterial transcriptomeThanking all of youAteeqAteequr RehmanHouse No. 2 ground floorBlauenstr. 10			79115 Freiburg im Breisgau   From: Dannon Baker dannonba...@me.com To: mailing list margeem...@gmail.com Cc: galaxy-user@lists.bx.psu.edu  Sent: Wednesday, January 11, 2012 2:11
 PM Subject: Re: [galaxy-user] CloudMan - Persist changes to file system is not a hyperlink / clickable  
For sharing a cluster (of any kind) just click the sharing icon ()next to the cluster name and the dialog that pops up will walk you through the next steps.-DannonOn Jan 11, 2012, at 7:27 AM, mailing list wrote:So how do I get a share string for /mnt/galaxyData?Thanks again,GregOn Tue, Jan 10, 2012 at 4:16 PM, Dannon Baker dannonba...@me.com wrote:Sorry, I should have specified before -- /mnt/galaxyData is automatically persisted, as it is assumed (and used behind the scenes in the case of a
 Galaxy Cluster) that this is where data will be kept. It's only /mnt/galaxyTools or /mnt/galaxyIndices that need individual flagging for persistence.So, /mnt/galaxyData *is* safe to use for your own customizations and will be persisted automatically with no additional steps required on your part.-DannonOn Jan 10, 2012, at 4:02 PM, mailing list wrote:I'm following the instructions here:http://wiki.g2.bx.psu.edu/Admin/Cloud/Customize%20Galaxy%20CloudI ssh'd in and just made this change:cd /mnt/galaxyDatasudo git clone git://github.com/JaneliaSciComp/msg.gitNow I want to persist it, but as my subject line says, I don't seem tohave the option in the admin.Thanks,Greg___The Galaxy User list should be used for the discussion ofGalaxy analysis and other features on the public serverat usegalaxy.org. Please keep all replies on the list byusing "reply all" in your mail client. For discussion
 oflocal Galaxy instances and the Galaxy source code, pleaseuse the Galaxy Development list:http://lists.bx.psu.edu/listinfo/galaxy-devTo manage your subscriptions to this and other Galaxy lists,please use the interface at:http://lists.bx.psu.edu/___The Galaxy User list should be used for the discussion ofGalaxy analysis and other features on the public serverat usegalaxy.org. Please keep all replies on the list byusing "reply all" in your mail client. For discussion oflocal Galaxy instances and the Galaxy source code, pleaseuse the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-devTo manage your subscriptions to this and other Galaxy lists,please use the interface at: http://lists.bx.psu.edu/___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/