Re: [galaxy-user] Cuffdiff question

2013-11-15 Thread Noa.sher
Dont use the - b parameter

Sent from my iPhone; please excuse any brevity or typos!

On Nov 15, 2013, at 2:51 PM, clare Hardman chard...@mrc-lmb.cam.ac.uk wrote:

 Hi Noa,
 
 Yes I did use Cufflinks so this sounds just like my problem. So how have you 
 dealt with the problem?
 
 Best wishes,
 
 Clare
 
 
 
 On 14 Nov 2013, at 18:17, Noa Sher wrote:
 
 Hi Clare
 We just ran into a similar issue about a week ago and were debugging with 
 the authors of cuffdiff
 Apparently there are issues with the -b parameter - were you using this in 
 cufflinks?
 If yes - this may be the cause - we switched the order of the replicates and 
 the values changed; as did PCA's of the samples, etc
 They are working on this issue for an upcoming version of cufflinks.
 I am interested in knowing whether this was indeed your problem or were you 
 using a pipeline that does not include cufflinks?
 Good luck,
 Noa
 
 
 On 14/11/2013 13:13, clare Hardman wrote:
 Hello,
 
 Could you please advise me on this probably naive question. When I compare 
 sample A and sample B by Ciffdiff and then separately compare Sample A to 
 Sample C by Cuffdiff too, should the FMPK value be the same for A in both 
 tests? At the moment mine does not seem to be!
 
 Best wishes
 
 Clare
 
 
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
   http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
   http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
 
   http://galaxyproject.org/search/mailinglists/
 
 
 
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-user] Cuffdiff question

2013-11-15 Thread clare Hardman
Hi Noa,

Yes I did use Cufflinks so this sounds just like my problem. So how have you 
dealt with the problem?

Best wishes,

Clare



On 14 Nov 2013, at 18:17, Noa Sher wrote:

 Hi Clare
 We just ran into a similar issue about a week ago and were debugging with the 
 authors of cuffdiff
 Apparently there are issues with the -b parameter - were you using this in 
 cufflinks?
 If yes - this may be the cause - we switched the order of the replicates and 
 the values changed; as did PCA's of the samples, etc
 They are working on this issue for an upcoming version of cufflinks.
 I am interested in knowing whether this was indeed your problem or were you 
 using a pipeline that does not include cufflinks?
 Good luck,
 Noa
 
 
 On 14/11/2013 13:13, clare Hardman wrote:
 Hello,
 
 Could you please advise me on this probably naive question. When I compare 
 sample A and sample B by Ciffdiff and then separately compare Sample A to 
 Sample C by Cuffdiff too, should the FMPK value be the same for A in both 
 tests? At the moment mine does not seem to be!
 
 Best wishes
 
 Clare
 
 
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
   http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
   http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
 
   http://galaxyproject.org/search/mailinglists/
 
 

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

[galaxy-user] Cuffdiff question

2013-11-14 Thread clare Hardman
Hello,

Could you please advise me on this probably naive question. When I compare 
sample A and sample B by Ciffdiff and then separately compare Sample A to 
Sample C by Cuffdiff too, should the FMPK value be the same for A in both 
tests? At the moment mine does not seem to be!

Best wishes

Clare


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-user] Cuffdiff question

2013-11-14 Thread Noa Sher

  
  
Hi Clare
We just ran into a similar issue about a week ago and were debugging
with the authors of cuffdiff
Apparently there are issues with the -b parameter - were you using
this in cufflinks?
If yes - this may be the cause - we switched the order of the
replicates and the values changed; as did PCA's of the samples, etc
They are working on this issue for an upcoming version of cufflinks.
I am interested in knowing whether this was indeed your problem or
were you using a pipeline that does not include cufflinks?
Good luck,
Noa


On 14/11/2013 13:13, clare Hardman
  wrote:


  Hello,

Could you please advise me on this probably naive question. When I compare sample A and sample B by Ciffdiff and then separately compare Sample A to Sample C by Cuffdiff too, should the FMPK value be the same for A in both tests? At the moment mine does not seem to be!

Best wishes

Clare


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/



  

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

[galaxy-user] Cuffdiff question

2013-11-12 Thread Jennifer Jackson

Hello,

The transcript name when using RefSeq as a reference annotation is the 
NM_ type of identifier.


If you want to include a gene symbol, then the reference annotation 
should include the attribute gene_name. The iGenomes GTF files are an 
example of datasets that include this attribute.


http://wiki.galaxyproject.org/Support#Interpreting_scientific_results
See: /Example/?/*RNA-seq analysis*/*tools*
http://cufflinks.cbcb.umd.edu/faq.html#gtfs
http://cufflinks.cbcb.umd.edu/igenomes.html
http://cufflinks.cbcb.umd.edu/manual.html (search on page for 
gene_name to see where can be used/output)


Best,

Jen
Galaxy team

On 11/12/13 10:40 AM, Irene Bassano wrote:

Dear Jennifer,
I am about to start a new analysis and learning from my old mistakes would like 
to ask just one question:
I would like my Cufflinks and Cuffdiff results to give me the actual gene name or 
transcript name rather than NM_.. like I had last time i run the job.

At which stage and which option do I have ti use to get the proper names?

Thanks a lot!

Irene


--
Jennifer Hillman-Jackson
http://galaxyproject.org

--
Jennifer Hillman-Jackson
http://galaxyproject.org

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

[galaxy-user] Cuffdiff question about using an unspecified (?) database/build

2011-08-19 Thread David K Crossman
Hello!

I have an RNA-Seq project which consists of 5 samples from the 
species tree shrew.  When uploading these fastq files into Galaxy, I chose 
unspecified (?) for the database/build since the latest tree shrew version is 
not in the drop down list.  When using TopHat, Cufflinks/Compare I have 
selected a reference genome from my history instead of using a built-in index, 
as well as a gtf annotation file for Cufflinks/Compare and everything has been 
working fine.  Now, I am at the Cuffdiff step and I am running into an error 
when setting it up to perform replicate analysis.  When I select my TopHat 
accepted hits bam file I see a red X and the error: Unspecified genome build, 
click the pencil icon in the history item to set the genome build.  Here's a 
screenshot of what I'm seeing:

[cid:image001.png@01CC5E4E.76F37AF0]

Since the latest reference genome for tree shrew wasn't listed, 
that's why I chose unspecified (?).  Should I go back and edit these accepted 
hits bam files to say the Database/Build from the drop down list is Tree shrew 
Dec. 2006 (Broad/tupBel1) (tupBel1)?  I know that this is simple to change, 
but will this affect my results in any way?  Any help/info would be greatly 
appreciated.

Thanks,
David
inline: image001.png___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Cuffdiff question about using an unspecified (?) database/build

2011-08-19 Thread David K Crossman
Jen,

Thank you very much for the reply.  I'm glad to know it is a known bug 
and not something on my side of things.  So, would my analysis be affected if I 
did change the bam file Database/Build to the older tree shrew version found 
in the drop down list?  What significance does this Database/Build box have 
in downstream analysis if you have your own fasta reference genome file and gtf 
annotation file that is being referenced instead of a locally cached one?  I'm 
just trying to obtain a better understanding of the Database/Build box for 
analyses where I provide the fasta and gtf file.

Thanks,
David


-Original Message-
From: Jennifer Jackson [mailto:j...@bx.psu.edu] 
Sent: Friday, August 19, 2011 9:20 AM
To: David K Crossman
Cc: galaxy-user (galaxy-user@lists.bx.psu.edu)
Subject: Re: [galaxy-user] Cuffdiff question about using an unspecified (?) 
database/build

Hello David,

This is a known bug. The correction is planned to be moved out onto the public 
Galaxy instance at the next update (within a week).

Sorry for the current inconvenience,

Best,

Jen
Galaxy team

On 8/19/11 7:00 AM, David K Crossman wrote:
 Hello!

 I have an RNA-Seq project which consists of 5 samples from the species 
 tree shrew. When uploading these fastq files into Galaxy, I chose 
 unspecified (?) for the database/build since the latest tree shrew 
 version is not in the drop down list. When using TopHat, 
 Cufflinks/Compare I have selected a reference genome from my history 
 instead of using a built-in index, as well as a gtf annotation file 
 for Cufflinks/Compare and everything has been working fine. Now, I am 
 at the Cuffdiff step and I am running into an error when setting it up 
 to perform replicate analysis. When I select my TopHat accepted hits 
 bam file I see a red X and the error: Unspecified genome build, click 
 the pencil icon in the history item to set the genome build. Here's a 
 screenshot of what I'm seeing:

 Since the latest reference genome for tree shrew wasn't listed, that's 
 why I chose unspecified (?). Should I go back and edit these 
 accepted hits bam files to say the Database/Build from the drop down 
 list is Tree shrew Dec. 2006 (Broad/tupBel1) (tupBel1)? I know that 
 this is simple to change, but will this affect my results in any way? 
 Any help/info would be greatly appreciated.

 Thanks,

 David



 ___
 The Galaxy User list should be used for the discussion of Galaxy 
 analysis and other features on the public server at usegalaxy.org.  
 Please keep all replies on the list by using reply all in your mail 
 client.  For discussion of local Galaxy instances and the Galaxy 
 source code, please use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists, please 
 use the interface at:

http://lists.bx.psu.edu/

--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Cuffdiff Question

2011-06-28 Thread Jeremy Goecks
Hello Kurinji,

 I was at your USC Galaxy seminar last week, which I found very helpful - 
 thank you!

Glad to hear that you found the workshop helpful. As a reminder, please email 
questions about using Galaxy and its tools to the galaxy-user mailing list 
(which I've cc'd). You may get quicker and different responses from community 
members, and everyone will benefit from the discussion.

 I used my recently generated RNAseq data in Galaxy (which was pre-aligned 
 using tophat and already had cufflinks run on it) - I ran cuffcompare with 
 all the gtf files and then cuffdiff for the three pairs (there is 1 control 
 and 3 different drug treatments - no replicates). I got several output files, 
 as expected, but decided just to look at the gene differential expression as 
 a start. Some questions I have are - 
 
 1. (very basic question!) which is sample 1 (and corresponding value 1) and 
 sample 2 (and corresponding value 2)in my output file. This is what my output 
 file is called - 
 
 90: Cuffdiff on data 37, data 38, and data 60: gene differential expression 
 testing 33,969 lines
 
 Is 37 sample one or sample two? Given the data - I would expect sample 37 to 
 correspond to value 2 - but I could be wrong. Please let me know!

The best way to figure out which dataset corresponds with Cuffdiff's labels is 
to click the rerun button in the dataset: sample names correspond directly to 
the reads datasets (i.e. BAM files) provided as input to Cuffdiff.

 2. How do I find the UCSC gene names corresponding with start/end sites - I 
 did input the hg18 UCSC gtf file as a reference


You'll need to use a reference annotation (GTF file) that has the gene_name 
attribute as input for Cufflinks/compare/difff. Typically Ensembl annotations 
have this attribute; however, you'll need to prepend 'chr' to each 
line--really, to each chromosome name--in order to bring Ensembl notation in 
line with UCSC/Galaxy notation.

 Actually, I noticed that value 1 in this particular output file is all 0 - no 
 idea why. It is not this way in the other files, making me wonder if there is 
 an error somewhere. I am sure the bam file is okay as I viewed it on IGV and 
 saw the patterns I would expect for some candidate genes I looked at.

It's difficult for me to comment without seeing your analysis. Some output 
files depend on particular attributes being set correctly in the annotation 
file. You may want to search through our mailing list archives and see if your 
question has already been answered: 
http://gmod.827538.n3.nabble.com/Galaxy-Users-f815892.html

Good luck,
J.___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Cuffdiff Question

2011-06-28 Thread Jeremy Goecks

 Thanks for the reply. I tried to use the script provided on a previous galaxy 
 thread for adding the chr on to the gtf file on the mac terminal but I keep 
 getting this error - 
 
 awk: can't open file ensembl.gtf
  source line number 1
 
 I am very new to using the terminal so please let me know if there is 
 something basic that I am not doing right,

Try this Galaxy workflow:

http://main.g2.bx.psu.edu/u/jeremy/w/make-ensembl-gtf-compatible-with-cufflinks

It simply prepends 'chr' to the chromosome name, which is needed if you're 
using an Ensemble reference annotation and want to use it with 
Cufflinks/compare/diff in Galaxy.

Best,
J.
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/