Re: [galaxy-user] Cuffdiff question

2013-11-15 Thread clare Hardman
Hi Noa,

Yes I did use Cufflinks so this sounds just like my problem. So how have you 
dealt with the problem?

Best wishes,

Clare



On 14 Nov 2013, at 18:17, Noa Sher wrote:

> Hi Clare
> We just ran into a similar issue about a week ago and were debugging with the 
> authors of cuffdiff
> Apparently there are issues with the -b parameter - were you using this in 
> cufflinks?
> If yes - this may be the cause - we switched the order of the replicates and 
> the values changed; as did PCA's of the samples, etc
> They are working on this issue for an upcoming version of cufflinks.
> I am interested in knowing whether this was indeed your problem or were you 
> using a pipeline that does not include cufflinks?
> Good luck,
> Noa
> 
> 
> On 14/11/2013 13:13, clare Hardman wrote:
>> Hello,
>> 
>> Could you please advise me on this probably naive question. When I compare 
>> sample A and sample B by Ciffdiff and then separately compare Sample A to 
>> Sample C by Cuffdiff too, should the FMPK value be the same for A in both 
>> tests? At the moment mine does not seem to be!
>> 
>> Best wishes
>> 
>> Clare
>> 
>> 
>> ___
>> The Galaxy User list should be used for the discussion of
>> Galaxy analysis and other features on the public server
>> at usegalaxy.org.  Please keep all replies on the list by
>> using "reply all" in your mail client.  For discussion of
>> local Galaxy instances and the Galaxy source code, please
>> use the Galaxy Development list:
>> 
>>   http://lists.bx.psu.edu/listinfo/galaxy-dev
>> 
>> To manage your subscriptions to this and other Galaxy lists,
>> please use the interface at:
>> 
>>   http://lists.bx.psu.edu/
>> 
>> To search Galaxy mailing lists use the unified search at:
>> 
>>   http://galaxyproject.org/search/mailinglists/
> 
> 

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-user] Cuffdiff question

2013-11-15 Thread Noa.sher
Dont use the - b parameter

Sent from my iPhone; please excuse any brevity or typos!

On Nov 15, 2013, at 2:51 PM, clare Hardman  wrote:

> Hi Noa,
> 
> Yes I did use Cufflinks so this sounds just like my problem. So how have you 
> dealt with the problem?
> 
> Best wishes,
> 
> Clare
> 
> 
> 
> On 14 Nov 2013, at 18:17, Noa Sher wrote:
> 
>> Hi Clare
>> We just ran into a similar issue about a week ago and were debugging with 
>> the authors of cuffdiff
>> Apparently there are issues with the -b parameter - were you using this in 
>> cufflinks?
>> If yes - this may be the cause - we switched the order of the replicates and 
>> the values changed; as did PCA's of the samples, etc
>> They are working on this issue for an upcoming version of cufflinks.
>> I am interested in knowing whether this was indeed your problem or were you 
>> using a pipeline that does not include cufflinks?
>> Good luck,
>> Noa
>> 
>> 
>> On 14/11/2013 13:13, clare Hardman wrote:
>>> Hello,
>>> 
>>> Could you please advise me on this probably naive question. When I compare 
>>> sample A and sample B by Ciffdiff and then separately compare Sample A to 
>>> Sample C by Cuffdiff too, should the FMPK value be the same for A in both 
>>> tests? At the moment mine does not seem to be!
>>> 
>>> Best wishes
>>> 
>>> Clare
>>> 
>>> 
>>> ___
>>> The Galaxy User list should be used for the discussion of
>>> Galaxy analysis and other features on the public server
>>> at usegalaxy.org.  Please keep all replies on the list by
>>> using "reply all" in your mail client.  For discussion of
>>> local Galaxy instances and the Galaxy source code, please
>>> use the Galaxy Development list:
>>> 
>>>   http://lists.bx.psu.edu/listinfo/galaxy-dev
>>> 
>>> To manage your subscriptions to this and other Galaxy lists,
>>> please use the interface at:
>>> 
>>>   http://lists.bx.psu.edu/
>>> 
>>> To search Galaxy mailing lists use the unified search at:
>>> 
>>>   http://galaxyproject.org/search/mailinglists/
>> 
>> 
> 
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-user] Cuffdiff question

2013-11-14 Thread Noa Sher

  
  
Hi Clare
We just ran into a similar issue about a week ago and were debugging
with the authors of cuffdiff
Apparently there are issues with the -b parameter - were you using
this in cufflinks?
If yes - this may be the cause - we switched the order of the
replicates and the values changed; as did PCA's of the samples, etc
They are working on this issue for an upcoming version of cufflinks.
I am interested in knowing whether this was indeed your problem or
were you using a pipeline that does not include cufflinks?
Good luck,
Noa


On 14/11/2013 13:13, clare Hardman
  wrote:


  Hello,

Could you please advise me on this probably naive question. When I compare sample A and sample B by Ciffdiff and then separately compare Sample A to Sample C by Cuffdiff too, should the FMPK value be the same for A in both tests? At the moment mine does not seem to be!

Best wishes

Clare


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/



  

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-user] Cuffdiff question about using an unspecified (?) database/build

2011-08-19 Thread David K Crossman
Jen,

Thank you very much for the reply.  I'm glad to know it is a known bug 
and not something on my side of things.  So, would my analysis be affected if I 
did change the bam file "Database/Build" to the older tree shrew version found 
in the drop down list?  What significance does this "Database/Build" box have 
in downstream analysis if you have your own fasta reference genome file and gtf 
annotation file that is being referenced instead of a locally cached one?  I'm 
just trying to obtain a better understanding of the "Database/Build" box for 
analyses where I provide the fasta and gtf file.

Thanks,
David


-Original Message-
From: Jennifer Jackson [mailto:j...@bx.psu.edu] 
Sent: Friday, August 19, 2011 9:20 AM
To: David K Crossman
Cc: galaxy-user (galaxy-user@lists.bx.psu.edu)
Subject: Re: [galaxy-user] Cuffdiff question about using an unspecified (?) 
database/build

Hello David,

This is a known bug. The correction is planned to be moved out onto the public 
Galaxy instance at the next update (within a week).

Sorry for the current inconvenience,

Best,

Jen
Galaxy team

On 8/19/11 7:00 AM, David K Crossman wrote:
> Hello!
>
> I have an RNA-Seq project which consists of 5 samples from the species 
> tree shrew. When uploading these fastq files into Galaxy, I chose 
> "unspecified (?)" for the database/build since the latest tree shrew 
> version is not in the drop down list. When using TopHat, 
> Cufflinks/Compare I have selected a reference genome from my history 
> instead of using a built-in index, as well as a gtf annotation file 
> for Cufflinks/Compare and everything has been working fine. Now, I am 
> at the Cuffdiff step and I am running into an error when setting it up 
> to perform replicate analysis. When I select my TopHat accepted hits 
> bam file I see a red X and the error: "Unspecified genome build, click 
> the pencil icon in the history item to set the genome build." Here's a 
> screenshot of what I'm seeing:
>
> Since the latest reference genome for tree shrew wasn't listed, that's 
> why I chose "unspecified (?)." Should I go back and edit these 
> accepted hits bam files to say the Database/Build from the drop down 
> list is "Tree shrew Dec. 2006 (Broad/tupBel1) (tupBel1)?" I know that 
> this is simple to change, but will this affect my results in any way? 
> Any help/info would be greatly appreciated.
>
> Thanks,
>
> David
>
>
>
> ___
> The Galaxy User list should be used for the discussion of Galaxy 
> analysis and other features on the public server at usegalaxy.org.  
> Please keep all replies on the list by using "reply all" in your mail 
> client.  For discussion of local Galaxy instances and the Galaxy 
> source code, please use the Galaxy Development list:
>
>http://lists.bx.psu.edu/listinfo/galaxy-dev
>
> To manage your subscriptions to this and other Galaxy lists, please 
> use the interface at:
>
>http://lists.bx.psu.edu/

--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Cuffdiff question about using an unspecified (?) database/build

2011-08-19 Thread Jennifer Jackson

Hello David,

This is a known bug. The correction is planned to be moved out onto the 
public Galaxy instance at the next update (within a week).


Sorry for the current inconvenience,

Best,

Jen
Galaxy team

On 8/19/11 7:00 AM, David K Crossman wrote:

Hello!

I have an RNA-Seq project which consists of 5 samples from the species
tree shrew. When uploading these fastq files into Galaxy, I chose
“unspecified (?)” for the database/build since the latest tree shrew
version is not in the drop down list. When using TopHat,
Cufflinks/Compare I have selected a reference genome from my history
instead of using a built-in index, as well as a gtf annotation file for
Cufflinks/Compare and everything has been working fine. Now, I am at the
Cuffdiff step and I am running into an error when setting it up to
perform replicate analysis. When I select my TopHat accepted hits bam
file I see a red X and the error: “Unspecified genome build, click the
pencil icon in the history item to set the genome build.” Here’s a
screenshot of what I’m seeing:

Since the latest reference genome for tree shrew wasn’t listed, that’s
why I chose “unspecified (?).” Should I go back and edit these accepted
hits bam files to say the Database/Build from the drop down list is
“Tree shrew Dec. 2006 (Broad/tupBel1) (tupBel1)?” I know that this is
simple to change, but will this affect my results in any way? Any
help/info would be greatly appreciated.

Thanks,

David



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/Support
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] Cuffdiff Question

2011-06-28 Thread Kurinji Pandiyan
Thanks for the reply. I tried to use the script provided on a previous
galaxy thread for adding the chr on to the gtf file on the mac terminal but
I keep getting this error -

awk: can't open file ensembl.gtf
 source line number 1

I am very new to using the terminal so please let me know if there is
something basic that I am not doing right,

Thanks!
Kurinji

On Tue, Jun 28, 2011 at 6:13 AM, Jeremy Goecks wrote:

> Hello Kurinji,
>
> I was at your USC Galaxy seminar last week, which I found very helpful -
> thank you!
>
>
> Glad to hear that you found the workshop helpful. As a reminder, please
> email questions about using Galaxy and its tools to the galaxy-user mailing
> list (which I've cc'd). You may get quicker and different responses from
> community members, and everyone will benefit from the discussion.
>
> I used my recently generated RNAseq data in Galaxy (which was pre-aligned
> using tophat and already had cufflinks run on it) - I ran cuffcompare with
> all the gtf files and then cuffdiff for the three pairs (there is 1 control
> and 3 different drug treatments - no replicates). I got several output
> files, as expected, but decided just to look at the gene differential
> expression as a start. Some questions I have are -
>
> 1. (very basic question!) which is sample 1 (and corresponding value 1) and
> sample 2 (and corresponding value 2)in my output file. This is what my
> output file is called -
>
> 90: Cuffdiff on data 37, data 38, and data 60: gene differential expression
> testing 33,969 lines
>
> Is 37 sample one or sample two? Given the data - I would expect sample 37
> to correspond to "value 2" - but I could be wrong. Please let me know!
>
>
> The best way to figure out which dataset corresponds with Cuffdiff's labels
> is to click the rerun button in the dataset: sample names correspond
> directly to the reads datasets (i.e. BAM files) provided as input to
> Cuffdiff.
>
> 2. How do I find the UCSC gene names corresponding with start/end sites - I
> did input the hg18 UCSC gtf file as a reference
>
>
> You'll need to use a reference annotation (GTF file) that has the gene_name
> attribute as input for Cufflinks/compare/difff. Typically Ensembl
> annotations have this attribute; however, you'll need to prepend 'chr' to
> each line--really, to each chromosome name--in order to bring Ensembl
> notation in line with UCSC/Galaxy notation.
>
> Actually, I noticed that value 1 in this particular output file is all 0 -
> no idea why. It is not this way in the other files, making me wonder if
> there is an error somewhere. I am sure the bam file is okay as I viewed it
> on IGV and saw the patterns I would expect for some candidate genes I looked
> at.
>
>
> It's difficult for me to comment without seeing your analysis. Some output
> files depend on particular attributes being set correctly in the annotation
> file. You may want to search through our mailing list archives and see if
> your question has already been answered:
> http://gmod.827538.n3.nabble.com/Galaxy-Users-f815892.html
>
> Good luck,
> J.
>
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Cuffdiff Question

2011-06-28 Thread Jeremy Goecks

> Thanks for the reply. I tried to use the script provided on a previous galaxy 
> thread for adding the chr on to the gtf file on the mac terminal but I keep 
> getting this error - 
> 
> awk: can't open file ensembl.gtf
>  source line number 1
> 
> I am very new to using the terminal so please let me know if there is 
> something basic that I am not doing right,

Try this Galaxy workflow:

http://main.g2.bx.psu.edu/u/jeremy/w/make-ensembl-gtf-compatible-with-cufflinks

It simply prepends 'chr' to the chromosome name, which is needed if you're 
using an Ensemble reference annotation and want to use it with 
Cufflinks/compare/diff in Galaxy.

Best,
J.
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Cuffdiff Question

2011-06-28 Thread Jeremy Goecks
Hello Kurinji,

> I was at your USC Galaxy seminar last week, which I found very helpful - 
> thank you!

Glad to hear that you found the workshop helpful. As a reminder, please email 
questions about using Galaxy and its tools to the galaxy-user mailing list 
(which I've cc'd). You may get quicker and different responses from community 
members, and everyone will benefit from the discussion.

> I used my recently generated RNAseq data in Galaxy (which was pre-aligned 
> using tophat and already had cufflinks run on it) - I ran cuffcompare with 
> all the gtf files and then cuffdiff for the three pairs (there is 1 control 
> and 3 different drug treatments - no replicates). I got several output files, 
> as expected, but decided just to look at the gene differential expression as 
> a start. Some questions I have are - 
> 
> 1. (very basic question!) which is sample 1 (and corresponding value 1) and 
> sample 2 (and corresponding value 2)in my output file. This is what my output 
> file is called - 
> 
> 90: Cuffdiff on data 37, data 38, and data 60: gene differential expression 
> testing 33,969 lines
> 
> Is 37 sample one or sample two? Given the data - I would expect sample 37 to 
> correspond to "value 2" - but I could be wrong. Please let me know!

The best way to figure out which dataset corresponds with Cuffdiff's labels is 
to click the rerun button in the dataset: sample names correspond directly to 
the reads datasets (i.e. BAM files) provided as input to Cuffdiff.

> 2. How do I find the UCSC gene names corresponding with start/end sites - I 
> did input the hg18 UCSC gtf file as a reference


You'll need to use a reference annotation (GTF file) that has the gene_name 
attribute as input for Cufflinks/compare/difff. Typically Ensembl annotations 
have this attribute; however, you'll need to prepend 'chr' to each 
line--really, to each chromosome name--in order to bring Ensembl notation in 
line with UCSC/Galaxy notation.

> Actually, I noticed that value 1 in this particular output file is all 0 - no 
> idea why. It is not this way in the other files, making me wonder if there is 
> an error somewhere. I am sure the bam file is okay as I viewed it on IGV and 
> saw the patterns I would expect for some candidate genes I looked at.

It's difficult for me to comment without seeing your analysis. Some output 
files depend on particular attributes being set correctly in the annotation 
file. You may want to search through our mailing list archives and see if your 
question has already been answered: 
http://gmod.827538.n3.nabble.com/Galaxy-Users-f815892.html

Good luck,
J.___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/