Re: [galaxy-user] Cufflinks Annotation problem

2011-11-12 Thread Xiangming Ding

Hi everyone:

i met a problem with the GTF file from ucsc. I uploaded the GTF file  
from ucsc to galaxy and use this file to run cuffcompare. The running  
is fine. But in the output file I cannot find the gene_name( gene  
symbol) only gene_id and transcript_id. it was difficult for me to  
analyze the result if there was no gene name inforation. so where i  
can get GTF file that contain the gene_name information thus it will  
be convenient for the downstream analysis.


xiangming








Quoting Jennifer Jackson j...@bx.psu.edu:


Hi Chris,

Clearing the incorrect database assignment can be done by clicking  
on the pencil icon for the dataset, and in the database pull down  
menu, select the list header:


- Additional Species Are Below -

As you mention, chromosome names need to be identical between the  
reference fasta genome, any BAM files, and any input GTF files.  
Correct sorting and the GTF file's content are also important.


Since the iGenomes dataset is specifically designed to work with  
this software package, it seems worth contacting the tool authors if  
expected results are still coming up. They will know the dataset the  
best. Should a problem with Galaxy be uncovered, we would be very  
glad to learn about it and make necessary corrections.


Tool author's:
   web site: http://cufflinks.cbcb.umd.edu/
   mailing list: tophat.cuffli...@gmail.com

Thanks!

Jen
Galaxy team

On 11/11/11 8:29 AM, Bidwell, Christopher A. wrote:

Colleagues,

I am having trouble running cufflinks with an annotation file on the
public Galaxy.The assembled transcripts gtf file has all the FPKM at 0
although the gene expression and transcript expression tab files have
values for FPKM.

I have seen the SEQanswers threads about the compatibility of tophat bam
files relative to chromosomes labeled as 1,2,3... versus Chr1, Chr2,
Chr3...I am using the iGenomes bovine UMD3.1 genome and annotation file
(chromosomes are 1,2,3) from the history.I altered the gtf file to Chr1,
Chr2, Chr3... but it did not help.

Another potential discrepency/conflict is that the genome and gtf file
have the bosTau6 database attribute from when I uploaded them. However I
am running them from the history (bosTau6 is not an option for tophat).I
do not seem to be able to remove the attribute.

Am I missing something else?

Here is the command line

Info: cufflinks v1.0.3
cufflinks -q --no-update-check -I 5 -F 0.05 -j 0.05 -p 8 -G
/galaxy/main_database/files/003/142/dataset_3142240.dat -N -b ref.fa

Here is the details page

Tool: Cufflinks
Name: Cufflinks on data 69, data 4, and data 5: assembled transcripts
Created: Nov 09, 2011
Filesize: 44.6 Mb
Dbkey: bosTau6
Format: gtf
Tool Version:

Input Parameter Value
SAM or BAM file of aligned RNA-Seq reads 4: Tophat for Illumina on data
4 and data 69: accepted_hits
Max Intron Length 5
Min Isoform Fraction 0.05
Pre MRNA Fraction 0.05
Perform quartile normalization Yes
Conditional (reference_annotation) 1
Reference Annotation 5: iGen_UMD3_1_genes.gtf
Conditional (bias_correction) 0
Conditional (seq_source) 1
Using reference file 69: UMD31_iGen_1-29X.fa
Conditional (singlePaired) 0

Cordially,

Chris



___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/wiki/Support
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] question about uploading data through URL method

2011-11-07 Thread Xiangming Ding

Hi
 the file name is spr097786.fastq.bz2.After upload it showed  
spr097786.fastq. It showed it only contain around 5000 sequence reads.
 I also tried to upload through FTP. so i download the file to my  
computer and then upload to FTP in galaxy.  the totlal 800M file was  
uploaded to the FTP successfully. But when i transfered the file to  
the history i met the same problem. only 5000 sequence reads was moved  
to history. I donnot whether it is because of the bz2 file extension.  
or i should try other compressed file extension.


xaingmimg

Quoting Jennifer Jackson j...@bx.psu.edu:


Hello Xiangmimg,

Data files can be loaded using a URL on the Get Data = Upload  
form. FTP and HTTP connections are supported. This is briefly  
described on that form.


If you are still having issues, there may be a problem with file  
compression or the connection. Downloading locally then using  
Galaxy's FTP upload function is certainly an option.

http://wiki.g2.bx.psu.edu/Learn/Upload%20via%20FTP

Best,

Jen
Galaxy team

On 11/6/11 8:54 PM, Xiangming Ding wrote:

Hi galaxy

I am a new user of galaxy. i met a problem and didnot find similar
question in FAQ. I wanted to upload the data from DDBJ DRA dataset to
galaxy through UTL method. The file is around 800M. However after
uploading, the FASTQ file was just around 2M. So I wanted know whether
it is possible to upload a large file to galaxy through URL method? or I
should download the file to my pc and then uploading to galaxy through
FTP method.

Thanks

xiangmimg
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org
http://galaxyproject.org/wiki/Support




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/