Re: [mart-dev] xml query

Arek Kasprzyk Sun, 06 May 2007 02:38:07 -0700


On 5 May 2007, at 06:45, Aristotelis Tsirigos wrote:

I tried the following two queries using wget, one to get 5'UTR sequences and coordinates and the other to get the 3'UTR. In the two result files, however, the order of the attributes is not the same. Is there any way to control the order, or do attributes appear in random order?



Hi Aristotelis,

I can't reproduce this problem. The order in both files appears the same to me:

Here are the queries (in this case, DATASET=dmelanogaster_gene_ensembl):
wget -q 'http://www.biomart.org/biomart/martservice?query=<?xml version="1.0" encoding="UTF-8"?> <Query virtualSchemaName = "default" header = "1" count = "" softwareVersion = " 0.5" > <Dataset name = "'$DATASET'" interface = "default" > <Attribute name = "gene_stable_id" /> <Attribute name = "5utr" /> <Attribute name = "transcript_stable_id" /> <Attribute name = "str_chrom_name" /> <Attribute name = "transcript_chrom_strand" /> <Attribute name = "5utr_start" /> <Attribute name = "5utr_end" /> </Dataset> </Query>' -O 5utr.dat

5' UTR    Ensembl Gene ID    Ensembl Transcript ID    Chromosome    Strand    5 UTR Start (Chr bp)    5 UTR End (Chr bp)

Sequence unavailable    CG2657    CG2657-RA    2L    -1

CTACTCGCATGTAGAGATTTCCACTTATGTTTTCTCTACTTTCAGCAACCGAGAAGAGAACCCACGTTTGAACAAGTATCGGCGTGTGGACAACAGCTATCCCCGCTTCATAACGAATGAGGCTGCCGAGGACCTGATTTACAAGAAGTCC    CG11023    CG11023-RA    2L    1    7529    7679

CGCAGTTGAACGCAGGTTGAGCAGGAAGCTAGTCGAGACTATAATCCATATCTTGTCTGATCCTTTGTTCAAAACCACACTCCACCAACAATTTAGCCGACCGGAACTCGGGTTATAGCACTGCTCCCCCATTGCCCCTTCAAACTTCGAGTTACATATTACAAACTACCCATCAAC    CG2674    CG2674-RB    2L    1    107760;108588    107838;108685

wget -q 'http://www.biomart.org/biomart/martservice?query=<?xml version="1.0" encoding="UTF-8"?> <Query virtualSchemaName = "default" header = "1" count = "" softwareVersion = " 0.5" > <Dataset name = "'$DATASET'" interface = "default" > <Attribute name = "gene_stable_id" /> <Attribute name = "3utr" /> <Attribute name = "transcript_stable_id" /> <Attribute name = "str_chrom_name" /> <Attribute name = "transcript_chrom_strand" /> <Attribute name = "3utr_start" /> <Attribute name = "3utr_end" /> </Dataset> </Query>' -O 3utr.dat

3' UTR    Ensembl Gene ID    Ensembl Transcript ID    Chromosome    Strand    3 UTR Start (Chr bp)    5 UTR End (Chr bp)

Sequence unavailable    CG2657    CG2657-RA    2L    -1

CAGTAGAATCACACAGCTACGCAAGAATGTGGAGAATCCAGTTTAGTTATTTTTACAAATCTTACGTAAACACTCCAAGCATGAATTCGCAACAAGTGCTTAGCTATTTAATTGAATTGAGCTGGCCGAGAGATGTGCTGGTGCAATAACTTGTTCTCATATCTGATTGTAACAGAGAATCTAGTTTTTCAATAAAATTTCCCCAAGTAAAAACA    CG11023    CG11023-RA    2L    1    9277    7679


and that is:

sequence_type (3utr, 5utr), gene_stable_id, transcript_stable_id, str_chrom_name, transcript_chrom_strand, 3(5)utr_start, 3(5)utr_end

The order of attributes is determined by the order in which you specify them in your query, with only one exception: the sequence type takes precedence over the other attributes and is listed first. This is probably what confuses the issue. The actual order is reflected by the display names of your attributes in the header of your file.
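
To make the rule above concrete, here is a minimal Python sketch of it. It is only an illustration, not martservice code: the function name expected_column_order and the SEQUENCE_ATTRIBUTES set are made up for this example, and the set lists only the two sequence attributes used in this thread (other sequence types are not covered).

```python
# Hypothetical helper, not part of BioMart: predicts the column order of
# the result file from the attribute order given in the query.

# Assumption: only the two sequence attributes that appear in this thread.
SEQUENCE_ATTRIBUTES = {"5utr", "3utr"}


def expected_column_order(attributes):
    """Return the attributes with any sequence attribute moved to the front.

    All other attributes keep the order in which they were specified.
    """
    sequence = [a for a in attributes if a in SEQUENCE_ATTRIBUTES]
    others = [a for a in attributes if a not in SEQUENCE_ATTRIBUTES]
    return sequence + others


# Attribute order exactly as written in the 5' UTR query above.
query_order = [
    "gene_stable_id",
    "5utr",
    "transcript_stable_id",
    "str_chrom_name",
    "transcript_chrom_strand",
    "5utr_start",
    "5utr_end",
]

for column in expected_column_order(query_order):
    print(column)
```

For the 5' UTR query this prints 5utr first, followed by the remaining attributes in query order, which matches the header of 5utr.dat shown above.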


hope that helps,
a.

-------------------------------------------------------------------------------

Arek Kasprzyk
EMBL-European Bioinformatics Institute.
Wellcome Trust Genome Campus, Hinxton,
Cambridge CB10 1SD, UK.
Tel: +44-(0)1223-494606
Fax: +44-(0)1223-494468

-------------------------------------------------------------------------------
