Dear all,
if I issue a simple query like this:
<Query virtualSchemaName = "default" header = "0" count = ""
softwareVersion = "0.5" >
<Dataset name = "rnorvegicus_gene_ensembl" interface = "default" >
<Attribute name = "gene_stable_id" />
<Attribute name = "transcript_chrom_strand" />
<Attribute name = "5utr" />
<Filter name = "transcript_status" value = "KNOWN"/>
<Filter name = "ensembl_gene_id" value =
"ENSRNOG00000004258,ENSRNOG00000014029"/>
</Dataset>
</Query>
and align the resulting sequence with a result from this query:
<Query virtualSchemaName = "default" header = "0" count = ""
softwareVersion = "0.5" >
<Dataset name = "rnorvegicus_gene_ensembl" interface = "default" >
<Attribute name = "gene_stable_id" />
<Attribute name = "transcript_chrom_strand" />
<Attribute name = "5utr" />
<Filter name = "upstream_flank" value = "1000"/>
<Filter name = "transcript_status" value = "KNOWN"/>
<Filter name = "ensembl_gene_id" value =
"ENSRNOG00000004258,ENSRNOG00000014029"/>
</Dataset>
</Query>
then for the ENSRNOG00000004258 the first common base is in position
1001 (from the start of the longer sequence), and for the
ENSRNOG00000014029 the first common base is in position 1000.
The case-important difference between the two genes is strand:
positive for 14029 and negative for 4258. This behaviour isn't unique
to these two genes - seen the same for ~6 other genes.
Is this a minor inconsistency, or a feature?
Will appreciate your clarification,
--
Sincerely yours,
Bogdan Tokovenko,
PhD student at the Laboratory of Protein Biosynthesis,
Department of Genetic Information Translation Mechanisms,
Institute of Molecular Biology and Genetics, Kyiv, Ukraine
http://bogdan.org.ua/