Hi Claude
I will look into this issue and get back to you as soon as possible.
Regards
Rhoda

On 21 Sep 2011, at 13:58, Arek Kasprzyk wrote:

Hi Claude,
From my previous experience working on Ensembl rember that occassionally martdb and ftp would get out of sync. Perhaps this is what happened this time?

I am forwarding your email to the users mailing list. Rhoda or someone else from Ensembl should be able to comment on this.


a



On Tue, Sep 20, 2011 at 12:00 PM, Claude Chelala <[email protected]> wrote:

------ Forwarded Message
From: Claude Chelala <[email protected]>
Date: Tue, 20 Sep 2011 16:55:36 +0100
To: Arek Kasprzyk <[email protected]>
Cc: Dayem Ullah <[email protected]>
Conversation: Ensembl mart tables - corrupted?
Subject: Ensembl mart tables - corrupted?

Dear Arek

Dayem (cc’d on this email) joined my group recently and is in charge of updating SNPnexus software preparing for a new release. He is experiencing few problems when working with ensembl_mart_63 (and 64) tables and would appreciate your help to sort this out.

When working on biomart release 63, he observed the following discrepancy between Public MySQL Server martdb.ensembl.org and the corresponding downloadable version (Pub Mysql tables as in ensembl release from ftp://ftp.ensembl.org/pub/release-63/mysql/ensembl_mart_63) .

ftp://ftp.ensembl.org/pub/release-63/mysql/ensembl_mart_63
Database name: ensembl_mart_63
Table name: hsapiens_gene_ensembl__exon_transcript__dm

It appears that exons are not mapped properly with the corresponding transcripts from hsapiens_gene_ensembl__transcript__main table. The mapping at martdb.ensembl.org tables seems to be correct.

The exon information with respect to exon_id (exon_id_1017) are mostly different in two versions. With respect to exon name (stable_id_1016), the other information appears to be same in two versions except the corresponding mapping with transcript (transcript_id_1064_key).

The example is shown in the file attached. For transcript ENST00000302036 (transcript_id_1064_key=342178) the pub release version mapping to corresponding exons is incorrect, whereas public server martdb.ensembl.org version gives correct mapping. Subsequently, we can see that the same exon id refers to different exons or same exon maps to different transcript.

We suspect that the ensembl Mysql release version of the table hsapiens_gene_ensembl__exon_transcript__dm is corrupted and martdb.ensembl.org table is correct. However, we noted another point: the number of rows in the tables are different as well:

martdb.ensembl.org
Database name: ensembl_mart_63
Table name: hsapiens_gene_ensembl__exon_transcript__dm
1161741 rows

ftp://ftp.ensembl.org/pub/release-63/mysql/ensembl_mart_63
Database name: ensembl_mart_63
Table name: hsapiens_gene_ensembl__exon_transcript__dm
1178393 rows

Could you please help us to get the correct mappings and tables?

Thank you

Regards
Claude
------ End of Forwarded Message




This email may contain information that is privileged, confidential or otherwise protected from disclosure. It must not be used by, or its contents copied or disclosed to, persons other than the addressee. If you have received this email in error please notify the sender immediately and delete the email.
This message has been scanned for viruses.


_______________________________________________
Users mailing list
[email protected]
https://lists.biomart.org/mailman/listinfo/users

Rhoda Kinsella Ph.D.
Ensembl Bioinformatician,
European Bioinformatics Institute (EMBL-EBI),
Wellcome Trust Genome Campus,
Hinxton
Cambridge CB10 1SD,
UK.

_______________________________________________
Users mailing list
[email protected]
https://lists.biomart.org/mailman/listinfo/users

Reply via email to