> I cannot imagine that this it would be possible for an exon to project
> out beyond a transcription region.

I could imagine it happening occasionally as an annotation error. 


>   transcript = ensembl_gene_db[transcript_id]
> 
>   exon5 = transcript.exons[5]
>   print exon5.start, exon5.stop
> 
> would be easy to make work and to have the 'start' and 'stop' be generated
> dynamically upon request (last line).
> 
> Is this the sort of thing we need?

The UCSC known gene table also encodes exon information as two blobs of comma 
seperated lists. I think it is a common pattern for their database schema.

When I've been using it I usually split all the exons out into an NLMSA and 
mostly ignore the start stop locations in the annotation table. (On the other 
hand I barely know how to use pygr).

Diane

Attachment: pgpOlHJVt9pDx.pgp
Description: PGP signature

Reply via email to