Hello Elizabeth,

        You are exactly right -- it's the UTR.  You can see it more clearly in 
this 
Session in the Genome Browser:

http://genome.ucsc.edu/cgi-bin/hgTracks?hgS_doOtherUser=submit&hgS_otherUserName=Ann&hgS_otherUserSessionName=hg19_NM_001037165
 


        Please don't hesitate to contact the mail list again if you require 
further 
assistance.

Regards,

----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu



Elizabeth Tseng wrote:
> Hi,
>    I'm using the latest version of human genome (hg19, Feb. 2009) and have
> downloaded the refGene table. I'm having trouble understanding what cdsStart
> - cdsEnd, exonStarts, exonEnds stand for. I had thought originally that if
> you concatenate all the exons (which can be obtained by looking at
> corressponding entries in exonStarts and exonEnds), you would get the
> transcribed mRNA that would eventually be translated into the amino acids.
> It seems like that is not the case.
> 
> For example,  in refGene table the entry with name='NM_001037165' has the
> following fields:
> 
> cdsStart = 4721939
> cdsEnd = 4802095
> exonStarts =
> 4721929,4780468,4794089,4794867,4796624,4798681,4798941,4800694,4801814
> exonEnds  =
> 4722499,4780654,4794246,4795014,4796818,4798848,4799226,4800919,4811074,
> 
> As you can see cdsEnd != the last of exonEnds, which is 4811074. Then what
> exactly is the region 4802096 - 4811074? Is it an exon? Is it
> transcribed/translated? Or is it part of the UTR?
> 
> 
> Thanks in advance,
> Huei-Hun Elizabeth Tseng
> _______________________________________________
> Genome maillist  -  [email protected]
> https://lists.soe.ucsc.edu/mailman/listinfo/genome
_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to