Hello Elizabeth,
You are exactly right -- it's the UTR. You can see it more clearly in
this
Session in the Genome Browser:
http://genome.ucsc.edu/cgi-bin/hgTracks?hgS_doOtherUser=submit&hgS_otherUserName=Ann&hgS_otherUserSessionName=hg19_NM_001037165
Please don't hesitate to contact the mail list again if you require
further
assistance.
Regards,
----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
Elizabeth Tseng wrote:
> Hi,
> I'm using the latest version of human genome (hg19, Feb. 2009) and have
> downloaded the refGene table. I'm having trouble understanding what cdsStart
> - cdsEnd, exonStarts, exonEnds stand for. I had thought originally that if
> you concatenate all the exons (which can be obtained by looking at
> corressponding entries in exonStarts and exonEnds), you would get the
> transcribed mRNA that would eventually be translated into the amino acids.
> It seems like that is not the case.
>
> For example, in refGene table the entry with name='NM_001037165' has the
> following fields:
>
> cdsStart = 4721939
> cdsEnd = 4802095
> exonStarts =
> 4721929,4780468,4794089,4794867,4796624,4798681,4798941,4800694,4801814
> exonEnds =
> 4722499,4780654,4794246,4795014,4796818,4798848,4799226,4800919,4811074,
>
> As you can see cdsEnd != the last of exonEnds, which is 4811074. Then what
> exactly is the region 4802096 - 4811074? Is it an exon? Is it
> transcribed/translated? Or is it part of the UTR?
>
>
> Thanks in advance,
> Huei-Hun Elizabeth Tseng
> _______________________________________________
> Genome maillist - [email protected]
> https://lists.soe.ucsc.edu/mailman/listinfo/genome
_______________________________________________
Genome maillist - [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome