Hi Alden,

These fields provide additional information about the status of the 
start and end of a gene's coding region. The possible statuses are:

   - none - no CDS specified from the sequence's data source.
   - unk - unknown - not known if CDS start/end is complete.
   - incmpl - the CDS start/end is incomplete
   - cmpl - the CDS start/end is complete.

cdsStartStat refers to the cdsStart end of the gene, which is the start 
codon for a positive strand gene and the stop codon for a negative 
strand gene. cdsEndStat refers to the cdsEnd end of the gene, which is 
the stop codon for a positive strand gene and the start codon for a 
negative strand gene.

Please don't hesitate to contact the mail list again if you have any 
further questions.

Katrina Learned
UCSC Genome Bioinformatics Group

Alden Huang wrote, On 02/18/11 16:32:
> Hi,
>
> I just had a quick question...
>
> In the description of the Gene Predictions table format used by UCSC,
> under specifically Gene Predictions (Extended), it lists two fields:
>
>    string cdsStartStat;       "enum('none','unk','incmpl','cmpl')"
>    string cdsEndStat;         "enum('none','unk','incmpl','cmpl')"
>
> I would simply like to know the significance of these particular fields.
>
> thanks,
>
> alden
> _______________________________________________
> Genome maillist  -  [email protected]
> https://lists.soe.ucsc.edu/mailman/listinfo/genome
>   
_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to