Dear UCSC team,

I have a very long list of HUGO Gene Symbols of rat (rn4).

I would like to expand each HUGO Gene Symbol into a tab-separated line
containing certain fields which I can see when I click such a gene in the
genome browser in the UCSC Known Genes track. The fields which I want are:

>From the "Rat Gene (gene name here) Description and Page Index" box the
fields: "Description" and "RefSeq Summary".
>From the "Comments and Description Text from UniProtKB " box the field:
FUNCTION (and maybe others).

So for example the HUGO Gene Symbol "Irs1" needs to expand to:

Irs1    insulin receptor substrate 1    a docking protein; may act to link
the insulin receptor kinase with enzymes regulating cellular growth and
metabolism [RGD].     May mediate the control of various cellular processes
by insulin. When phosphorylated by the insulin receptor binds specifically
to various cellular proteins containing SH2 domains such as
phosphatidylinositol 3-kinase p85 subunit or GRB2. Activates
phosphatidylinositol 3-kinase when bound to the regulatory p85 subunit.

The items I want to expand each Gene Symbol may change later, f.e. later I
might also want to include other pieces of information like those from the
microarray box or the KEGG data.

Could you please give me some hints on how to get which data from which
table? I am starting from HUGO Gene Symbol only, so I think I need some
unique identifier, or even several identifiers?! Basically I would like to
ask how to get the data when having only the Gene Symbol. I am able to
program in Perl, Awk but I don't know which data is which table and how to
retrieve by which identifier. Thank you.

best regards,
Anton
_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to