Re: [pygr] More on Ensembl data in UCSC

Marek Szuba Wed, 16 Dec 2009 18:58:49 -0800

On Tue, 15 Dec 2009 06:37:44 -0800
"C. Titus Brown" <c...@msu.edu> wrote:



> could you post a script or two showing how you might work with this
> schema in practice?  Thanks!  Uncommented test scripts are just
> fine :)
I suggest you have a look at pygr/apps/ucsc_ensembl_annot.py in my
'ucsc_ensembl' GitHub branch. Note that I keep changing this script on
a regular basis; that said, in this post I shall use

http://github.com/mkszuba/pygr/commit/b50f1b8b792c58180cac10d778ba22ab263272d0

as reference. Don't mind the ugly code, it is as I mentioned earlier a
prototype and will be reorganised before the release.

> > One question: shall we hide from the user that protein data (s)he
> > gets is actually transcript data (i.e come up with some sort of a
> > wrapper) or just let him/her have it as it is?
> I don't understand this question... is the problem that the
> "transcript IDs" would actually be protein IDs?  If only for
> traceability, I would suggest a wrapper.
See lines 137-140 in my script, which produce the following output:

ENSP00000372525 annotENST00000383052[0:47226] chrY[2863321:2910547]

What I meant by hiding from the user was creating a wrapper that would
use data from the transcript database but with protein identifiers;
right now protein-to-transcript-ID mapping is done explicitly and the
database is still just a transcript database.

-- 
MS

--

You received this message because you are subscribed to the Google Groups 
"pygr-dev" group.
To post to this group, send email to pygr-...@googlegroups.com.
To unsubscribe from this group, send email to 
pygr-dev+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/pygr-dev?hl=en.

Re: [pygr] More on Ensembl data in UCSC

Reply via email to