Hi Susanne,

I forgot to mention some very important things to be aware of when 
comparing Human and Chimp.

Special differences about the Chimp reference genome:
1) Portions of the human genome were used in the Chimp assembly process.
2) The RefSeq Genes track includes Human Proteins 

See the Chimp gateway page for information (scroll down to credits and link
out to the data source for details).

Jennifer


------------------------------------------------ 
Jennifer Jackson 
UCSC Genome Bioinformatics Group 

----- "Jennifer Jackson" <[email protected]> wrote:

> From: "Jennifer Jackson" <[email protected]>
> To: "Susanne Knapp" <[email protected]>
> Cc: [email protected]
> Sent: Wednesday, January 27, 2010 12:07:20 PM GMT -08:00 US/Canada Pacific
> Subject: Re: [Genome] ortholog positions human chimp
>
> Hi Susanne,
> 
> The liftOver output has both the query and the target in the output 
> lines, but it is based on the region captured by the filter
> coordinates.
> And identifiers are lost during this processing. This is the expected
> result.
> 
> The choice of using liftOver for cross-species identification of
> Orthologs
> is probably not appropriate. LiftOver identifies the best match on
> one
> direction from-> to. It is a "best homology" tool. An Ortholog is in
> general 
> recognized as a "best reciprocal match" but pseudogenes and other 
> complications (in particular the quality/completeness of the genomes)
> 
> get in the way of finding a clean result. Gene duplications, Cognate 
> groupings that involve more that one gene in any of the targeted
> genomes, 
> etc. can make a simple, automated analysis using just these types of
> rules can 
> be misleading. In many cases, even for closely related species, the
> best 
> hit in one direction is not the Ortholog, it is down farther in the
> hit list.
> 
> These are the recommended tracks to use:
> 
> Use the UCSC Genes track. If your gene is here, many have Orthologs
> identified, some with supporting evidence that goes beyond a
> simple homology comparision. Read the track description to see if
> this is something that would work for you.
> 
> Use the TransMap track. An explanation is here:
> https://lists.soe.ucsc.edu/pipermail/genome/2008-July/016784.html
> 
> For more distantly related species - the process becomes more
> complicated, but some suggestions are:
> 
> Use the conservation track's MAF alignments. These are filtered to be
> reciprocal-best between the main reference genome (human) and each
> target. Again, the track description provides details for methods.
> 
> Use Nets as a starting place (liftOver is based on Chains, and Chains
> are
> used to build Nets). This is still in one direction, but it is a
> summary of 
> sorts - Nets are a better choice as the results are ranked in
> different way
> that can be informative. And don't forget to also look at the net in
> the opposite 
> direction, contained in the target genome's browser. Net methods are
> also in 
> the track description to explain the ranking rules.
> 
> For future queries:
> For a full merge (between any two tracks - including custom tracks),
> try
> using the tools in Galaxy. The output can be formatted to return both
> the
> query and the target in the same line. Galaxy understand the data
> formats
> used by UCSC and can perform "interval" comparisons. Data can be
> directly
> exported from the Table browser to Galaxy as an output option and
> returned 
> as a custom track if it fits one of the formats accepted by the Custom
> 
> Track submission tool. You may need to reformat the data and use the
> name, name2 fields to store information.
> 
> Hopefully this helps to explain the data a bit better and you are able
> to 
> determine the best method for your experiment.
> 
> Good luck and please let us know if we can help more,
> Jennifer
> 
> 
> 
> 
> ------------------------------------------------ 
> Jennifer Jackson 
> UCSC Genome Bioinformatics Group 
> 
> ----- "Susanne Knapp" <[email protected]> wrote:
> 
> > From: "Susanne Knapp" <[email protected]>
> > To: "[email protected]" <[email protected]>
> > Sent: Wednesday, January 27, 2010 8:32:52 AM GMT -08:00 US/Canada
> Pacific
> > Subject: [Genome] ortholog positions human chimp
> >
> > Dear UCSC,
> > 
> > I have discovered the LiftOver function on your website which I
> wanted
> > to use to convert a list of 12,000 positions in the human genome to
> > their ortholog positions in the chimpanzee genome in order to see
> and
> > plot their location. However, using the LiftOver function, I only
> was
> > able to create a list of output positions, where I can't see
> anymore
> > to which position on the human chromosome they relate, and as some
> > positions fail to be mapped (due to gaps in the chimp chromosome?),
> 
> > it is impossible to keep track as the failed markers are listed in
> a
> > separate file. Any better way to do this?
> > 
> > Looking forward to your solution to my problem,
> > 
> > Kind regards,
> > 
> > Susanne Knapp
> > _______________________________________________
> > Genome maillist  -  [email protected]
> > https://lists.soe.ucsc.edu/mailman/listinfo/genome
> _______________________________________________
> Genome maillist  -  [email protected]
> https://lists.soe.ucsc.edu/mailman/listinfo/genome
_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to