[R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread pasquale.r...@libero.it


Of course Ted is right, but my problem with this computation, or with the 
simple exercise I was proposing is well another: as a paleontologist I often 
come across pretty exceptional phenotypes (dwarf hippos and elephants, huge 
flightless birds, to make a few examples). When you use methods like this (I 
mean Garland and Ives') and compare the output with those phenotypes, as I did, 
you immediately realize what the the bottom line is: no matter if they are 
nodes or tips, by using the expected (under BM) covariance the estimated 
phenotypes are dull, perfectly reasonable but very different from anything 
exceptional you may find yourself to work with. This is why I feel it is 
difficult to rely on those (unobserved) values to begin with.
Any opinion?
Pas



Messaggio originale
Da: theodore.garl...@ucr.edu
Data: 05/08/2011 18.24
A: Hunt, Genehu...@si.edu, r-sig-phylo@r-project.orgr-sig-phylo@r-
project.org
Ogg: Re: [R-sig-phylo] R:  ancestral state reconstruction for tips

The methods in the Garland and Ives (2000) paper are in our package of DOS 
PDAP programs, and should also be functional in the PDAP module of Mesquite.

Cheers,
Ted

Theodore Garland, Jr.
Professor
Department of Biology
University of California, Riverside
Riverside, CA 92521
Office Phone:  (951) 827-3524
Home Phone:  (951) 328-0820
Facsimile:  (951) 827-4286 = Dept. office (not confidential)
Email:  tgarl...@ucr.edu
http://www.biology.ucr.edu/people/faculty/Garland.html

Experimental Evolution: Concepts, Methods, and Applications of Selection 
Experiments
Edited by Theodore Garland, Jr. and Michael R. Rose
http://www.ucpress.edu/book.php?isbn=9780520261808
(PDFs of chapters are available from me or from the individual authors)


From: r-sig-phylo-boun...@r-project.org [r-sig-phylo-boun...@r-project.org] 
on behalf of Hunt, Gene [hu...@si.edu]
Sent: Friday, August 05, 2011 8:35 AM
To: r-sig-phylo@r-project.org
Subject: Re: [R-sig-phylo] R:  ancestral state reconstruction for tips

Also, the issue of predicting values for unknown tips using data from other 
species in the tree is considered in this reference:

Garland, T., and A. R. Ives. 2000. Using the past to predict the present: 
confidence intervals for regression equations in phylogenetic comparative 
methods. American Naturalist 155(3):346-364.

Best,
Gene



On 8/5/11 11:31 AM, pasquale.r...@libero.it pasquale.r...@libero.it 
wrote:




Hi Morgan,

this is just stuff for thought, and remember, this is wrong anyway. But you
may try something like this:

1. compute pics,
2. take the pic value at the ancestral node subtending to your unknown tip,
3. pretend one of the two tips the pic was originally computed on is in fact
your unknown species,
4. modify the square of the summed branch lengths of the two species by using
the new bl,
5. use the formula for pics (standardized) to derive your unknown tip value 
by
using the other (real) species tip value and the new square of summed branch
lengthts


but again, remember this is wrong, because contrasts were computed without
your unknown species. With ace everything turns out to be much more 
complicated
because ancestral value estimations are 'optimized' by taking the entire tree
and distribution of values at once, so to speak.





Messaggio originale
Da: morgan.g.i.langi...@gmail.com
Data: 05/08/2011 14.15
A: r-sig-phylo@r-project.org
Ogg: [R-sig-phylo] ancestral state reconstruction for tips

I was wondering if there is a way to get ancestral state
reconstructions not for nodes within the tree but for tips that I
don't know the trait of.  I could do this somewhat manually, by taking
the ancestral state resconstruction from the parent and child nodes
surrounding where my unknown tip branches off from the tree and
averaging those results (weighted by the branch length). This approach
seems kind of clunky, so I was hoping there was something better.



Morgan Langille

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo



--
Gene Hunt
Curator, Department of Paleobiology
National Museum of Natural History
Smithsonian Institution [NHB, MRC 121]
P.O. Box 37012
Washington DC 20013-7012
Phone: 202-633-1331  Fax: 202-786-2832
http://paleobiology.si.edu/staff/individuals/hunt.cfm

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Joe Felsenstein

Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or  
 with the
 simple exercise I was proposing is well another: as a  
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and  
 elephants, huge
 flightless birds, to make a few examples). When you use methods  
 like this (I
 mean Garland and Ives') and compare the output with those  
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if  
 they are
 nodes or tips, by using the expected (under BM) covariance the  
 estimated
 phenotypes are dull, perfectly reasonable but very different from  
 anything
 exceptional you may find yourself to work with. This is why I feel  
 it is
 difficult to rely on those (unobserved) values to begin with.

I think that what is being said is that Brownian Motion is too sedate  
a process
and does not predict some of the large changes actually seen in the  
fossil
record.

That's a legitimate point but does put the onus on the maker of the  
point to
propose some other stochastic process that is tractable and has these  
large
changes (and that fits with known Mendelian and Darwinian mechanisms).
Just complaining that the Brownian stochastic process is no good is  
insufficient.

If we want to add the fossils to the calculation, then they will of  
course
pressure the Brownian Motion process to change more in their vicinity,
which may help some.

Joe

Joe Felsenstein  j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,  
Box 5065, Seattle Wa 98195-5065


[[alternative HTML version deleted]]

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread David Bapst
As the diversity of explicit models of trait evolution grow, it will
be interesting to see if any consensus develops about which models
hold most often in general and whether any insight is gained into
which conditions predict appearance of different models.

I think Joe is right that realizing a model is an inaccurate or
imprecise description of reality should impel us to develop better
models of the world around us, because this partly how science moves
forward. However, I don't think pointing out that a model is deficient
requires that that person must themselves develop an alternative.
After all, an alternative model that capture a more realistic level of
complexity may not be possible in some situations (it is certainly
possible in trait evolution models, however.) Requiring such a thing
would put too much pressure on scientific whistle-blowers, who play a
very important role in reminding the rest of us that the world is more
than the models we use to understand it and make our predictions.

-Dave




On Fri, Aug 5, 2011 at 10:51 AM, Joe Felsenstein j...@gs.washington.edu wrote:

 Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or
 with the
 simple exercise I was proposing is well another: as a
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and
 elephants, huge
 flightless birds, to make a few examples). When you use methods
 like this (I
 mean Garland and Ives') and compare the output with those
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if
 they are
 nodes or tips, by using the expected (under BM) covariance the
 estimated
 phenotypes are dull, perfectly reasonable but very different from
 anything
 exceptional you may find yourself to work with. This is why I feel
 it is
 difficult to rely on those (unobserved) values to begin with.

 I think that what is being said is that Brownian Motion is too sedate
 a process
 and does not predict some of the large changes actually seen in the
 fossil
 record.

 That's a legitimate point but does put the onus on the maker of the
 point to
 propose some other stochastic process that is tractable and has these
 large
 changes (and that fits with known Mendelian and Darwinian mechanisms).
 Just complaining that the Brownian stochastic process is no good is
 insufficient.

 If we want to add the fossils to the calculation, then they will of
 course
 pressure the Brownian Motion process to change more in their vicinity,
 which may help some.

 Joe
 
 Joe Felsenstein      j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,
 Box 5065, Seattle Wa 98195-5065


        [[alternative HTML version deleted]]

 ___
 R-sig-phylo mailing list
 R-sig-phylo@r-project.org
 https://stat.ethz.ch/mailman/listinfo/r-sig-phylo




-- 
David Bapst
Dept of Geophysical Sciences
University of Chicago
5734 S. Ellis
Chicago, IL 60637
http://home.uchicago.edu/~dwbapst/

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Joe Felsenstein

David Bapst wrote:

 I think Joe is right that realizing a model is an inaccurate or
 imprecise description of reality should impel us to develop better
 models of the world around us, because this partly how science moves
 forward. However, I don't think pointing out that a model is deficient
 requires that that person must themselves develop an alternative.
 After all, an alternative model that capture a more realistic level of
 complexity may not be possible in some situations (it is certainly
 possible in trait evolution models, however.) Requiring such a thing
 would put too much pressure on scientific whistle-blowers, who play a
 very important role in reminding the rest of us that the world is more
 than the models we use to understand it and make our predictions.

As a theoretician, I am perhaps oversensitive to the folks who, after
a lecture in which I present a simple model, triumphantly declare
but you didn't include predator satiation.  Then they walk away
looking very pleased with themselves.

There is a similar problem with the quibblers who inhabit grant
review panels, and are always asking me to do much more
complicated models that are impossibly hard (and they
are not aware how hard they are).

Just understand, when you raise legitimate concerns, that us
model-analyzers are also used to getting a lot of these unreasonable
demands too, and may be grumpy as a result.

Joe

Joe Felsenstein  j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,  
Box 5065, Seattle Wa 98195-5065


[[alternative HTML version deleted]]

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Morgan Langille
Hello everyone. I just wanted to say thank you for all of the
responses so far and have thoroughly enjoyed the discussion. Just for
reference I thought I would explain what I am doing in more detail and
why I asked my original question.

I am interested in developing a practical method for using 16s
sequences to infer function. This is to hopefully aid in metagenomic
experiments where we would like to compare the observed functions
(quantity of a particular protein family) in a metagenomic sample to
what we would expect based on the species that we believe are
present in the sample based on 16s sequences (or possibly some other
marker down the road). The current pipeline starts with a 16s
reference tree for all Archaea and Bacteria completed genomes (~1400
species). We know the functions within these genomes so I would like
to leverage that information along with the tree to predict (as best
as possible) what the functions would be for a newly placed species on
that tree.

One method would be simply to take the nearest neighbour (the
species with minimum 16s branch length) to the new species and use the
functions encoded in that genome as a representative. However, this is
very naive. I then turned to ancestral state reconstruction and
current methods (if any) for predicting characters for species that we
don't have information on for those traits.

I realize that caution has to be used for predicting these functions,
as mentioned below by Pasquale, but I am mostly searching for some
best practices to use in my current situation.
The results should be interesting since I will be testing how well the
method does across ~1 functions (e.g. PFAMs). Many of these are
not predictable at all since their phylogenetic signal is basically
nill due to horizontal gene transfer. However, I am optimistic that
many functions will be reliably predictable.


Sincerely,

Morgan Langille


On Fri, Aug 5, 2011 at 2:51 PM, Joe Felsenstein j...@gs.washington.edu wrote:

 Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or
 with the
 simple exercise I was proposing is well another: as a
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and
 elephants, huge
 flightless birds, to make a few examples). When you use methods
 like this (I
 mean Garland and Ives') and compare the output with those
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if
 they are
 nodes or tips, by using the expected (under BM) covariance the
 estimated
 phenotypes are dull, perfectly reasonable but very different from
 anything
 exceptional you may find yourself to work with. This is why I feel
 it is
 difficult to rely on those (unobserved) values to begin with.

 I think that what is being said is that Brownian Motion is too sedate
 a process
 and does not predict some of the large changes actually seen in the
 fossil
 record.

 That's a legitimate point but does put the onus on the maker of the
 point to
 propose some other stochastic process that is tractable and has these
 large
 changes (and that fits with known Mendelian and Darwinian mechanisms).
 Just complaining that the Brownian stochastic process is no good is
 insufficient.

 If we want to add the fossils to the calculation, then they will of
 course
 pressure the Brownian Motion process to change more in their vicinity,
 which may help some.

 Joe
 
 Joe Felsenstein      j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,
 Box 5065, Seattle Wa 98195-5065


        [[alternative HTML version deleted]]

 ___
 R-sig-phylo mailing list
 R-sig-phylo@r-project.org
 https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo