Re: [R-sig-phylo] Unexpected behaviour: all.equal.phylo(), unique.multiPhylo(), and un/rerooting phylo objects

2011-08-05 Thread Emmanuel Paradis

Hi David,

It's a follow-up on your message.

David Williams wrote on 22/06/2011 07:31:


Dear R users and phylogeneticists,

I've been manipulating some phylo objects mostly with ape functions but 
also with some phangorn functions. I'm a bit confused with some effects 
of rerooting and the output of various functions. Any help would be 
greatly appreciated!


R code:

## Rerooting seems to fail sometimes:

library(ape)
tree - 
read.tree(text=G:1,H:1)g:1,(((A:1,B:1)c:1,(C:1,D:1)d:1)b:1,(E:1,F:1)e:1)a:1)f:1,K:1)h:1,I:1,J:1)i;) 


clan - list(c(G,H),c(A,B,C,D,E,F))[[2]]
layout(matrix(c(1,2), 1, 2), width = c(1, 1))
plot(tree,'u');nodelabels()
treeR - root(tree, outgroup = c(I,J,K), resolve.root = F)
plot(treeR,'u');nodelabels()
treeR$edge[1]
reorder(treeR)$edge[1]
## the root node of treeR is unchanged and within the requested
## outgroup :-(


This was a bug which is now fixed:

https://svn.mpl.ird.fr/ape/dev/ape/R/root.R


plot(tree,'u');nodelabels()
treeR - root(tree, outgroup = c(G,H), resolve.root = F)# 
put the root elsewhere temporarily

treeR - root(treeR, outgroup = c(I,J,K), resolve.root = F)
plot(treeR,'u');nodelabels()
treeR$edge[1]
reorder(treeR)$edge[1]
## the root node of treeR is now at the base of the outgroup as
## expected :-) but an additional rerooting was necessary.


That was indeed a work-around.

## all.equal.phylo() and therefore unique.phylo() are sensitive to edge 
## order in the edge matrix in unrooted trees:


clado1 - 
read.tree(text=((:1,(:1,:1):1):1,:1,((:1,:1):1,((:1,:1):1,:1):1):1);)
clado2 - 
read.tree(text=((:1,(:1,:1):1):1,(:1,:1):1,(:1,((:1,:1):1,:1):1):1);)

plot(clado1,'u');tiplabels();nodelabels()
plot(clado2,'u');tiplabels();nodelabels()
## I would say these are topologically identical

is.rooted(clado1)# FALSE
is.rooted(clado2)# FALSE
## confirm these phylo objects are not rooted

all.equal.phylo(clado1,clado2,use.tip.label=F,use.edge.length=F)# FALSE


The difficulty is that these trees have empty strings as tip labels, so 
the algorithm used by all.equal.phylo does not work in this situation. 
However, this affects only the unrooted trees as you noted. I'm going to 
add a warning in the help page to mention this fact.


Thanks for the report (and your patience).

Cheers,

Emmanuel


## all.equal.phylo thinks they are different . . . .
length(unique(c(clado1,clado2)))# 2

## things are as expected if a root is added in equivalent places:
library(phangorn)
plot(midpoint(clado1),'u');tiplabels();nodelabels()
plot(midpoint(clado2),'u');tiplabels();nodelabels()
## I would say these are topologically identical

## all.equal.phylo agrees this time
all.equal.phylo(unroot(midpoint(clado1)),unroot(midpoint(clado2)),use.tip.label=F,use.edge.length=F) 
# TRUE

length(unique(c(midpoint(clado1),midpoint(clado2#1

## if the phylo objects are topologically identical, I would expect
## a Robinson-Foulds distance of 0
RF.dist(clado1,clado2)# 4
RF.dist(reorder(clado1),reorder(clado2))# 4
RF.dist(reorder(midpoint(clado1)),reorder(midpoint(clado2)))# 6
RF.dist(midpoint(clado1),midpoint(clado2))# 6


Thanks!



--
Emmanuel Paradis
IRD, Jakarta, Indonesia
http://ape.mpl.ird.fr/

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


[R-sig-phylo] R: ancestral state reconstruction for tips

2011-08-05 Thread pasquale.r...@libero.it


Hi Morgan,

this is just stuff for thought, and remember, this is wrong anyway. But you 
may try something like this:

1. compute pics,
2. take the pic value at the ancestral node subtending to your unknown tip,
3. pretend one of the two tips the pic was originally computed on is in fact 
your unknown species,
4. modify the square of the summed branch lengths of the two species by using 
the new bl,
5. use the formula for pics (standardized) to derive your unknown tip value by 
using the other (real) species tip value and the new square of summed branch 
lengthts


but again, remember this is wrong, because contrasts were computed without 
your unknown species. With ace everything turns out to be much more complicated 
because ancestral value estimations are 'optimized' by taking the entire tree 
and distribution of values at once, so to speak.





Messaggio originale
Da: morgan.g.i.langi...@gmail.com
Data: 05/08/2011 14.15
A: r-sig-phylo@r-project.org
Ogg: [R-sig-phylo] ancestral state reconstruction for tips

I was wondering if there is a way to get ancestral state
reconstructions not for nodes within the tree but for tips that I
don't know the trait of.  I could do this somewhat manually, by taking
the ancestral state resconstruction from the parent and child nodes
surrounding where my unknown tip branches off from the tree and
averaging those results (weighted by the branch length). This approach
seems kind of clunky, so I was hoping there was something better.



Morgan Langille

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: ancestral state reconstruction for tips

2011-08-05 Thread Hunt, Gene
Also, the issue of predicting values for unknown tips using data from other 
species in the tree is considered in this reference:

Garland, T., and A. R. Ives. 2000. Using the past to predict the present: 
confidence intervals for regression equations in phylogenetic comparative 
methods. American Naturalist 155(3):346-364.

Best,
Gene



On 8/5/11 11:31 AM, pasquale.r...@libero.it pasquale.r...@libero.it wrote:




Hi Morgan,

this is just stuff for thought, and remember, this is wrong anyway. But you
may try something like this:

1. compute pics,
2. take the pic value at the ancestral node subtending to your unknown tip,
3. pretend one of the two tips the pic was originally computed on is in fact
your unknown species,
4. modify the square of the summed branch lengths of the two species by using
the new bl,
5. use the formula for pics (standardized) to derive your unknown tip value by
using the other (real) species tip value and the new square of summed branch
lengthts


but again, remember this is wrong, because contrasts were computed without
your unknown species. With ace everything turns out to be much more complicated
because ancestral value estimations are 'optimized' by taking the entire tree
and distribution of values at once, so to speak.





Messaggio originale
Da: morgan.g.i.langi...@gmail.com
Data: 05/08/2011 14.15
A: r-sig-phylo@r-project.org
Ogg: [R-sig-phylo] ancestral state reconstruction for tips

I was wondering if there is a way to get ancestral state
reconstructions not for nodes within the tree but for tips that I
don't know the trait of.  I could do this somewhat manually, by taking
the ancestral state resconstruction from the parent and child nodes
surrounding where my unknown tip branches off from the tree and
averaging those results (weighted by the branch length). This approach
seems kind of clunky, so I was hoping there was something better.



Morgan Langille

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo



--
Gene Hunt
Curator, Department of Paleobiology
National Museum of Natural History
Smithsonian Institution [NHB, MRC 121]
P.O. Box 37012
Washington DC 20013-7012
Phone: 202-633-1331  Fax: 202-786-2832
http://paleobiology.si.edu/staff/individuals/hunt.cfm

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: ancestral state reconstruction for tips

2011-08-05 Thread Theodore Garland Jr
The methods in the Garland and Ives (2000) paper are in our package of DOS PDAP 
programs, and should also be functional in the PDAP module of Mesquite.

Cheers,
Ted

Theodore Garland, Jr.
Professor
Department of Biology
University of California, Riverside
Riverside, CA 92521
Office Phone:  (951) 827-3524
Home Phone:  (951) 328-0820
Facsimile:  (951) 827-4286 = Dept. office (not confidential)
Email:  tgarl...@ucr.edu
http://www.biology.ucr.edu/people/faculty/Garland.html

Experimental Evolution: Concepts, Methods, and Applications of Selection 
Experiments
Edited by Theodore Garland, Jr. and Michael R. Rose
http://www.ucpress.edu/book.php?isbn=9780520261808
(PDFs of chapters are available from me or from the individual authors)


From: r-sig-phylo-boun...@r-project.org [r-sig-phylo-boun...@r-project.org] on 
behalf of Hunt, Gene [hu...@si.edu]
Sent: Friday, August 05, 2011 8:35 AM
To: r-sig-phylo@r-project.org
Subject: Re: [R-sig-phylo] R:  ancestral state reconstruction for tips

Also, the issue of predicting values for unknown tips using data from other 
species in the tree is considered in this reference:

Garland, T., and A. R. Ives. 2000. Using the past to predict the present: 
confidence intervals for regression equations in phylogenetic comparative 
methods. American Naturalist 155(3):346-364.

Best,
Gene



On 8/5/11 11:31 AM, pasquale.r...@libero.it pasquale.r...@libero.it wrote:




Hi Morgan,

this is just stuff for thought, and remember, this is wrong anyway. But you
may try something like this:

1. compute pics,
2. take the pic value at the ancestral node subtending to your unknown tip,
3. pretend one of the two tips the pic was originally computed on is in fact
your unknown species,
4. modify the square of the summed branch lengths of the two species by using
the new bl,
5. use the formula for pics (standardized) to derive your unknown tip value by
using the other (real) species tip value and the new square of summed branch
lengthts


but again, remember this is wrong, because contrasts were computed without
your unknown species. With ace everything turns out to be much more complicated
because ancestral value estimations are 'optimized' by taking the entire tree
and distribution of values at once, so to speak.





Messaggio originale
Da: morgan.g.i.langi...@gmail.com
Data: 05/08/2011 14.15
A: r-sig-phylo@r-project.org
Ogg: [R-sig-phylo] ancestral state reconstruction for tips

I was wondering if there is a way to get ancestral state
reconstructions not for nodes within the tree but for tips that I
don't know the trait of.  I could do this somewhat manually, by taking
the ancestral state resconstruction from the parent and child nodes
surrounding where my unknown tip branches off from the tree and
averaging those results (weighted by the branch length). This approach
seems kind of clunky, so I was hoping there was something better.



Morgan Langille

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo



--
Gene Hunt
Curator, Department of Paleobiology
National Museum of Natural History
Smithsonian Institution [NHB, MRC 121]
P.O. Box 37012
Washington DC 20013-7012
Phone: 202-633-1331  Fax: 202-786-2832
http://paleobiology.si.edu/staff/individuals/hunt.cfm

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


[R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread pasquale.r...@libero.it


Of course Ted is right, but my problem with this computation, or with the 
simple exercise I was proposing is well another: as a paleontologist I often 
come across pretty exceptional phenotypes (dwarf hippos and elephants, huge 
flightless birds, to make a few examples). When you use methods like this (I 
mean Garland and Ives') and compare the output with those phenotypes, as I did, 
you immediately realize what the the bottom line is: no matter if they are 
nodes or tips, by using the expected (under BM) covariance the estimated 
phenotypes are dull, perfectly reasonable but very different from anything 
exceptional you may find yourself to work with. This is why I feel it is 
difficult to rely on those (unobserved) values to begin with.
Any opinion?
Pas



Messaggio originale
Da: theodore.garl...@ucr.edu
Data: 05/08/2011 18.24
A: Hunt, Genehu...@si.edu, r-sig-phylo@r-project.orgr-sig-phylo@r-
project.org
Ogg: Re: [R-sig-phylo] R:  ancestral state reconstruction for tips

The methods in the Garland and Ives (2000) paper are in our package of DOS 
PDAP programs, and should also be functional in the PDAP module of Mesquite.

Cheers,
Ted

Theodore Garland, Jr.
Professor
Department of Biology
University of California, Riverside
Riverside, CA 92521
Office Phone:  (951) 827-3524
Home Phone:  (951) 328-0820
Facsimile:  (951) 827-4286 = Dept. office (not confidential)
Email:  tgarl...@ucr.edu
http://www.biology.ucr.edu/people/faculty/Garland.html

Experimental Evolution: Concepts, Methods, and Applications of Selection 
Experiments
Edited by Theodore Garland, Jr. and Michael R. Rose
http://www.ucpress.edu/book.php?isbn=9780520261808
(PDFs of chapters are available from me or from the individual authors)


From: r-sig-phylo-boun...@r-project.org [r-sig-phylo-boun...@r-project.org] 
on behalf of Hunt, Gene [hu...@si.edu]
Sent: Friday, August 05, 2011 8:35 AM
To: r-sig-phylo@r-project.org
Subject: Re: [R-sig-phylo] R:  ancestral state reconstruction for tips

Also, the issue of predicting values for unknown tips using data from other 
species in the tree is considered in this reference:

Garland, T., and A. R. Ives. 2000. Using the past to predict the present: 
confidence intervals for regression equations in phylogenetic comparative 
methods. American Naturalist 155(3):346-364.

Best,
Gene



On 8/5/11 11:31 AM, pasquale.r...@libero.it pasquale.r...@libero.it 
wrote:




Hi Morgan,

this is just stuff for thought, and remember, this is wrong anyway. But you
may try something like this:

1. compute pics,
2. take the pic value at the ancestral node subtending to your unknown tip,
3. pretend one of the two tips the pic was originally computed on is in fact
your unknown species,
4. modify the square of the summed branch lengths of the two species by using
the new bl,
5. use the formula for pics (standardized) to derive your unknown tip value 
by
using the other (real) species tip value and the new square of summed branch
lengthts


but again, remember this is wrong, because contrasts were computed without
your unknown species. With ace everything turns out to be much more 
complicated
because ancestral value estimations are 'optimized' by taking the entire tree
and distribution of values at once, so to speak.





Messaggio originale
Da: morgan.g.i.langi...@gmail.com
Data: 05/08/2011 14.15
A: r-sig-phylo@r-project.org
Ogg: [R-sig-phylo] ancestral state reconstruction for tips

I was wondering if there is a way to get ancestral state
reconstructions not for nodes within the tree but for tips that I
don't know the trait of.  I could do this somewhat manually, by taking
the ancestral state resconstruction from the parent and child nodes
surrounding where my unknown tip branches off from the tree and
averaging those results (weighted by the branch length). This approach
seems kind of clunky, so I was hoping there was something better.



Morgan Langille

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo



--
Gene Hunt
Curator, Department of Paleobiology
National Museum of Natural History
Smithsonian Institution [NHB, MRC 121]
P.O. Box 37012
Washington DC 20013-7012
Phone: 202-633-1331  Fax: 202-786-2832
http://paleobiology.si.edu/staff/individuals/hunt.cfm

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Joe Felsenstein

Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or  
 with the
 simple exercise I was proposing is well another: as a  
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and  
 elephants, huge
 flightless birds, to make a few examples). When you use methods  
 like this (I
 mean Garland and Ives') and compare the output with those  
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if  
 they are
 nodes or tips, by using the expected (under BM) covariance the  
 estimated
 phenotypes are dull, perfectly reasonable but very different from  
 anything
 exceptional you may find yourself to work with. This is why I feel  
 it is
 difficult to rely on those (unobserved) values to begin with.

I think that what is being said is that Brownian Motion is too sedate  
a process
and does not predict some of the large changes actually seen in the  
fossil
record.

That's a legitimate point but does put the onus on the maker of the  
point to
propose some other stochastic process that is tractable and has these  
large
changes (and that fits with known Mendelian and Darwinian mechanisms).
Just complaining that the Brownian stochastic process is no good is  
insufficient.

If we want to add the fossils to the calculation, then they will of  
course
pressure the Brownian Motion process to change more in their vicinity,
which may help some.

Joe

Joe Felsenstein  j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,  
Box 5065, Seattle Wa 98195-5065


[[alternative HTML version deleted]]

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread David Bapst
As the diversity of explicit models of trait evolution grow, it will
be interesting to see if any consensus develops about which models
hold most often in general and whether any insight is gained into
which conditions predict appearance of different models.

I think Joe is right that realizing a model is an inaccurate or
imprecise description of reality should impel us to develop better
models of the world around us, because this partly how science moves
forward. However, I don't think pointing out that a model is deficient
requires that that person must themselves develop an alternative.
After all, an alternative model that capture a more realistic level of
complexity may not be possible in some situations (it is certainly
possible in trait evolution models, however.) Requiring such a thing
would put too much pressure on scientific whistle-blowers, who play a
very important role in reminding the rest of us that the world is more
than the models we use to understand it and make our predictions.

-Dave




On Fri, Aug 5, 2011 at 10:51 AM, Joe Felsenstein j...@gs.washington.edu wrote:

 Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or
 with the
 simple exercise I was proposing is well another: as a
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and
 elephants, huge
 flightless birds, to make a few examples). When you use methods
 like this (I
 mean Garland and Ives') and compare the output with those
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if
 they are
 nodes or tips, by using the expected (under BM) covariance the
 estimated
 phenotypes are dull, perfectly reasonable but very different from
 anything
 exceptional you may find yourself to work with. This is why I feel
 it is
 difficult to rely on those (unobserved) values to begin with.

 I think that what is being said is that Brownian Motion is too sedate
 a process
 and does not predict some of the large changes actually seen in the
 fossil
 record.

 That's a legitimate point but does put the onus on the maker of the
 point to
 propose some other stochastic process that is tractable and has these
 large
 changes (and that fits with known Mendelian and Darwinian mechanisms).
 Just complaining that the Brownian stochastic process is no good is
 insufficient.

 If we want to add the fossils to the calculation, then they will of
 course
 pressure the Brownian Motion process to change more in their vicinity,
 which may help some.

 Joe
 
 Joe Felsenstein      j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,
 Box 5065, Seattle Wa 98195-5065


        [[alternative HTML version deleted]]

 ___
 R-sig-phylo mailing list
 R-sig-phylo@r-project.org
 https://stat.ethz.ch/mailman/listinfo/r-sig-phylo




-- 
David Bapst
Dept of Geophysical Sciences
University of Chicago
5734 S. Ellis
Chicago, IL 60637
http://home.uchicago.edu/~dwbapst/

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Joe Felsenstein

David Bapst wrote:

 I think Joe is right that realizing a model is an inaccurate or
 imprecise description of reality should impel us to develop better
 models of the world around us, because this partly how science moves
 forward. However, I don't think pointing out that a model is deficient
 requires that that person must themselves develop an alternative.
 After all, an alternative model that capture a more realistic level of
 complexity may not be possible in some situations (it is certainly
 possible in trait evolution models, however.) Requiring such a thing
 would put too much pressure on scientific whistle-blowers, who play a
 very important role in reminding the rest of us that the world is more
 than the models we use to understand it and make our predictions.

As a theoretician, I am perhaps oversensitive to the folks who, after
a lecture in which I present a simple model, triumphantly declare
but you didn't include predator satiation.  Then they walk away
looking very pleased with themselves.

There is a similar problem with the quibblers who inhabit grant
review panels, and are always asking me to do much more
complicated models that are impossibly hard (and they
are not aware how hard they are).

Just understand, when you raise legitimate concerns, that us
model-analyzers are also used to getting a lot of these unreasonable
demands too, and may be grumpy as a result.

Joe

Joe Felsenstein  j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,  
Box 5065, Seattle Wa 98195-5065


[[alternative HTML version deleted]]

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Morgan Langille
Hello everyone. I just wanted to say thank you for all of the
responses so far and have thoroughly enjoyed the discussion. Just for
reference I thought I would explain what I am doing in more detail and
why I asked my original question.

I am interested in developing a practical method for using 16s
sequences to infer function. This is to hopefully aid in metagenomic
experiments where we would like to compare the observed functions
(quantity of a particular protein family) in a metagenomic sample to
what we would expect based on the species that we believe are
present in the sample based on 16s sequences (or possibly some other
marker down the road). The current pipeline starts with a 16s
reference tree for all Archaea and Bacteria completed genomes (~1400
species). We know the functions within these genomes so I would like
to leverage that information along with the tree to predict (as best
as possible) what the functions would be for a newly placed species on
that tree.

One method would be simply to take the nearest neighbour (the
species with minimum 16s branch length) to the new species and use the
functions encoded in that genome as a representative. However, this is
very naive. I then turned to ancestral state reconstruction and
current methods (if any) for predicting characters for species that we
don't have information on for those traits.

I realize that caution has to be used for predicting these functions,
as mentioned below by Pasquale, but I am mostly searching for some
best practices to use in my current situation.
The results should be interesting since I will be testing how well the
method does across ~1 functions (e.g. PFAMs). Many of these are
not predictable at all since their phylogenetic signal is basically
nill due to horizontal gene transfer. However, I am optimistic that
many functions will be reliably predictable.


Sincerely,

Morgan Langille


On Fri, Aug 5, 2011 at 2:51 PM, Joe Felsenstein j...@gs.washington.edu wrote:

 Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or
 with the
 simple exercise I was proposing is well another: as a
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and
 elephants, huge
 flightless birds, to make a few examples). When you use methods
 like this (I
 mean Garland and Ives') and compare the output with those
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if
 they are
 nodes or tips, by using the expected (under BM) covariance the
 estimated
 phenotypes are dull, perfectly reasonable but very different from
 anything
 exceptional you may find yourself to work with. This is why I feel
 it is
 difficult to rely on those (unobserved) values to begin with.

 I think that what is being said is that Brownian Motion is too sedate
 a process
 and does not predict some of the large changes actually seen in the
 fossil
 record.

 That's a legitimate point but does put the onus on the maker of the
 point to
 propose some other stochastic process that is tractable and has these
 large
 changes (and that fits with known Mendelian and Darwinian mechanisms).
 Just complaining that the Brownian stochastic process is no good is
 insufficient.

 If we want to add the fossils to the calculation, then they will of
 course
 pressure the Brownian Motion process to change more in their vicinity,
 which may help some.

 Joe
 
 Joe Felsenstein      j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,
 Box 5065, Seattle Wa 98195-5065


        [[alternative HTML version deleted]]

 ___
 R-sig-phylo mailing list
 R-sig-phylo@r-project.org
 https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


[R-sig-phylo] R: Re: R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread pasquale.r...@libero.it


Hi All,

I'm happy I have stimulated some discussion about this subject matter. For 
some reason I can't imagine it looks this whole thing is going to be somehow 
personal and I have not posted this last e-mail to the list as a consequence. 
Joe, unfotunately I never attended a lecture of yours, and didn't raise trivial 
distinctions and objections to a grant proposal you submitted. My intention was 
not to be critical about BM or ICs, or whatever. I just wanted to point it out 
that things are sometimes a bit too complex and some unreliable predictions 
from our models may slip out unnoticed evey now and then, as I believe it is 
apparent reading the literature (including my own, of course). Having said 
that, my guess was that we *may* use the BM and computations at nodes to see 
where (in which lineages) do phenotypes appear very different from predictions. 
For instance, I think it could be somehow possible to use estimated ancestral 
charactes to see how much the inclusion of some fossil (or new) species changes 
the estimated value (e.g. by creating a polytomy by the inclusion of the new 
species), or even back-calculate branch lenghts (under BM assumption) for these 
unusual phenotypes to see how much evolution accelerates in these lineages (by 
comparison with real branch lengths).
I hope I spoke my mind more clearly at this time.
Pas





Messaggio originale
Da: dwba...@uchicago.edu
Data: 05/08/2011 20.23
A: Joe Felsensteinj...@gs.washington.edu
Cc: r-sig-phylo@r-project.orgr-sig-phylo@r-project.org
Ogg: Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips

As the diversity of explicit models of trait evolution grow, it will
be interesting to see if any consensus develops about which models
hold most often in general and whether any insight is gained into
which conditions predict appearance of different models.

I think Joe is right that realizing a model is an inaccurate or
imprecise description of reality should impel us to develop better
models of the world around us, because this partly how science moves
forward. However, I don't think pointing out that a model is deficient
requires that that person must themselves develop an alternative.
After all, an alternative model that capture a more realistic level of
complexity may not be possible in some situations (it is certainly
possible in trait evolution models, however.) Requiring such a thing
would put too much pressure on scientific whistle-blowers, who play a
very important role in reminding the rest of us that the world is more
than the models we use to understand it and make our predictions.

-Dave




On Fri, Aug 5, 2011 at 10:51 AM, Joe Felsenstein j...@gs.washington.edu 
wrote:

 Pasquale Raia said:

 Of course Ted is right, but my problem with this computation, or
 with the
 simple exercise I was proposing is well another: as a
 paleontologist I often
 come across pretty exceptional phenotypes (dwarf hippos and
 elephants, huge
 flightless birds, to make a few examples). When you use methods
 like this (I
 mean Garland and Ives') and compare the output with those
 phenotypes, as I did,
 you immediately realize what the the bottom line is: no matter if
 they are
 nodes or tips, by using the expected (under BM) covariance the
 estimated
 phenotypes are dull, perfectly reasonable but very different from
 anything
 exceptional you may find yourself to work with. This is why I feel
 it is
 difficult to rely on those (unobserved) values to begin with.

 I think that what is being said is that Brownian Motion is too sedate
 a process
 and does not predict some of the large changes actually seen in the
 fossil
 record.

 That's a legitimate point but does put the onus on the maker of the
 point to
 propose some other stochastic process that is tractable and has these
 large
 changes (and that fits with known Mendelian and Darwinian mechanisms).
 Just complaining that the Brownian stochastic process is no good is
 insufficient.

 If we want to add the fossils to the calculation, then they will of
 course
 pressure the Brownian Motion process to change more in their vicinity,
 which may help some.

 Joe
 
 Joe Felsenstein      j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,
 Box 5065, Seattle Wa 98195-5065


        [[alternative HTML version deleted]]

 ___
 R-sig-phylo mailing list
 R-sig-phylo@r-project.org
 https://stat.ethz.ch/mailman/listinfo/r-sig-phylo




-- 
David Bapst
Dept of Geophysical Sciences
University of Chicago
5734 S. Ellis
Chicago, IL 60637
http://home.uchicago.edu/~dwbapst/

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Joe Felsenstein

Folks --

I was intending my most recent message to be apologetic --
that I was perhaps overreactive.  Certainly Pas has not
raised unreasonable objections or been obstructive with
my grants! (Others have).

Let me raise an issue so I understand him more clearly:
Pas, are you saying that you see phenotypes in the fossils
that seem incompatible with the Brownian Motion assumption?

Joe

Joe Felsenstein  j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,  
Box 5065, Seattle Wa 98195-5065


[[alternative HTML version deleted]]

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: Re: R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Joe Felsenstein

Pas said:

 What I', trying to do now is writing a R routine to back-calculate  
 the expected branch lengths for the unusual critters, given the  
 fitted ancestral values and tip values of the phenotypes, and  
 assuming BM, in order to compare the actual branch lengths to the  
 expected. The ratio of these lengths, if I'm not delusional and  
 definitely lucky, is a per-lineage rate of phenotypic evolution.  
 The bottom line is answering the question: how long should the  
 branch leading to that particular species be if it evolved at the  
 same rate of its sister species?

Good way to approach it.  If you can calculate the likelihood of  
trees, one way would be to not bother fitting any ancestral values:  
just try different lengths for the branch that connects the fossil to  
the tree, and see which one maximizes the likelihood.

Joe

Joe Felsenstein  j...@gs.washington.edu
  Dept of Genome Sciences and Dept of Biology, Univ. of Washington,  
Box 5065, Seattle Wa 98195-5065


[[alternative HTML version deleted]]

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] R: Re: R: Re: R: ancestral state reconstruction for tips

2011-08-05 Thread Theodore Garland Jr
Hi Pas,
 
No worries, we have all done an accidental Reply All more than once!
 
I estimated ancestral (and tip) values for which I have real data via
BM assumption to see how good the fit is
 
Can you clarify?  Unless you have some a priori hypothesis to test about a 
particular tip (or set of tips, such as a whole clade), then why would you 
estimate their values and how would you do this?  Did you just delete one at a 
time, crank the numbers (presumably yielding the same values as you would get 
from Garland and Ives, 2000), and see what you got?
 
Whether it is a tip or a fossil taxon (which is just a tip with a branch that 
terminates before now), the confidence intervals on the predicted values will 
be hugely affected by how long the branch is to that taxon (the longer the 
branch, the wider the prediction intervals).  And also by how many close 
relatives are attached to the node it comes from, and by how much phenotypic 
diversity exists in those close relatives.
 
The bottom line is answering the question: how long should the branch
leading to that particular species be if it evolved at the same rate of its 
sister species?
 
That's an interesting way to look at it (a sort of inverse [perverse?] 
parameterization), but it does not give you any additional information beyond 
asking whether a taxon is an outlier via the tests we have discussed a bit 
ago.  Or am I missing something?
 
Cheers,
Ted
 
From: pasquale.r...@libero.it [pasquale.r...@libero.it]
Sent: Friday, August 05, 2011 12:38 PM
To: j...@gs.washington.edu
Cc: dwba...@uchicago.edu; hu...@si.edu; Theodore Garland Jr; 
r-sig-phylo@r-project.org
Subject: R: Re: R: Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction 
for tips


Folks,

I was intending my most recent message to be off-list and didn't realize 
r-sig-phylo@r-project.org was in the CC field, which means I'm a fool. All 
kidding aside, yes Joe, I estimated ancestral (and tip) values for which I have 
real data via BM assumption to see how good the fit is. Actually, estimated 
values are very close to real values for some species, barely so for some 
others, and absolutely not for others still. The good news is that since there 
is a single mode of evolution tree wise, deviations from real values really 
mean that evolution is accelerated, or decelerated, either, in these particular 
lineages for which a significant deviation from the expected value is 
noticeable. What I', trying to do now is writing a R routine to back-calculate 
the expected branch lengths for the unusual critters, given the fitted 
ancestral values and tip values of the phenotypes, and assuming BM, in order to 
compare the actual branch lengths to the expected. The ratio of these !
 lengths, if I'm not delusional and definitely lucky, is a per-lineage rate of 
phenotypic evolution. The bottom line is answering the question: how long 
should the branch leading to that particular species be if it evolved at the 
same rate of its sister species?
Pas






Messaggio originale
Da: j...@gs.washington.edu
Data: 05/08/2011 21.04
A: pasquale.r...@libero.itpasquale.r...@libero.it
Cc: dwba...@uchicago.edu, hu...@si.edu, theodore.garl...@ucr.edu, 
r-sig-phylo@r-project.orgr-sig-phylo@r-project.org
Ogg: Re: R: Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips




Folks --


I was intending my most recent message to be apologetic --
that I was perhaps overreactive.  Certainly Pas has not
raised unreasonable objections or been obstructive with
my grants! (Others have).


Let me raise an issue so I understand him more clearly:
Pas, are you saying that you see phenotypes in the fossils
that seem incompatible with the Brownian Motion assumption?


Joe

Joe Felsenstein  j...@gs.washington.edu
 Dept of Genome Sciences and Dept of Biology, Univ. of Washington, Box 5065, 
Seattle Wa 98195-5065
___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo