[R-sig-phylo] R: Re: R: ancestral state reconstruction for tips
Of course Ted is right, but my problem with this computation, or with the simple exercise I was proposing is well another: as a paleontologist I often come across pretty exceptional phenotypes (dwarf hippos and elephants, huge flightless birds, to make a few examples). When you use methods like this (I mean Garland and Ives') and compare the output with those phenotypes, as I did, you immediately realize what the the bottom line is: no matter if they are nodes or tips, by using the expected (under BM) covariance the estimated phenotypes are dull, perfectly reasonable but very different from anything exceptional you may find yourself to work with. This is why I feel it is difficult to rely on those (unobserved) values to begin with. Any opinion? Pas Messaggio originale Da: theodore.garl...@ucr.edu Data: 05/08/2011 18.24 A: Hunt, Genehu...@si.edu, r-sig-phylo@r-project.orgr-sig-phylo@r- project.org Ogg: Re: [R-sig-phylo] R: ancestral state reconstruction for tips The methods in the Garland and Ives (2000) paper are in our package of DOS PDAP programs, and should also be functional in the PDAP module of Mesquite. Cheers, Ted Theodore Garland, Jr. Professor Department of Biology University of California, Riverside Riverside, CA 92521 Office Phone: (951) 827-3524 Home Phone: (951) 328-0820 Facsimile: (951) 827-4286 = Dept. office (not confidential) Email: tgarl...@ucr.edu http://www.biology.ucr.edu/people/faculty/Garland.html Experimental Evolution: Concepts, Methods, and Applications of Selection Experiments Edited by Theodore Garland, Jr. and Michael R. Rose http://www.ucpress.edu/book.php?isbn=9780520261808 (PDFs of chapters are available from me or from the individual authors) From: r-sig-phylo-boun...@r-project.org [r-sig-phylo-boun...@r-project.org] on behalf of Hunt, Gene [hu...@si.edu] Sent: Friday, August 05, 2011 8:35 AM To: r-sig-phylo@r-project.org Subject: Re: [R-sig-phylo] R: ancestral state reconstruction for tips Also, the issue of predicting values for unknown tips using data from other species in the tree is considered in this reference: Garland, T., and A. R. Ives. 2000. Using the past to predict the present: confidence intervals for regression equations in phylogenetic comparative methods. American Naturalist 155(3):346-364. Best, Gene On 8/5/11 11:31 AM, pasquale.r...@libero.it pasquale.r...@libero.it wrote: Hi Morgan, this is just stuff for thought, and remember, this is wrong anyway. But you may try something like this: 1. compute pics, 2. take the pic value at the ancestral node subtending to your unknown tip, 3. pretend one of the two tips the pic was originally computed on is in fact your unknown species, 4. modify the square of the summed branch lengths of the two species by using the new bl, 5. use the formula for pics (standardized) to derive your unknown tip value by using the other (real) species tip value and the new square of summed branch lengthts but again, remember this is wrong, because contrasts were computed without your unknown species. With ace everything turns out to be much more complicated because ancestral value estimations are 'optimized' by taking the entire tree and distribution of values at once, so to speak. Messaggio originale Da: morgan.g.i.langi...@gmail.com Data: 05/08/2011 14.15 A: r-sig-phylo@r-project.org Ogg: [R-sig-phylo] ancestral state reconstruction for tips I was wondering if there is a way to get ancestral state reconstructions not for nodes within the tree but for tips that I don't know the trait of. I could do this somewhat manually, by taking the ancestral state resconstruction from the parent and child nodes surrounding where my unknown tip branches off from the tree and averaging those results (weighted by the branch length). This approach seems kind of clunky, so I was hoping there was something better. Morgan Langille ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo -- Gene Hunt Curator, Department of Paleobiology National Museum of Natural History Smithsonian Institution [NHB, MRC 121] P.O. Box 37012 Washington DC 20013-7012 Phone: 202-633-1331 Fax: 202-786-2832 http://paleobiology.si.edu/staff/individuals/hunt.cfm ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips
Pasquale Raia said: Of course Ted is right, but my problem with this computation, or with the simple exercise I was proposing is well another: as a paleontologist I often come across pretty exceptional phenotypes (dwarf hippos and elephants, huge flightless birds, to make a few examples). When you use methods like this (I mean Garland and Ives') and compare the output with those phenotypes, as I did, you immediately realize what the the bottom line is: no matter if they are nodes or tips, by using the expected (under BM) covariance the estimated phenotypes are dull, perfectly reasonable but very different from anything exceptional you may find yourself to work with. This is why I feel it is difficult to rely on those (unobserved) values to begin with. I think that what is being said is that Brownian Motion is too sedate a process and does not predict some of the large changes actually seen in the fossil record. That's a legitimate point but does put the onus on the maker of the point to propose some other stochastic process that is tractable and has these large changes (and that fits with known Mendelian and Darwinian mechanisms). Just complaining that the Brownian stochastic process is no good is insufficient. If we want to add the fossils to the calculation, then they will of course pressure the Brownian Motion process to change more in their vicinity, which may help some. Joe Joe Felsenstein j...@gs.washington.edu Dept of Genome Sciences and Dept of Biology, Univ. of Washington, Box 5065, Seattle Wa 98195-5065 [[alternative HTML version deleted]] ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips
As the diversity of explicit models of trait evolution grow, it will be interesting to see if any consensus develops about which models hold most often in general and whether any insight is gained into which conditions predict appearance of different models. I think Joe is right that realizing a model is an inaccurate or imprecise description of reality should impel us to develop better models of the world around us, because this partly how science moves forward. However, I don't think pointing out that a model is deficient requires that that person must themselves develop an alternative. After all, an alternative model that capture a more realistic level of complexity may not be possible in some situations (it is certainly possible in trait evolution models, however.) Requiring such a thing would put too much pressure on scientific whistle-blowers, who play a very important role in reminding the rest of us that the world is more than the models we use to understand it and make our predictions. -Dave On Fri, Aug 5, 2011 at 10:51 AM, Joe Felsenstein j...@gs.washington.edu wrote: Pasquale Raia said: Of course Ted is right, but my problem with this computation, or with the simple exercise I was proposing is well another: as a paleontologist I often come across pretty exceptional phenotypes (dwarf hippos and elephants, huge flightless birds, to make a few examples). When you use methods like this (I mean Garland and Ives') and compare the output with those phenotypes, as I did, you immediately realize what the the bottom line is: no matter if they are nodes or tips, by using the expected (under BM) covariance the estimated phenotypes are dull, perfectly reasonable but very different from anything exceptional you may find yourself to work with. This is why I feel it is difficult to rely on those (unobserved) values to begin with. I think that what is being said is that Brownian Motion is too sedate a process and does not predict some of the large changes actually seen in the fossil record. That's a legitimate point but does put the onus on the maker of the point to propose some other stochastic process that is tractable and has these large changes (and that fits with known Mendelian and Darwinian mechanisms). Just complaining that the Brownian stochastic process is no good is insufficient. If we want to add the fossils to the calculation, then they will of course pressure the Brownian Motion process to change more in their vicinity, which may help some. Joe Joe Felsenstein j...@gs.washington.edu Dept of Genome Sciences and Dept of Biology, Univ. of Washington, Box 5065, Seattle Wa 98195-5065 [[alternative HTML version deleted]] ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo -- David Bapst Dept of Geophysical Sciences University of Chicago 5734 S. Ellis Chicago, IL 60637 http://home.uchicago.edu/~dwbapst/ ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips
David Bapst wrote: I think Joe is right that realizing a model is an inaccurate or imprecise description of reality should impel us to develop better models of the world around us, because this partly how science moves forward. However, I don't think pointing out that a model is deficient requires that that person must themselves develop an alternative. After all, an alternative model that capture a more realistic level of complexity may not be possible in some situations (it is certainly possible in trait evolution models, however.) Requiring such a thing would put too much pressure on scientific whistle-blowers, who play a very important role in reminding the rest of us that the world is more than the models we use to understand it and make our predictions. As a theoretician, I am perhaps oversensitive to the folks who, after a lecture in which I present a simple model, triumphantly declare but you didn't include predator satiation. Then they walk away looking very pleased with themselves. There is a similar problem with the quibblers who inhabit grant review panels, and are always asking me to do much more complicated models that are impossibly hard (and they are not aware how hard they are). Just understand, when you raise legitimate concerns, that us model-analyzers are also used to getting a lot of these unreasonable demands too, and may be grumpy as a result. Joe Joe Felsenstein j...@gs.washington.edu Dept of Genome Sciences and Dept of Biology, Univ. of Washington, Box 5065, Seattle Wa 98195-5065 [[alternative HTML version deleted]] ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
Re: [R-sig-phylo] R: Re: R: ancestral state reconstruction for tips
Hello everyone. I just wanted to say thank you for all of the responses so far and have thoroughly enjoyed the discussion. Just for reference I thought I would explain what I am doing in more detail and why I asked my original question. I am interested in developing a practical method for using 16s sequences to infer function. This is to hopefully aid in metagenomic experiments where we would like to compare the observed functions (quantity of a particular protein family) in a metagenomic sample to what we would expect based on the species that we believe are present in the sample based on 16s sequences (or possibly some other marker down the road). The current pipeline starts with a 16s reference tree for all Archaea and Bacteria completed genomes (~1400 species). We know the functions within these genomes so I would like to leverage that information along with the tree to predict (as best as possible) what the functions would be for a newly placed species on that tree. One method would be simply to take the nearest neighbour (the species with minimum 16s branch length) to the new species and use the functions encoded in that genome as a representative. However, this is very naive. I then turned to ancestral state reconstruction and current methods (if any) for predicting characters for species that we don't have information on for those traits. I realize that caution has to be used for predicting these functions, as mentioned below by Pasquale, but I am mostly searching for some best practices to use in my current situation. The results should be interesting since I will be testing how well the method does across ~1 functions (e.g. PFAMs). Many of these are not predictable at all since their phylogenetic signal is basically nill due to horizontal gene transfer. However, I am optimistic that many functions will be reliably predictable. Sincerely, Morgan Langille On Fri, Aug 5, 2011 at 2:51 PM, Joe Felsenstein j...@gs.washington.edu wrote: Pasquale Raia said: Of course Ted is right, but my problem with this computation, or with the simple exercise I was proposing is well another: as a paleontologist I often come across pretty exceptional phenotypes (dwarf hippos and elephants, huge flightless birds, to make a few examples). When you use methods like this (I mean Garland and Ives') and compare the output with those phenotypes, as I did, you immediately realize what the the bottom line is: no matter if they are nodes or tips, by using the expected (under BM) covariance the estimated phenotypes are dull, perfectly reasonable but very different from anything exceptional you may find yourself to work with. This is why I feel it is difficult to rely on those (unobserved) values to begin with. I think that what is being said is that Brownian Motion is too sedate a process and does not predict some of the large changes actually seen in the fossil record. That's a legitimate point but does put the onus on the maker of the point to propose some other stochastic process that is tractable and has these large changes (and that fits with known Mendelian and Darwinian mechanisms). Just complaining that the Brownian stochastic process is no good is insufficient. If we want to add the fossils to the calculation, then they will of course pressure the Brownian Motion process to change more in their vicinity, which may help some. Joe Joe Felsenstein j...@gs.washington.edu Dept of Genome Sciences and Dept of Biology, Univ. of Washington, Box 5065, Seattle Wa 98195-5065 [[alternative HTML version deleted]] ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo ___ R-sig-phylo mailing list R-sig-phylo@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-phylo