Re: [R-sig-phylo] Multistate Trait Polymorphism

Joe Felsenstein Fri, 08 Apr 2011 05:02:13 -0700


Luke Harmon wrote:

Yes Joe is correct, there is more to this problem than meets the eye. My implementation assumes equal probability of each unknown state, which is quite different from modeling an actual polymorphic character. I'm sure that doing something different might matter in many cases.

Assuming equal probability of each possible state might be thought of as a model of ambiguity of state, not polymorphism. But even for that it is not a complete likelihood treatment. In likelihood machinery, one uses conditional likelihoods, which give a likelihood of 1 to each possible state. This is not as crazy as it sounds (see pages 255-256 of my book). It is simply that what we have in the conditional likelihoods is NOT the probability of the state, but the probability of the ambiguous observation given the state. So, for example, if we see a purine but don't know whether it is A or G (in a DNA sequence case), the probability of seeing purine, given that we only can see purineness or pyrimidineness, and the state really is A, is 1, and similarly if it is really G. So the conditional likelihoods for the four nucleotides are (1,0,1,0). Sounds wrong but it isn't.
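
The point about conditional likelihoods can be checked numerically. The following is only a minimal sketch, not code from any of the programs mentioned in this thread: it assumes the state order A, C, G, T, a Jukes-Cantor substitution model, uniform root frequencies, and a toy tree with just two tips. The function names are hypothetical and chosen for illustration.

```python
import math

STATES = ("A", "C", "G", "T")  # assumed ordering of the four nucleotides


def tip_vector(compatible):
    """Conditional likelihoods at a tip: P(observation | true state).

    Each state compatible with the observation gets 1, every other state 0.
    These are not probabilities of the states, so they need not sum to 1.
    """
    return [1.0 if s in compatible else 0.0 for s in STATES]


def jc69_prob(i, j, t):
    """Jukes-Cantor probability of going from state i to state j.

    t is the branch length in expected substitutions per site.
    """
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if i == j else 0.25 - 0.25 * e


def two_tip_likelihood(tip1, tip2, t1, t2):
    """Likelihood of two tips joined at a root with uniform root frequencies."""
    total = 0.0
    for r in range(4):
        down1 = sum(jc69_prob(r, s, t1) * tip1[s] for s in range(4))
        down2 = sum(jc69_prob(r, s, t2) * tip2[s] for s in range(4))
        total += 0.25 * down1 * down2
    return total


purine = tip_vector({"A", "G"})   # ambiguous observation: A or G
known_a = tip_vector({"A"})
known_g = tip_vector({"G"})
other_tip = tip_vector({"A"})     # second tip, observed without ambiguity

lik_purine = two_tip_likelihood(purine, other_tip, 0.1, 0.2)
lik_sum = (two_tip_likelihood(known_a, other_tip, 0.1, 0.2)
           + two_tip_likelihood(known_g, other_tip, 0.1, 0.2))
print(purine, lik_purine, lik_sum)
```

With the vector (1,0,1,0), the likelihood of the ambiguous observation equals the sum of the likelihoods for "really A" and "really G", which is the probability of seeing a purine. Using (0.5,0,0.5,0) instead would halve that value at this site.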


Polymorphism is totally different: you have actually seen both states.

For discrete 0/1 characters, one can use Sewall Wright's (1934) threshold model which I have discussed (briefly in the book and more extensively in a 2005 paper in the Philosophical Transactions of the Royal Society B). I have a paper under revision at a major journal about it and will release my program Threshml soon in a pre-PHYLIP version. Unlike Mark Pagel and Paul Lewis's Mk model, it predicts polymorphism in a natural way. The population has an underlying unobservable quantitative character, the "liability", that implies some frequency of both 0 and 1 states. I think Ted Garland and others also use a log-linear model that has somewhat similar properties but is not exactly the same.
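
To make concrete how a liability implies a frequency of each state, here is a rough sketch. It is not how Threshml works internally; it assumes, purely for illustration, that individual liabilities are normally distributed around the population mean with standard deviation 1, and that an individual shows state 1 when its liability exceeds a threshold of 0. The names `normal_cdf` and `state1_frequency` are hypothetical.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def state1_frequency(mean_liability, threshold=0.0, sd=1.0):
    """Fraction of the population whose liability exceeds the threshold.

    Assumes individual liabilities are Normal(mean_liability, sd**2);
    the threshold and sd defaults are illustrative, not estimated values.
    """
    return 1.0 - normal_cdf((threshold - mean_liability) / sd)


# A population whose mean liability sits near the threshold is polymorphic;
# one far from the threshold is nearly fixed for a single state.
for mean in (-3.0, -0.5, 0.0, 0.5, 3.0):
    print(mean, round(state1_frequency(mean), 4))
```

Under these assumptions a population shows both states at appreciable frequency only when its mean liability is within a standard deviation or two of the threshold, which is the sense in which the model produces polymorphism without extra machinery.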

To get these models to deal with multiple character states is possible but very very nontrivial. If you see states 0, 1, 2, is 1 intermediate between 0 and 2, or is it off at right angles to both? There are possible threshold models that could do either -- telling the difference between them requires lots of data. With, say, 6 states it would be a nightmare.


Joe
----
Joe Felsenstein, j...@gs.washington.edu
 Dept. of Genome Sciences, Univ. of Washington
 Box 355065, Seattle, WA 98195-5065 USA

_______________________________________________
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
