[R-sig-phylo] Deadline extension for full talk abstracts for Conference on Informatics for Phylogenetics, Evolution, and Biodiversity (iEvoBio)

2011-03-15 Thread Hilmar Lapp
The deadline for submitting abstracts for full talks to the inaugural  
conference on Informatics for Phylogenetics, Evolution, and  
Biodiversity (iEvoBio) has been extended by one week to Friday, March  
25, 2011. We hope that this will give those interested in contributing  
sufficient time to do so.


Please see http://bit.ly/iEvoBio2011_CfA for the full Call for  
Abstracts and author instructions. We still expect to be able to  
notify accepted talks in time for the early registration deadline of  
iEvoBio (and Evolution).


More details about the program and guidelines for contributing content  
are available at http://ievobio.org.  You can also find continuous  
updates on the conference's Twitter feed at http://twitter.com/ 
iEvoBio, and there is a Google group you can join at http://groups.google.com/group/ievobio-announce 
 to receive announcements.


iEvoBio is sponsored by the US National Evolutionary Synthesis Center  
(NESCent) in partnership with the Society for the Study of Ecolution  
(SSE) and the Society of Systematic Biologists (SSB). Additional  
support has been provided by the Encyclopedia of Life (EOL).


The iEvoBio 2011 Organizing Committee:
Rob Guralnick (University of Colorado at Boulder) (Co-chair)
Cynthia Parr (Encyclopedia of Life) (Co-chair)
Dawn Field (UK National Environmental Research Center)
Mark Holder (University of Kansas)
Hilmar Lapp (NESCent)
Rod Page (University of Glasgow)

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] bootstrapping with boot.phylo()

2011-03-15 Thread Klaus Schliep
Dear Gontran,

for short sequences such a result can happen. Maybe you check whether
any values in
dist.dna(seq100b, model = raw)
are close to 0.75.
I suspect that some elements in this distance matrix are close to 0.75
than pairwise distances for the bootstrap samples are likely to be
equal or greater than 0.75. Most models (K80, JC69 etc.) are not
defined for distances =0.75 and will return Inf or NaN (the 0.75 can
vary a bit, depending on the substitution model). bionj of course does
not like building trees from infinite values as input. With short
sequences the variances are of course larger and you are more likely
to observe this, that's why your larger data set works fine.
However in this cases NaN or Inf are the correct results!
It would be nice to catch this error with a try and use only trees
from finite distance matrices or set infinite values to a large value.
But one should return a warning as these samples are likely to be
biased.

Kind regards,
Klaus




On 3/15/11, Gontran Sonet gontran.so...@naturalsciences.be wrote:

 Dear all,
 When I make a bootstrapping analysis of relatively short DNA sequences
 (100bp), I have an error message:
 Error in bionj(dist.dna(seq100b, model = K80, pairwise.deletion =
 FALSE,  :
NA/NaN/Inf in foreign function call (arg 1)

 I am not able to solve the problem since:
 1)This short sequence data set is part of a larger data set where
 bootstrapping works perfectly using the same commands:

 mytre-bionj(dist.dna(seq100b,model=K80,pairwise.deletion=FALSE,as.matrix=TRUE))
  boot.phylo(mytre,seq100b,FUN=function(seq100b) {
 bionj(dist.dna(seq100b,model=K80,pairwise.deletion=FALSE,
 as.matrix=TRUE))})

 2)The problematic data set doesn't contain any - or ? or N, or
 ambiguities and the dist.dna() doesn't produce any NA or missing value.
seq100b
 396 DNA sequences in binary format stored in a matrix (.
 All sequences of same length: 100
 Labels: XX, XX, XX
 Base composition:
  a c g t
 0.224 0.281 0.222 0.273

 3)This data set also works in other programs (like Mega) without any
 problem.

 Thank you very much for your help

 Gontran Sonet

 ___
 R-sig-phylo mailing list
 R-sig-phylo@r-project.org
 https://stat.ethz.ch/mailman/listinfo/r-sig-phylo



-- 
Klaus Schliep
Université Paris 6 (Pierre et Marie Curie)
9, Quai Saint-Bernard, 75005 Paris

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo


Re: [R-sig-phylo] bootstrapping with boot.phylo()

2011-03-15 Thread Joe Felsenstein

Klaus Schliep wrote --

 I suspect that some elements in this distance matrix are close to 0.75
 than pairwise distances for the bootstrap samples are likely to be
 equal or greater than 0.75. Most models (K80, JC69 etc.) are not
 defined for distances =0.75 and will return Inf or NaN (the 0.75 can
 vary a bit, depending on the substitution model). bionj of course does
 not like building trees from infinite values as input. With short
 sequences the variances are of course larger and you are more likely
 to observe this, that's why your larger data set works fine.
 However in this cases NaN or Inf are the correct results!

I often have to deal with users of my PHYLIP package who
are upset at this happening with large distances when
the sequences are bootstrapped.  (In my package the distance is
set to -1.0 in that case, and it should not be used to make a tree).
NaN is not the correct value, but Inf is -- the correct distance
for (say) the Jukes-Cantor model or the Kimura 2-parameter model
when the sequences differ by more than 75% is (positive) infinity,
since these are inferred to be unrelated sequences.

 It would be nice to catch this error with a try and use only trees
 from finite distance matrices or set infinite values to a large value.
 But one should return a warning as these samples are likely to be
 biased.

Exactly.

J.F.

Joe Felsenstein j...@gs.washington.edu
 Department of Genome Sciences and Department of Biology,
 University of Washington, Box 355065, Seattle, WA 98195-5065 USA

___
R-sig-phylo mailing list
R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo