Thank you, dear Martin,
I wasn't aware of TreeDist; I'm going to explore it. :-)
Yes, RF has some limits, e.g. a single difference close to the root...
Sincerely,
V.

On Tuesday, 7 March 2023 12:23:07 CET, Martin R. Smith wrote:
> Dear Vojtěch,
> The performance of calculating Robinson-Foulds topological distances
> can be improved using the algorithm of Day (1985), which is implemented
> in the R package TreeDist:
> https://ms609.github.io/TreeDist/reference/Robinson-Foulds.html
> This will often be faster than dist.topo.
> (As noted there, the RF distance has certain issues, which mean that
> it is not always the most suitable measure of tree dissimilarity!)
> You can find ape's GitHub repository at
> https://github.com/emmanuelparadis/ape
> Martin
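
A minimal sketch of the suggestion above, assuming the ape and TreeDist packages are installed. The random trees and the variable names (trees, rf_ape, rf_td, cid) are illustrative only and do not come from the thread. ClusteringInfoDistance() is shown as one of the alternative measures that TreeDist offers; whether it suits a given data set is a separate question.

```r
library(ape)       # rmtree(), dist.topo()
library(TreeDist)  # RobinsonFoulds(), ClusteringInfoDistance()

set.seed(1)
# Illustrative input: 10 random unrooted trees with 20 tips each
trees <- rmtree(10, 20, rooted = FALSE)

# Pairwise topological distances with ape (default method "PH85")
rf_ape <- dist.topo(trees)

# Pairwise Robinson-Foulds distances with TreeDist
rf_td <- RobinsonFoulds(trees)

# An alternative tree distance from TreeDist
cid <- ClusteringInfoDistance(trees)

# Inspect a corner of the TreeDist result as a plain matrix
print(as.matrix(rf_td)[1:3, 1:3])
```

To compare speed on real data, each call can be wrapped in system.time().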
> > ----------------------------------------------------------------------
> > Message: 1
> > Date: Mon, 06 Mar 2023 14:43:04 +0100
> > From: Vojtěch Zeisek <vo...@trapa.cz>
> > To: mailinglist R <r-sig-phylo@r-project.org>
> > Subject: [R-sig-phylo] Parallelization in ape::dist.topo
> > Message-ID: <3551602.QO7bkq4lFn@veles>
> > Content-Type: text/plain; charset="utf-8"
> > 
> > Hello dear colleagues,
> > I often use ape::dist.topo (see dist.topo.r here), which does
> > the calculations sequentially and is therefore very slow for large
> > data sets. I'm sorry, I haven't found any relevant Git repository
> > or similar, so I hope Emmanuel won't mind if I discuss it here.
> > I discussed various options with ChatGPT, and dist.topo.par1.r is the
> > simplest solution, basically using mclapply instead of the 2 for loops.
> > It is good study material for how to do this in general. Small
> > enhancements are in dist.topo.par2.r, which should be slightly better
> > in case some pairwise comparison returns NA or similar, but in my
> > tests there doesn't seem to be any difference.
> > And finally there is dist.topo.par3.r, which doesn't load parallel
> > (and uses plain lapply) for cores==1, and uses parallel and
> > doParallel for multiple cores. It also contains some checks and error
> > handling. From my testing it works well. I'm not sure if tryCatch is
> > really needed there. In any case, improvements are welcome. :-)
> > So, what do you think? Is this a usable improvement of ape::dist.topo?
> > Sincerely,
> > V.
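
The idea in the quoted message (replacing the two nested for loops with one parallel apply over tree pairs) can be sketched roughly as follows. This is not the code from the attached dist.topo.par*.r files, which are not reproduced in this thread; pairwise_topo_dist is a hypothetical helper name, and the sketch assumes only the ape and parallel packages.

```r
library(ape)       # dist.topo(), rmtree()
library(parallel)  # mclapply()

# Hypothetical helper (not part of ape): pairwise topological distances,
# one dist.topo() call per pair of trees, spread over several cores.
pairwise_topo_dist <- function(trees, cores = 1L, method = "PH85") {
  n <- length(trees)
  # All index pairs (i, j) with i < j, one pair per column
  pairs <- utils::combn(n, 2)
  vals <- mclapply(seq_len(ncol(pairs)), function(k) {
    dist.topo(trees[[pairs[1, k]]], trees[[pairs[2, k]]], method = method)
  }, mc.cores = cores)
  # combn() enumerates pairs in the same order in which a "dist" object
  # stores its lower triangle, so the values can be used directly.
  structure(unlist(vals), Size = n, Labels = names(trees),
            Diag = FALSE, Upper = FALSE, class = "dist")
}

set.seed(42)
# Illustrative input: 8 random unrooted trees with 15 tips each
trees <- rmtree(8, 15, rooted = FALSE)

# mclapply() relies on forking, so more than one core is not
# available on Windows.
n_cores <- if (.Platform$OS.type == "windows") 1L else 2L
d_par <- pairwise_topo_dist(trees, cores = n_cores)

# Check against the sequential result
print(all.equal(as.numeric(d_par), as.numeric(dist.topo(trees))))
```

Calling dist.topo() once per pair repeats some work for every tree, so for small inputs the sequential version may well be faster; the sketch only illustrates the structure of the parallel loop.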
-- 
Vojtěch Zeisek
https://trapa.cz/en/

Department of Botany, Faculty of Science
Charles University, Prague, Czech Republic
https://www.natur.cuni.cz/biology/botany/
https://lab-allience.natur.cuni.cz/

Institute of Botany, Czech Academy of Sciences
Průhonice, Czech Republic
https://www.ibot.cas.cz/en/
Computing cluster
https://sorbus.ibot.cas.cz/en/start

_______________________________________________
R-sig-phylo mailing list - R-sig-phylo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-phylo
Searchable archive at http://www.mail-archive.com/r-sig-phylo@r-project.org/