Re: [R-sig-eco] distances in NMDS ordination space
Hi Kate, I think we should use the name Euclidean NMDS for the kind of NMDS we have in vegan, because its ordination space is strictly Euclidean. There is nothing non-metric in the ordination space. What is non-metric is the transformation of observed dissimilarities to optimize the fit to the Euclidean ordination space. The central term in the squared stress function is SUM (theta(d) - delta)^2 where delta are the Euclidean distances among points in the ordination, d are the observed community dissimilarities among sampling units, and theta() is a non-metric monotone function to transform d. Sum of squared differences is -- by definition -- squared Euclidean distance, and hence our kind of NMDS is Euclidean. The isolated axes are not meaningful *because* the space is Euclidean. Euclidean space is invariant under rotation: you can rotate axes and distances among points does not change, and you can rotate the axes and the configuration of points does not change. Any direction of axes is just as good. In vegan, the default is to rotate axes to principal components so that first dimension is longest. However, you can also rotate a dimension parallel to an environmental variable using function MDSrotate. These rotations do not change the stress, configuration or distances among points. Such a rotated dimension can be more meaningful and there is some justification in using that as a variable in some other analysis. To repeat: vegan NMDS is Euclidean NMDS and the NMDS ordination space is Euclidean. Because it is Euclidean, it is rotation-invariant and any rotation is equally good. Therefore axes do not have a natural orientation in Euclidean space. The only thing that is non-metric is the transformation of community dissimilarities. That non-metric transformation is made to optimize the goodness of fit to Euclidean ordination space. Cheers, Jari Oksanen On 16/07/2015, at 22:19 PM, Kate Boersma wrote: > Hi all. > > I have a methodological question regarding non-metric multidimensional > scaling. This is not specific to R. Feel free to refer me to another > venue/resource if there is one more appropriate to my question. > > Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS > frequently makes sense for community data, but it also means that distances > in NMDS ordination space cannot be interpreted simplistically as they can in > eigenvalue-based methods like PCA. This is why it is inadvisable > (meaningless) to use NMDS axes as response variables in a linear modeling > framework (e.g., with environmental variables as predictors). > > My question is this: Does that mean that it is also inadvisable to use > distances among points in ordination space as response variables? > > My (potentially flawed) understanding: While the coordinates may not make > sense in isolation, they should be meaningful relative to each other. In a 2D > ordination, if communities A & B are closer together in ordination space than > communities C & D, that means they have more similar species compositions. > Therefore, I should be able to predict the distance between points in a > linear modeling framework. > > Alternately, I could use the actual distances among communities from my > dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or > CCA for a reason. It seems more straightforward to use the distances from my > NMDS ordination instead of generating new coordinates from a PCoA to fit an > RDA framework (as in db-RDA)... but this logic only works if NMDS distances > are informative. > > Are these comparable analyses? If not, why not? > > I'd love your opinions. > > Thank you, > Kate > > -- > Kate Boersma, PhD > Department of Biology > University of San Diego > 5998 Alcala Park > San Diego CA 92110 > kateboer...@gmail.com > http://www.oregonstate.edu/~boersmak/ > > ___ > R-sig-ecology mailing list > R-sig-ecology@r-project.org > https://stat.ethz.ch/mailman/listinfo/r-sig-ecology ___ R-sig-ecology mailing list R-sig-ecology@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-ecology
Re: [R-sig-eco] distances in NMDS ordination space
Hi Kate, The Euclidean distances between points in the NMDS ordination are an approximation to the rank ordering of the original distances. Hence I would consider whether the (approximate) rank ordering of the original distances is the correct metric for the thing you want to include in your model. You would also need to consider the stress of the solution, the error in the mapping. I'm not convinced that NMDS distances are better than embedding the original distances in a Euclidean space using PCoA. Each has difficulties (ranks vs imaginary eigenvalues). HTH Gavin On 16 July 2015 at 13:19, Kate Boersma wrote: > Hi all. > > I have a methodological question regarding non-metric multidimensional > scaling. This is not specific to R. Feel free to refer me to another > venue/resource if there is one more appropriate to my question. > > Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS > frequently makes sense for community data, but it also means that distances > in NMDS ordination space cannot be interpreted simplistically as they can > in eigenvalue-based methods like PCA. This is why it is inadvisable > (meaningless) to use NMDS axes as response variables in a linear modeling > framework (e.g., with environmental variables as predictors). > > My question is this: Does that mean that it is also inadvisable to use > distances among points in ordination space as response variables? > > My (potentially flawed) understanding: While the coordinates may not make > sense in isolation, they should be meaningful relative to each other. In a > 2D ordination, if communities A & B are closer together in ordination space > than communities C & D, that means they have more similar species > compositions. Therefore, I should be able to predict the distance between > points in a linear modeling framework. > > Alternately, I could use the actual distances among communities from my > dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or > CCA for a reason. It seems more straightforward to use the distances from > my NMDS ordination instead of generating new coordinates from a PCoA to fit > an RDA framework (as in db-RDA)... but this logic only works if NMDS > distances are informative. > > Are these comparable analyses? If not, why not? > > I'd love your opinions. > > Thank you, > Kate > > -- > Kate Boersma, PhD > Department of Biology > University of San Diego > 5998 Alcala Park > San Diego CA 92110 > kateboer...@gmail.com > http://www.oregonstate.edu/~boersmak/ > > ___ > R-sig-ecology mailing list > R-sig-ecology@r-project.org > https://stat.ethz.ch/mailman/listinfo/r-sig-ecology > -- Gavin Simpson, PhD [[alternative HTML version deleted]] ___ R-sig-ecology mailing list R-sig-ecology@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-ecology
Re: [R-sig-eco] distances in NMDS ordination space
Kate — others can give you more in-depth information, but I believe (i.e., my students and I and colleagues believe it) NMDS does indeed use pairwise distance measures, in place of eigenvector calculations, in computing ordination scores. Some of these distance-based measures, like the Sorensen’s distance, are not true “metrics”, in that they do not obey the triangle inequality; hence “non-metric” scaling, but still fully quantitative with the scores being continuous variables. As such they can be used as response variables in OLS and other regression-type models. Others may correct me if I misspoke. As you probably know, there has been considerable heat generated in the ecological community over the relative value of distance-based vs. eigenvector methods for ordination. My sense from the debate is that when your community data are presence-absence the distance-based measures are more robust, but you will hear arguments against that too. > On Jul 16, 2015, at 12:19 PM, Kate Boersma wrote: > > Hi all. > > I have a methodological question regarding non-metric multidimensional > scaling. This is not specific to R. Feel free to refer me to another > venue/resource if there is one more appropriate to my question. > > Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS > frequently makes sense for community data, but it also means that distances > in NMDS ordination space cannot be interpreted simplistically as they can in > eigenvalue-based methods like PCA. This is why it is inadvisable > (meaningless) to use NMDS axes as response variables in a linear modeling > framework (e.g., with environmental variables as predictors). > > My question is this: Does that mean that it is also inadvisable to use > distances among points in ordination space as response variables? > > My (potentially flawed) understanding: While the coordinates may not make > sense in isolation, they should be meaningful relative to each other. In a 2D > ordination, if communities A & B are closer together in ordination space than > communities C & D, that means they have more similar species compositions. > Therefore, I should be able to predict the distance between points in a > linear modeling framework. > > Alternately, I could use the actual distances among communities from my > dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or > CCA for a reason. It seems more straightforward to use the distances from my > NMDS ordination instead of generating new coordinates from a PCoA to fit an > RDA framework (as in db-RDA)... but this logic only works if NMDS distances > are informative. > > Are these comparable analyses? If not, why not? > > I'd love your opinions. > > Thank you, > Kate > > -- > Kate Boersma, PhD > Department of Biology > University of San Diego > 5998 Alcala Park > San Diego CA 92110 > kateboer...@gmail.com > http://www.oregonstate.edu/~boersmak/ > > ___ > R-sig-ecology mailing list > R-sig-ecology@r-project.org > https://stat.ethz.ch/mailman/listinfo/r-sig-ecology ___ R-sig-ecology mailing list R-sig-ecology@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-ecology
[R-sig-eco] distances in NMDS ordination space
Hi all. I have a methodological question regarding non-metric multidimensional scaling. This is not specific to R. Feel free to refer me to another venue/resource if there is one more appropriate to my question. Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS frequently makes sense for community data, but it also means that distances in NMDS ordination space cannot be interpreted simplistically as they can in eigenvalue-based methods like PCA. This is why it is inadvisable (meaningless) to use NMDS axes as response variables in a linear modeling framework (e.g., with environmental variables as predictors). My question is this: Does that mean that it is also inadvisable to use distances among points in ordination space as response variables? My (potentially flawed) understanding: While the coordinates may not make sense in isolation, they should be meaningful relative to each other. In a 2D ordination, if communities A & B are closer together in ordination space than communities C & D, that means they have more similar species compositions. Therefore, I should be able to predict the distance between points in a linear modeling framework. Alternately, I could use the actual distances among communities from my dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or CCA for a reason. It seems more straightforward to use the distances from my NMDS ordination instead of generating new coordinates from a PCoA to fit an RDA framework (as in db-RDA)... but this logic only works if NMDS distances are informative. Are these comparable analyses? If not, why not? I'd love your opinions. Thank you, Kate -- Kate Boersma, PhD Department of Biology University of San Diego 5998 Alcala Park San Diego CA 92110 kateboer...@gmail.com http://www.oregonstate.edu/~boersmak/ ___ R-sig-ecology mailing list R-sig-ecology@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-ecology