Re: [R-sig-eco] distances in NMDS ordination space

2015-07-16 Thread Jari Oksanen
Hi Kate,

I think we should use the name Euclidean NMDS for the kind of NMDS we have in 
vegan, because its ordination space is strictly Euclidean. There is nothing 
non-metric in the ordination space. What is non-metric is the transformation of 
observed dissimilarities to optimize the fit to the Euclidean ordination space. 
The central term in the squared stress function is

SUM (theta(d) - delta)^2

where delta are the Euclidean distances among points in the ordination, d are 
the observed community dissimilarities among sampling units, and theta() is a 
non-metric monotone function to transform d. Sum of squared differences is -- 
by definition -- squared Euclidean distance, and hence our kind of NMDS is 
Euclidean. 

The isolated axes are not meaningful *because* the space is Euclidean. 
Euclidean space is invariant under rotation: you can rotate axes and distances 
among points does not change, and you can rotate the axes and the configuration 
of points does not change. Any direction of axes is just as good.

In vegan, the default is to rotate axes to principal components so that first 
dimension is longest. However, you can also rotate a dimension parallel to an 
environmental variable using function MDSrotate. These rotations do not change 
the stress, configuration or distances among points. Such a rotated dimension 
can be more meaningful and there is some justification in using that as a 
variable in some other analysis.

To repeat: vegan NMDS is Euclidean NMDS and the NMDS ordination space is 
Euclidean. Because it is Euclidean, it is rotation-invariant and any rotation 
is equally good. Therefore axes do not have a natural orientation in Euclidean 
space. The only thing that is non-metric is the transformation of community 
dissimilarities. That non-metric transformation is made to optimize the 
goodness of fit to Euclidean ordination space.

Cheers, Jari Oksanen

On 16/07/2015, at 22:19 PM, Kate Boersma wrote:

> Hi all.
> 
> I have a methodological question regarding non-metric multidimensional 
> scaling. This is not specific to R. Feel free to refer me to another 
> venue/resource if there is one more appropriate to my question.
> 
> Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS 
> frequently makes sense for community data, but it also means that distances 
> in NMDS ordination space cannot be interpreted simplistically as they can in 
> eigenvalue-based methods like PCA. This is why it is inadvisable 
> (meaningless) to use NMDS axes as response variables in a linear modeling 
> framework (e.g., with environmental variables as predictors).
> 
> My question is this: Does that mean that it is also inadvisable to use 
> distances among points in ordination space as response variables?
> 
> My (potentially flawed) understanding: While the coordinates may not make 
> sense in isolation, they should be meaningful relative to each other. In a 2D 
> ordination, if communities A & B are closer together in ordination space than 
> communities C & D, that means they have more similar species compositions. 
> Therefore, I should be able to predict the distance between points in a 
> linear modeling framework.
> 
> Alternately, I could use the actual distances among communities from my 
> dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or 
> CCA for a reason. It seems more straightforward to use the distances from my 
> NMDS ordination instead of generating new coordinates from a PCoA to fit an 
> RDA framework (as in db-RDA)... but this logic only works if NMDS distances 
> are informative.
> 
> Are these comparable analyses? If not, why not?
> 
> I'd love your opinions.
> 
> Thank you,
> Kate
> 
> -- 
> Kate Boersma, PhD
> Department of Biology
> University of San Diego
> 5998 Alcala Park
> San Diego CA 92110
> kateboer...@gmail.com
> http://www.oregonstate.edu/~boersmak/
> 
> ___
> R-sig-ecology mailing list
> R-sig-ecology@r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-ecology

___
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology


Re: [R-sig-eco] distances in NMDS ordination space

2015-07-16 Thread Gavin Simpson
Hi Kate,

The Euclidean distances between points in the NMDS ordination are an
approximation to the rank ordering of the original distances. Hence I would
consider whether the (approximate) rank ordering of the original distances
is the correct metric for the thing you want to include in your model. You
would also need to consider the stress of the solution, the error in the
mapping.

I'm not convinced that NMDS distances are better than embedding the
original distances in a Euclidean space using PCoA. Each has difficulties
(ranks vs imaginary eigenvalues).

HTH

Gavin

On 16 July 2015 at 13:19, Kate Boersma  wrote:

> Hi all.
>
> I have a methodological question regarding non-metric multidimensional
> scaling. This is not specific to R. Feel free to refer me to another
> venue/resource if there is one more appropriate to my question.
>
> Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS
> frequently makes sense for community data, but it also means that distances
> in NMDS ordination space cannot be interpreted simplistically as they can
> in eigenvalue-based methods like PCA. This is why it is inadvisable
> (meaningless) to use NMDS axes as response variables in a linear modeling
> framework (e.g., with environmental variables as predictors).
>
> My question is this: Does that mean that it is also inadvisable to use
> distances among points in ordination space as response variables?
>
> My (potentially flawed) understanding: While the coordinates may not make
> sense in isolation, they should be meaningful relative to each other. In a
> 2D ordination, if communities A & B are closer together in ordination space
> than communities C & D, that means they have more similar species
> compositions. Therefore, I should be able to predict the distance between
> points in a linear modeling framework.
>
> Alternately, I could use the actual distances among communities from my
> dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or
> CCA for a reason. It seems more straightforward to use the distances from
> my NMDS ordination instead of generating new coordinates from a PCoA to fit
> an RDA framework (as in db-RDA)... but this logic only works if NMDS
> distances are informative.
>
> Are these comparable analyses? If not, why not?
>
> I'd love your opinions.
>
> Thank you,
> Kate
>
> --
> Kate Boersma, PhD
> Department of Biology
> University of San Diego
> 5998 Alcala Park
> San Diego CA 92110
> kateboer...@gmail.com
> http://www.oregonstate.edu/~boersmak/
>
> ___
> R-sig-ecology mailing list
> R-sig-ecology@r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-ecology
>



-- 
Gavin Simpson, PhD

[[alternative HTML version deleted]]

___
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology


Re: [R-sig-eco] distances in NMDS ordination space

2015-07-16 Thread Don McKenzie
Kate — others can give you more in-depth information, but I believe (i.e., my 
students and I and colleagues believe it) NMDS does indeed use pairwise 
distance measures, in place of  eigenvector calculations, in computing 
ordination scores.  Some of these distance-based measures, like the Sorensen’s 
distance, are not true “metrics”, in that they do not obey the triangle 
inequality; hence “non-metric” scaling, but still fully quantitative with the 
scores being continuous variables.  As such they can be used as response 
variables in OLS and other regression-type models.

Others may correct me if I misspoke.  

As you probably know, there has been considerable heat generated in the 
ecological community over the relative value of distance-based vs. eigenvector 
methods for ordination.  My sense from the debate is that when your community 
data are presence-absence the distance-based measures are more robust, but you 
will hear arguments against that too.

> On Jul 16, 2015, at 12:19 PM, Kate Boersma  wrote:
> 
> Hi all.
> 
> I have a methodological question regarding non-metric multidimensional 
> scaling. This is not specific to R. Feel free to refer me to another 
> venue/resource if there is one more appropriate to my question.
> 
> Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS 
> frequently makes sense for community data, but it also means that distances 
> in NMDS ordination space cannot be interpreted simplistically as they can in 
> eigenvalue-based methods like PCA. This is why it is inadvisable 
> (meaningless) to use NMDS axes as response variables in a linear modeling 
> framework (e.g., with environmental variables as predictors).
> 
> My question is this: Does that mean that it is also inadvisable to use 
> distances among points in ordination space as response variables?
> 
> My (potentially flawed) understanding: While the coordinates may not make 
> sense in isolation, they should be meaningful relative to each other. In a 2D 
> ordination, if communities A & B are closer together in ordination space than 
> communities C & D, that means they have more similar species compositions. 
> Therefore, I should be able to predict the distance between points in a 
> linear modeling framework.
> 
> Alternately, I could use the actual distances among communities from my 
> dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA or 
> CCA for a reason. It seems more straightforward to use the distances from my 
> NMDS ordination instead of generating new coordinates from a PCoA to fit an 
> RDA framework (as in db-RDA)... but this logic only works if NMDS distances 
> are informative.
> 
> Are these comparable analyses? If not, why not?
> 
> I'd love your opinions.
> 
> Thank you,
> Kate
> 
> -- 
> Kate Boersma, PhD
> Department of Biology
> University of San Diego
> 5998 Alcala Park
> San Diego CA 92110
> kateboer...@gmail.com
> http://www.oregonstate.edu/~boersmak/
> 
> ___
> R-sig-ecology mailing list
> R-sig-ecology@r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-ecology



___
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology

[R-sig-eco] distances in NMDS ordination space

2015-07-16 Thread Kate Boersma

Hi all.

I have a methodological question regarding non-metric multidimensional 
scaling. This is not specific to R. Feel free to refer me to another 
venue/resource if there is one more appropriate to my question.


Correct me if I'm wrong: NMDS axes are non-metric, which is why NMDS 
frequently makes sense for community data, but it also means that 
distances in NMDS ordination space cannot be interpreted simplistically 
as they can in eigenvalue-based methods like PCA. This is why it is 
inadvisable (meaningless) to use NMDS axes as response variables in a 
linear modeling framework (e.g., with environmental variables as 
predictors).


My question is this: Does that mean that it is also inadvisable to use 
distances among points in ordination space as response variables?


My (potentially flawed) understanding: While the coordinates may not 
make sense in isolation, they should be meaningful relative to each 
other. In a 2D ordination, if communities A & B are closer together in 
ordination space than communities C & D, that means they have more 
similar species compositions. Therefore, I should be able to predict the 
distance between points in a linear modeling framework.


Alternately, I could use the actual distances among communities from my 
dissimilarity matrix with a method like db-RDA. But I used NMDS over RDA 
or CCA for a reason. It seems more straightforward to use the distances 
from my NMDS ordination instead of generating new coordinates from a 
PCoA to fit an RDA framework (as in db-RDA)... but this logic only works 
if NMDS distances are informative.


Are these comparable analyses? If not, why not?

I'd love your opinions.

Thank you,
Kate

--
Kate Boersma, PhD
Department of Biology
University of San Diego
5998 Alcala Park
San Diego CA 92110
kateboer...@gmail.com
http://www.oregonstate.edu/~boersmak/

___
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology