I don't think the issue is really the choice of Procrustes
vs other metrics. Unless one is comfortable just looking at numbers,
one needs to make plots to see what patterns, groupings, etc. there
may be in ones data. I agree that the points I raised apply  to most
any multivariate analysis for whatever metric one chooses. The
problem of seeing what is going on in high-dimensional spaces is
quire general. Once one gets past the various tests of significance in a
multivariate analysis one usually wants to try to _see_ what is going on
in ones data and that usually requires looking at some sort of
low-dimensional plots of the data. This does not imply that one wishes
to find clusters as in some of the early numerical taxonomy
applications. It is just a question of trying to see what is going on in
high-dimensional data from looking at a low-dimensional summary. 

It is routine now to perform an analysis of relative warps
(perhaps as a PCA of partial warp scores + uniform) and retain just
the first two or three dimensions because that is all one can plot at
one time. These plots are examined in order to look for patterns or
trends of variation in shape. In these ordination plots distances
between points no longer match the Procrustes distances between the
corresponding shapes since the space is just a projection from a
higher-dimensional space. In a PCA, shapes that are very different will
be shown as far apart in the ordination but some points shown as close
together may actually be rather different in shape. For that reason,
statistical tests etc. should be performed in the original
high-dimensional space and not in the low-dimensional ordination (as one
sometimes sees in Applications). 

The question is what information would you prefer to lose
when you prepare a low-dimensional plot for visual examination. PCA
has one choice built into it of what information should be preserved
and what should be discarded. Non-metric MDS is based on another
choice. A good-fitting non-metric MDS may preserve more useful
information about the original Procrustes distances than does PCA. Often
it is pretty obvious which shapes are most different so it
seems to me, to be more useful to have ordinations that preserve more
subtle differences. It is often instructive to plot distances in
the 2 or 3-dimensional ordination against the distances in the
full data. One can then see at a glance the distances that are
faithfully reproduced vs. those that are not. Sometimes the differences
are very interesting and help one decide for a particular
dataset whether one would lose too much by going from a metric
method such as PCA to a non-metric MDS.

Jim

-----------------------
F. James Rohlf, Distinguished Professor & Graduate Program
Director
State University of New York, Stony Brook, NY 11794-5245
www: http://life.bio.sunysb.edu/ee/rohlf  

> -----Original Message-----
> From: morphmet
[mailto:[EMAIL PROTECTED] 
> Sent: Saturday, October 01, 2005 12:49 PM
> To: morphmet
> Subject: RE: PWS and MDS?
> 
>                                        October 1, 2005
> 
>         I thank Jim Rohlf for his prompt and very
interesting comment.
>  He is correct, of course, that any MDS technique that  
> relaxes the requirement of matching original distances
must 
> at  the same time relax the identity of the resulting 
> ordinations  with PCOORD plots of the variables
contributing 
> to the original  distances.  It is a tribute to his
longevity 
> in our field that  the primary paper he cites is dated
1972.  
> At that time I was  still a graduate student of social 
> theory. It was years before  I became a statistician, and
in 
> going back to pick up the  literature I'd overlooked in a 
> misspent youth I never encountered  Rohlf's indeed quite 
> pertinent paper before yesterday's post.
>         But that dating of 1972 is also a useful cue for
what 
> is  (I hope) a useful continuing comment.  1972 was long 
> before  the onset of useful shape coordinates -- indeed 
> before the  original Kendall publication of 1977 (that
none 
> of us saw  for years) that set the Procrustes metric on
the 
> road to  its multivariate implementation.  Is it
appropriate 
> to  apply a technique from 1972 to shape variables
invented  
> in the 1990's?  What assumptions does the 1972 method
make 
> about the origins of the data to which it is being
applied?
>         Jim's comment would apply to _any_ multivariate
sum 
> of squares, arising from any vector-valued data matrix
>  -- shape coordinates, measured distances, measured
angles,  
> log-distances.  It does not even assume that the metric
_is_ 
> a sum of  squares. Like the rest of the MDS family of 
> ordinations,  it can work (meaning: it can produce useful 
> ordination plots)  when applied to  Manhattan metrics,
string 
> matching measures, perceived  similarity from subjects in 
> psychology experiments, whatever.  
>  But shape coordinates are not like that.  They arise from

> an optimizing construction in the original Cartesian  
> geometry of the image data, and relaxing the distance
alters 
> the conceptual context according to which that choice  of 
> distance itself came to be justified.
>         Remember that the Procrustes family of techniques
has 
>  two exactly equipotent purposes: sorting the organisms
(the 
> task we call ordination), but also ordinating the  space
of 
> possible measurements (the space of "shape  variables"
dual 
> to the coordinate space).  The underlying  beauty of the 
> Procrustes toolkit is its reinterpretation of  the
original 
> Kendall geometry in a multivariate context that  makes
this 
> duality possible.  Remember, too, that MDS  itself arose 
> originally in the social sciences, where the  data to
which 
> it applied do not have any geometry of their own.
>         There is no natural match between the two
approaches 
> to  the meaning of "distances," which  correspond to two 
> completely different styles of statistical science.
>  The relaxation of the distance metric that Jim published
in 
> 1972, as applied to Procrustes shape coordinates (or, for

> that matter, to the new size-shape coordinates), breaks
the  
> connection between those two purposes.  While the result
of  
> a metric MDS on shape coordinates are interpretable as  
> rotations of the Procrustes shape coordinates, the axes
of a 
> nonmetric MDS are not interpretable as shape variables  at

> all.  One arrives at an ordination, yes, wherein nearby  
> specimens are more similar, are perhaps usefully clustered
-- 
Replies will be sent to the list.
For more information visit http://www.morphometrics.org

Reply via email to