Re: [MORPHMET2] Questions about Kendall’s shape space and tangent space projection

Leandro Rabello Monteiro Thu, 09 Sep 2021 15:06:04 -0700

Also, if you have a mathematics background, I would recommend Goodall 
(1991) (https://doi.org/10.1111/j.2517-6161.1991.tb01825.x), besides the 
papers from Kendall in the 1980s. I have colleagues that are mathematicians 
that were able to understand what it was all about in a couple of days 
after reading this material. 
Leandro


Em quinta-feira, 9 de setembro de 2021 às 10:36:19 UTC-3, 
f.jame...@stonybrook.edu escreveu:

> I was a little vague there. Unfortunately there are several relevant 
> distances. Usually best to think of Procrustes distances in terms of angles 
> or great circle distances. Max between two shapes is pi/2.  Distances in 
> the tangent space are linear but with a maximum of 2 for shapes with 
> projections maximally far apart - but misleading because they are actually 
> the same shape (Procrustes distance of almost zero). Points around the 
> outer limits of the hemisphere have a Euclidean distance of 1 to the 
> reference triangle but a Procrustes distance of pi/2 = 1.57. Problem of 
> trying to project surface of a sphere onto a flat map. I shouldn't have 
> tried to simplify so much. Looking at pictures may be safer than words.
>
> It is less confusing for the range found in most biological data where 
> shape variation is fortunately usually "small".
>
>
> *F. James Rohlf                                    *
> Distinguished Professor, Emeritus and Research Professor
> Depts: Anthropology and Ecology & Evolution
> Stony Brook University
>
> On 9/9/2021 5:36:16 AM, Karolin Engelkes <karolin....@gmail.com> wrote:
> Thank you all so much for the literature recommendations and, Prof. Rohlf, 
> for the explanations! Especially the part on the PCA and the patterns of 
> distribution in the different spaces is very helpful for me, but also gives 
> me a lot to think about. So please give me some time to go through all of 
> it and through the literature and digest it.
>
> @ Prof. Rohlf: One question immediately comes to my mind that is related 
> to your statement that the “distance in the tangent space will be a little 
> larger [than the distances on the hemisphere] due to the projection“. Does 
> this refer to an orthogonal or a stereographic projection? I would expect 
> the opposite for an orthogonal projection (at least from taking a ruler and 
> measuring the linear distances between the “reference” [intersection of the 
> hemisphere with the y-axis] and, respectively, the points B, C, and D in 
> your Fig. 4 in the paper from 1999).
>
> Thanks again and best wishes,
> Karo
>
> Am Do., 9. Sept. 2021 um 09:54 Uhr schrieb mahendiran mylswamy <
> mahe...@gmail.com>:
>
>> Another nice piece of writing 'on Shape' Theory by Prof. Mac leod, pdf 
>> attached, may help to understand and appreciate the concepts well.
>>
>> On Thu, Sep 9, 2021 at 7:35 AM f.jame...@stonybrook.edu <
>> f.jame...@stonybrook.edu> wrote:
>>
>>> Good questions. The topic can be confusing and difficult to visualize - 
>>> especially for 3D landmark data.
>>>
>>> The 1999 paper in the Journal of Classification  that Adams mentions was 
>>> my attempt to describe its practical relevance in morphometrics. My 1999 
>>> paper in Hystrix may also be of interest to you. You may wish to play with 
>>> my tpsTri software also as it was used for some of the figures in those 
>>> papers.
>>>
>>> Easiest to at first just think of variation in shapes of triangles in 2 
>>> dimensions. As you mention, Kendall's shape space for triangles can be 
>>> visualized as the surface of a sphere of radius 1/2. What convinced me 
>>> of the importance of Kendall's shape space was that Kendall showed that the 
>>> distribution of all possible triangles was a uniform distribution in 
>>> Kendall's shape space.  After a GPA the distribution of shapes is on a 
>>> hemisphere of radius 1 (corresponding to centroid size of 1).  I am not 
>>> sure if anyone has given a good name for this hemisphere corresponding to 
>>> all possible triangles Procrustes aligned to any single shape. In 1999 I 
>>> called it a "preshape space of triangles aligned to a reference triangle". 
>>> Not very snappy!  Perhaps I should have called it something like the "Slice 
>>> hemisphere"  as Dennis Slice first showed it to me and was puzzled why it 
>>> was not a surface of a sphere. 
>>>
>>> The distribution of triangles is not, however, uniform on the surface of 
>>> the hemisphere. As you mention, conventional multivariate methods assume 
>>> linear spaces and use linear matrix algebra. The tangent space is as you 
>>> mention the projection of points from the surface of the hemisphere onto a 
>>> plane that passes just through the point on the hemisphere that corresponds 
>>> to the reference triangle (not really a "reference" just the mean shape in 
>>> practical applications). I have called it Kendall's tangent space but I 
>>> probably should have named it after John Kent as he influenced my 
>>> understanding. Something I found fascinating was that the distribution of 
>>> all possible triangles was again uniform in the projection within the 
>>> circular distribution (Kendall showed that also). The importance, to me, of 
>>> being uniform was that if I see some pattern in the distribution (clusters, 
>>> covaniance, etc.) then it implies something about the distribution of 
>>> shapes not just an artifact of the mathematical operations used to create 
>>> the projection (as in the case of EDMA and some other earlier  statistical 
>>> approaches suggested for analyzing shape variation).
>>>
>>>  Of course, for very small variation in shape the points will be close 
>>> to their mean so that distances on the surface of the hemisphere (thus 
>>> close to their projections) and distances in the tangent space will be very 
>>> similar (though distance in the tangent space will be a little larger due 
>>> to the projection). Might be good enough for some studies but if one does 
>>> not do the projection then you will find that a PCA of your GPA  aligned 
>>> data (assuming large n > 2p-4) will not yield 4 zero eigenvalues as it 
>>> should with centering, rotation, and size removed. Only 3 will be zero 
>>> (i.e., computationally numbers like 10^-15 or so). The 4th smallest might 
>>> be "only" 10^-8 or so. That is a result of the curved shape of the 
>>> hemisphere. If you do the projection then the last 4 eigenvalues will be 
>>> essentially zero as the curvature is now gone.
>>>
>>> An alternative is to perform the multivariate analysis directly in 
>>> Kendall's shape space. Kent (I cannot locate the references right now but 
>>> it was in early 2000s) showed one could, for example, perform a 
>>> generalization of a PCA directly in the curved surface. Some odd properties 
>>> as eigenvectors were great circles on the curved surfaces as I remember).
>>>
>>> One can generalize, of course from triangles to shapes with more 
>>> landmarks and to landmarks in 3 dimensions. The 3-dimensional case is more 
>>> complicated to try to visualize because the simplest case requires 5 
>>> dimensions to represent not just 3 so one cannot just look at the space. It 
>>> also has some more complicated properties. You could look at the book 
>>> "Shape & shape theory" by Kendall, Barden, and Le (1999). It presents a way 
>>> to visualize variation in the 5-dimensional shape space. Was not an easy 
>>> read for me but perhaps I should try again!
>>>
>>> This distinction between shape space and tangent space is not of much 
>>> importance in practical applications where biological variation tends to be 
>>> small compared to all possible variation among p landmarks and because one 
>>> usually only looks at the distribution along the first few eigenvectors 
>>> with the largest eigenvalues but I prefer to have computations match what 
>>> one expects theoretically rather than just being good approximations. When 
>>> programming being just "close enough" could hide subtle bugs. Getting rid 
>>> of that known artifact also allows one to try to possibly interpret those 
>>> eigenvectors with the smallest eigenvalues as they correspond to the most 
>>> stable aspects of possible shape variation (i.e., least varying due to 
>>> development, environment, etc.). 
>>>
>>> Does this help or confuse more?
>>>
>>> *F. James Rohlf                                    *
>>> Distinguished Professor, Emeritus and Research Professor
>>> Depts: Anthropology and Ecology & Evolution
>>> Stony Brook University
>>>
>>> On 9/8/2021 2:26:38 PM, Adams, Dean [EEOB] <dca...@iastate.edu> wrote:
>>>
>>> Karolin,
>>>
>>>  
>>>
>>> A reading of Rohlf 1999 may help.
>>>
>>>  
>>>
>>> Dean
>>>
>>>  
>>>
>>> Rohlf, F.J. 1999. Shape statistics: Procrustes superimpositions and 
>>> tangent spaces. Journal of Classification. 16:197-223.
>>>
>>>  
>>>
>>> Dr. Dean C. Adams
>>>
>>> Distinguished Professor of Evolutionary Biology
>>>
>>> Director of Graduate Education, EEB Program
>>>
>>> Department of Ecology, Evolution, and Organismal Biology
>>>
>>> Iowa State University
>>>
>>> https://faculty.sites.iastate.edu/dcadams/
>>>
>>> phone: 515-294-3834 <(515)%20294-3834>
>>>
>>>  
>>>
>>> *From:* morp...@googlegroups.com <morp...@googlegroups.com> *On Behalf 
>>> Of *karolin....@gmail.com
>>> *Sent:* Tuesday, September 7, 2021 7:04 AM
>>> *To:* Morphmet <morp...@googlegroups.com>
>>> *Subject:* [MORPHMET2] Questions about Kendall’s shape space and 
>>> tangent space projection
>>>
>>>  
>>>
>>> Dear Morphometricians,
>>>
>>> I am currently trying to understand the mathematical backgrounds of 
>>> landmark-based geometric morphometrics. Some questions arose that we could 
>>> not answer during discussions in our lab which is why I hope you can help - 
>>> many thanks in advance!
>>>
>>> The first question is: What exactly is “Kendall’s shape space”? If I 
>>> understand Kendall’s (1984) statement in Eq. 4 correctly, the shape space 
>>> is a quotient space; the elements are equivalence classes of pre-shapes (a 
>>> fiber on the pre-shape sphere becomes one element in shape space). The 
>>> elements of the equivalence classes have less “coordinates” (vector 
>>> elements) than the original landmark configuration and lie on a 
>>> hyperdimensional sphere with a radius of 1. In Theorem 2 Kendall (1984) 
>>> states that the shape space for triangles is isometric to a 
>>> three-dimensional sphere with a radius of 0.5. The triangles on this sphere 
>>> with a radius of 0.5 are represented by three Cartesian coordinates that 
>>> are calculated from the original landmark configuration (Kendall 1984, 
>>> section 5), whereas the triangles are represented by equivalence classes in 
>>> shape space.
>>> In several publications I now find illustrations of a hemisphere of 
>>> radius 1 and a sphere of radius 0.5 (both share one point at the pole); 
>>> those publications usually use the full landmark set. The sphere of radius 
>>> 0.5 is often termed “Kendall’s shape space” (sometimes with a reference to 
>>> triangles, sometimes not). So, how does this fit with the definitions and 
>>> statements in Kendall (1984)? Is there a publication that extends Kendall 
>>> (1984) to the use of full landmark configurations and explains how they are 
>>> (mathematically) related to the sphere with radius 0.5 (for all numbers and 
>>> dimensions of landmarks)? Related to this question: what do the points on 
>>> the sphere of radius 0.5 in those publications look like? Are they 
>>> equivalence classes, full landmark configurations, or 3 cartesian 
>>> coordinates representing triangles? Are they really scaled to unit centroid 
>>> size as the shapes on the pre-shape sphere [= elements of equivalence 
>>> classes in shape space]? 
>>>
>>> The second question is: Why do we need a tangent space projection? I 
>>> understand that the superimposed landmark configurations lie on a 
>>> hyper-hemisphere and I know the argument that standard statistical 
>>> procedures need a linear space. Yet, the superimposed landmark 
>>> configurations are matrices or vectors, depending on how they are 
>>> formatted, for which we can compute Euclidean distances. Where exactly do 
>>> the statistical tests go wrong if we use the superimposed landmark 
>>> configurations without tangent space projection and calculate Euclidean 
>>> distances?
>>> If I, for example, think about MANOVAs as suggested by Anderson (2001, 
>>> Austral Ecology), I guess that the mean shapes of the groups need to be 
>>> calculated to be able to calculate the different sums of squares. If the 
>>> mean “shape” is calculated by group-wisely simply calculating the mean of 
>>> each of the coordinates, the resulting mean “shape” of each group lies 
>>> within the hyper-hemisphere of radius 1. So the mean “shape” is not a shape 
>>> because the centroid size is not standardized. Yet, if I got all distance 
>>> calculations correctly (see attached R-script 
>>> “Compare_distance_measures_in_original_and_tangent_space.R”), I find that 
>>> the Euclidean distances between the mean “shapes” inside the 
>>> hyper-hemisphere are slightly closer to the corresponding Procrustes 
>>> distances than the Euclidean distances in tangent space; the Procrustes 
>>> distances have been calculated by rescaling the mean “shapes” to unit 
>>> centroid size followed by determining the arc length between them. If the 
>>> mean “shapes” inside the hyper-hemisphere are rescaled to unit centroid 
>>> size, then the Euclidean distances between them are even closer to the 
>>> Procrustes distances.
>>> In addition, if I simulate groups of landmark configurations, 
>>> superimpose them without and with tangent space projection, and test for 
>>> significant differences between the groups, I feel that the decision on the 
>>> significance of the group differences is correct slightly more often if it 
>>> is based on the superimposed landmark sets without tangent space projection 
>>> (not exhaustively or formally tested; see R-script 
>>> “compare_ProcrustesMANOVA_in_original_and_tangent_space.R”).
>>>
>>> And one last, more general question: If all landmark configurations are 
>>> superimposed onto a common mean shape, does this also minimize the 
>>> Procrustes distances (measured as arc length) between all pairs of landmark 
>>> configurations and between the mean shapes of sub-groups of landmark 
>>> configurations? 
>>>
>>> Thanks a lot for your insights!
>>>
>>> Kind regards
>>> Karo
>>> -- 
>>> Dr. Karolin Engelkes
>>> Institute of Evolutionary Biology and Animal Ecology
>>> University of Bonn
>>>
>>> Phone: +49 (0) 228 73 5481 <+49%20228%20735481>
>>>
>>> An der Immenburg 1
>>> 53121 Bonn
>>> Germany
>>>
>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "Morphmet" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to morphmet2+...@googlegroups.com.
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/morphmet2/ac177d3d-b28a-41b2-bfb6-a7a9f8cd73d0n%40googlegroups.com
>>>  
>>> <https://groups.google.com/d/msgid/morphmet2/ac177d3d-b28a-41b2-bfb6-a7a9f8cd73d0n%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "Morphmet" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to morphmet2+...@googlegroups.com.
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/morphmet2/Mailbird-a92a6485-db7d-4c94-82a2-397208a82e0f%40stonybrook.edu
>>>  
>>> <https://groups.google.com/d/msgid/morphmet2/Mailbird-a92a6485-db7d-4c94-82a2-397208a82e0f%40stonybrook.edu?utm_medium=email&utm_source=footer>
>>> .
>>>
>>
>>
>> -- 
>> ***************************************
>> M Mahendiran, Ph D
>> Sr. Scientist - Division of Wetland Ecology
>> Salim Ali Centre for Ornithology and Natural History (SACON)
>> Anaikatti (PO), Coimbatore - 641108, TamilNadu, India
>> Tel: 0422-2203100 (Ext. 122), 2203122 (Direct), Mob: 09787320901
>> Fax: 0422-2657088
>> http://www.sacon.in/staff/dr-m-mahendiran/
>>
>> P Please consider the environment before printing this email
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Morphmet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to morphmet2+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/morphmet2/2a1d6878-29de-46fc-a5eb-711cdefbd81cn%40googlegroups.com.

Re: [MORPHMET2] Questions about Kendall’s shape space and tangent space projection

Reply via email to