-------- Original Message --------
Subject: Re: procrustes variances
Date: Tue, 1 Sep 2009 08:41:20 -0700 (PDT)
From: Haber Annat <[email protected]>
To: <[email protected]>

Hello,
Do you (or any one else) mind explaining this in more details?
On the face of it I would think that it makes more sense to superimpose each sample separately, for exactly the reason that Louis gave. This is assuming: 1) the samples are based on exactly the same set of landmarks; 2) you're only comparing the total variance within each sample and not around each landmark, and not anything else; 3) none of the samples include any of the other samples (and maybe other assumptions I'm forgetting right now). If that is wrong to do then what about bootstrapping? After all, as far as I understand, whenever you bootstrap a sample in order, for example, to get confidence interval of something, you re-superimpose every bootstrapped sample but you don't superimpose all 500 or so bootstrapped samples simultaneously, do you? Then if you pile up all these bootstrapped values into one distribution, that implies that they are comparable, doesn't it? Even though they're each based on a somewhat different space. I tried to test it using simulations as well as looking into my own data. So I generated two samples with somewhat different mean shape and level of variance (based on a non-isotropic covariance matrix), but the same sample size, and compared the variance within each sample when they are superimposed together as opposed to when they are superimposed separately (repeated 100 times). With the real data I simply took the same number of males and females from each of 9 species (ruminants, what else) where I know there is significant sexual dimorphism. With the simulations, the variances based on the combined GPA correlate very tightly (>0.92) with the ones based on separate GPA but are consistently slightly overestimated for both samples in each pair. The higher the variance the more they are overestimated relative to their variance when superimposed alone. And also, the higher the difference in mean shape the bigger the bias. However, when one of the samples is five times bigger, the bigger sample's variances correlate perfectly while the smaller samples are vastly overestimated and correlate ~ 0.8.

With the empirical data I get exactly the same variances for each of the samples either way I calculate it, but that may be because the mean shape is not that different after all.

So all in all, I can see that the space they're in makes a difference but I still don't see why it wouldn't be better to compare their within-sample variance based on separate superimposition for each sample, especially since there is a consistent bias (although these brief simulations obviously didn't cover all possible scenarios). And what does that mean for bootstrapping procedures?
Thanks
Annat


From: morphmet <[email protected]>
Reply-To: <[email protected]>
Date: Fri, 28 Aug 2009 07:42:29 -0400
To: morphmet <[email protected]>
Subject: RE: procrustes variances
Resent-From: <[email protected]>
Resent-Date: Fri, 28 Aug 2009 04:45:01 -0700 (PDT)



-------- Original Message --------
Subject: RE: procrustes variances
Date: Thu, 27 Aug 2009 19:08:25 -0700 (PDT)
From: F. James Rohlf <[email protected]>
Reply-To: [email protected]
Organization: Stony Brook University
To: [email protected]
References: <[email protected]>

You should only compare them when computed after a combined GPA.
Otherwise you are comparing quantities computed in different spaces.

------------------------
F. James Rohlf, Distinguished Professor
Ecology & Evolution, Stony Brook University
www: http://life.bio.sunysb.edu/ee/rohlf

-----Original Message-----
From: morphmet [mailto:[email protected]]
Sent: Thursday, August 27, 2009 11:32 AM
To: morphmet
Subject: procrustes variances



-------- Original Message --------
Subject:  procrustes variances
Date:  Thu, 27 Aug 2009 07:41:39 -0700 (PDT)
From:  Louis Boell <[email protected]>
To:  <[email protected]>



Dear colleagues,

I have a question about procrustes variance. I want to compare the
shape
variances of different samples. I have three groups of 77, 96 and 17
specimens, respectively. I calculated the procrustes variance of each
group in two ways: 1) after pooling the raw data of all three groups
into a common total dataset and fitting them together; b) after
calculating the procrustes fit for each dataset/group separately.

The results for the two large samples are quite consistent between both
procedures; however, the estimate of the procrustes variance for the 17
specimen sample is much larger when fitted together with the other two
samples than when fitted separately.

I assume that this is because the procrustes fit is a "democratic"
procedure, which is much more influenced by large samples than by small
samples when they are fitted together. This could potentially result in
a "spreading" of the specimens from the specimens from the small sample
in the space of the procrustes coordinates, if their covariance pattern
is different from the mean covariance pattern of the total dataset
which
will be largely determined by the large samples.

Altogether, my question amounts to whether it is more approriate to
compare procrustes variances from separate procrustes fits or from a
procrustes fit of the pooled total dataset.

Thanks in advance for help

Louis Boell



Louis Boell
MPI für Evolutionsbiologie
August-Thienemannstr.2
24306 Plön
[email protected]
[email protected]



-----------------------------------------------------------------------
-
Mehr wissen - besser reisen.
<http://redirect.gimas.net/?n=M0908aReisen1>

--
Replies will be sent to the list.
For more information visit http://www.morphometrics.org




--
Replies will be sent to the list.
For more information visit http://www.morphometrics.org





--
Replies will be sent to the list.
For more information visit http://www.morphometrics.org

Reply via email to