Re: procrustes variances

morphmet Tue, 01 Sep 2009 09:23:17 -0700


-------- Original Message --------
Subject: Re: procrustes variances
Date: Tue, 1 Sep 2009 08:41:20 -0700 (PDT)
From: Haber Annat <[email protected]>
To: <[email protected]>

Hello,
Do you (or any one else) mind explaining this in more details?

On the face of it I would think that it makes more sense to superimposeeach sample separately, for exactly the reason that Louis gave. This isassuming: 1) the samples are based on exactly the same set of landmarks;2) you're only comparing the total variance within each sample and notaround each landmark, and not anything else; 3) none of the samplesinclude any of the other samples (and maybe other assumptions I'mforgetting right now). If that is wrong to do then what aboutbootstrapping? After all, as far as I understand, whenever you bootstrapa sample in order, for example, to get confidence interval of something,you re-superimpose every bootstrapped sample but you don't superimposeall 500 or so bootstrapped samples simultaneously, do you? Then if youpile up all these bootstrapped values into one distribution, thatimplies that they are comparable, doesn't it? Even though they're eachbased on a somewhat different space. I tried to test it usingsimulations as well as looking into my own data. So I generated twosamples with somewhat different mean shape and level of variance (basedon a non-isotropic covariance matrix), but the same sample size, andcompared the variance within each sample when they are superimposedtogether as opposed to when they are superimposed separately (repeated100 times). With the real data I simply took the same number of malesand females from each of 9 species (ruminants, what else) where I knowthere is significant sexual dimorphism. With the simulations, thevariances based on the combined GPA correlate very tightly (>0.92) withthe ones based on separate GPA but are consistently slightlyoverestimated for both samples in each pair. The higher the variance themore they are overestimated relative to their variance when superimposedalone. And also, the higher the difference in mean shape the bigger thebias. However, when one of the samples is five times bigger, the biggersample's variances correlate perfectly while the smaller samples arevastly overestimated and correlate ~ 0.8.

With the empirical data I get exactly the same variances for each of thesamples either way I calculate it, but that may be because the meanshape is not that different after all.

So all in all, I can see that the space they're in makes a differencebut I still don't see why it wouldn't be better to compare theirwithin-sample variance based on separate superimposition for eachsample, especially since there is a consistent bias (although thesebrief simulations obviously didn't cover all possible scenarios). Andwhat does that mean for bootstrapping procedures?

Thanks
Annat

From: morphmet <[email protected]>
Reply-To: <[email protected]>
Date: Fri, 28 Aug 2009 07:42:29 -0400
To: morphmet <[email protected]>
Subject: RE: procrustes variances
Resent-From: <[email protected]>
Resent-Date: Fri, 28 Aug 2009 04:45:01 -0700 (PDT)



-------- Original Message --------
Subject: RE: procrustes variances
Date: Thu, 27 Aug 2009 19:08:25 -0700 (PDT)
From: F. James Rohlf <[email protected]>
Reply-To: [email protected]
Organization: Stony Brook University
To: [email protected]
References: <[email protected]>

You should only compare them when computed after a combined GPA.
Otherwise you are comparing quantities computed in different spaces.

------------------------
F. James Rohlf, Distinguished Professor
Ecology & Evolution, Stony Brook University
www: http://life.bio.sunysb.edu/ee/rohlf

-----Original Message-----
From: morphmet [mailto:[email protected]]
Sent: Thursday, August 27, 2009 11:32 AM
To: morphmet
Subject: procrustes variances

-------- Original Message --------
Subject:  procrustes variances
Date:  Thu, 27 Aug 2009 07:41:39 -0700 (PDT)
From:  Louis Boell <[email protected]>
To:  <[email protected]>

Dear colleagues,

I have a question about procrustes variance. I want to compare the
shape
variances of different samples. I have three groups of 77, 96 and 17
specimens, respectively. I calculated the procrustes variance of each
group in two ways: 1) after pooling the raw data of all three groups
into a common total dataset and fitting them together; b) after
calculating the procrustes fit for each dataset/group separately.

The results for the two large samples are quite consistent between both
procedures; however, the estimate of the procrustes variance for the 17
specimen sample is much larger when fitted together with the other two
samples than when fitted separately.

I assume that this is because the procrustes fit is a "democratic"
procedure, which is much more influenced by large samples than by small
samples when they are fitted together. This could potentially result in
a "spreading" of the specimens from the specimens from the small sample
in the space of the procrustes coordinates, if their covariance pattern
is different from the mean covariance pattern of the total dataset
which
will be largely determined by the large samples.

Altogether, my question amounts to whether it is more approriate to
compare procrustes variances from separate procrustes fits or from a
procrustes fit of the pooled total dataset.

Thanks in advance for help

Louis Boell

Louis Boell
MPI für Evolutionsbiologie
August-Thienemannstr.2
24306 Plön
[email protected]
[email protected]

-----------------------------------------------------------------------
-
Mehr wissen - besser reisen.
<http://redirect.gimas.net/?n=M0908aReisen1>

--
Replies will be sent to the list.
For more information visit http://www.morphometrics.org





--
Replies will be sent to the list.
For more information visit http://www.morphometrics.org





--
Replies will be sent to the list.
For more information visit http://www.morphometrics.org

Re: procrustes variances

Reply via email to