Re: Bootstrapping with Semi-landmarks

morphmet_moderator Sun, 10 Feb 2013 15:52:41 -0800

       
----- Forwarded message from andrea cardini <[email protected]> -----

     Date: Thu, 7 Feb 2013 03:09:25 -0500
      From: andrea cardini <[email protected]>
      Reply-To: andrea cardini <[email protected]>
      Subject: Re: Bootstrapping with Semi-landmarks
      To: [email protected]

Hi Collin,
If I've got it right, you have two aims: one is 
to test the significance of group mean 
differences and the other one to see if any of 
the unknown groups is closer to one or the other of the two known ones. 

First a consideration: unequal N and maybe small 
N in some of the groups is never ideal, although 
it might often be the reality of the data. Small 
N will also very likely lead to large errors in 
the estimates of means etc. (Zoomorphology, 126: 121-134.). 

For testing group differences, you can do it 
pairwise (with a 'correction' for multiple tests) 
using permutations (MorphoJ, PAST etc.) or 
bootstraps (IMP and probably other programs). [NB 
in all the programs I am suggesting, you'll have 
to slide (if you slide them) the semilandmarks in 
another program.] You can also do a MANOVA using 
a resampling approach. The 'old' Morpheus does it 
(brief description in Italian Journal of Zoology, 
71: 63-72) but especially if you have many 3D 
landmarks/semilandmarks it might take really 
long. If you need, I can see what the commands 
are. PAST might also have something using 
permutations and the DOS series of programs 
(PERMANOVA etc.) by Marti Anderson will also have 
something of this kind. We recently used 
PERMANOVA to test measurement error (Franklin et 
al. Concordance of traditional osteometric and 
volume rendered MSCT interlandmark cranial 
measurements. International Journal of Legal 
Medicine, in press: DOI: 
10.1007/s00414-012-0772-9). That's a different 
aim and PERMANOVA requires equal N across groups, 
but there should be another program of the same 
series which does not need equal N and might be used also without replicas. 
Last but not least, R will do everything 
(including sliding) but one needs to know the 'language'. 

For assessing group similarities, if you were 
focusing on individuals, I would have suggested a 
classification method like a DA. You're right 
that there is a problem when you have many 
variables (and maybe small and unequal N) but it 
might still be doable after dimensionality 
reduction (e.g., using the first appropriate 
number of PCs). Dimensionality reduction is 
always tricky. There are papers suggesting how to 
test the sensitivity of DAs to the inclusion of 
more or less variables (including stuff I was 
involved in - Journal of Archaeological Science, 
38: 3006-3018 and 40: 735-743 - but there's 
plenty more in the morphometric and statistical 
literature, I am sure). I would also suggest you 
to read some of the papers by 'Viennese-Leipzig' 
school, where they need to assess where unknown 
fossils might belong to. I remember several 
articles using ordinations and bootstraps to 
estimate confidence regions and at least one or 
two where they did DAs on the first few PCs of 
shape coordinates from large configurations of 
landmarks and semilandmarks. Philipp Gunz or 
someone else in the list will be able to suggest the most appropriate refs. 

Most of the stuff I know about group similarities 
is in taxonomy where simple ordinations and 
phenograms were often used to assess group 
similarity. These are usually done either in the 
original shape space (e.g., a simple PCA of shape 
coordinates), and that preserves the original 
shape distances, or in a statistical space such 
as the DA/CVA space, which maximizes group 
differences, makes variation around means 
circular but 'distorts' distances and makes quite 
a few assumptions (hard to test with small N and 
many variables). Klingenberg and Monteiro (2005, 
Syst. Biol.) discuss some of the issues with 
different spaces and analyses, and Mitteroecker 
and Bookstein (Evol Biol (2011) 38:100–114) do it as well. 

It's a good idea to do these analyses taking into 
consideration sample variation and uncertainties 
in estimates of, for instance, mean shapes. 
Examples of ordinations of mean shapes with 
confidence regions estimated using bootstraps can 
be found in some of my papers (e.g., if I 
remember well, Journal of Zoological Systematics 
and Evolutionary Research, 47: 258–267 and done 
with a more sophisticated and certainly more 
accurate method in at least one or two old papers 
by Leandro Monteiro (again, Leandro can help with 
the exact ref. which I can't remember). David 
Polly, if I am correct, and I with various 
coauthors also use bootstraps to assess the 
impact of sampling on phenograms of mean shapes. 
The method is described in the Biological Journal 
of the Linnean Society, 2008, 93, 813–834 and 
it's not the same as the one implemented in PAST: 
we bootstrap individuals in samples and 
re-estimate means; PAST bootstraps variables - 
which is unlikely to be OK for Procrustes shape 
data, has a different aim (assessing character 
sampling) and is appropriate in other contexts. 
I used to do most of these analyses in NTSYSpc 
and recently did it for someone else on the list 
who was interested in it (but I CANNOT PROMISE TO FIND TIME!). 

I am sure there are other options and hope that others will make suggestions. 
Good luck. 
Cheers

Andrea

PS
With ordinations, you may also want to add a 
minimum spanning tree! NTSYSpc does it using the 
Procrustes distances; PAST does it but I believe 
is using only the distances from the variables 
you're plotting (e.g., two PCs). I prefer the 
first option as it helps to see distortions due 
to dimensionality reduction in scatterplots. 

Andrea

At 04:27 07/02/2013, you wrote:
>
>----- Forwarded message from Collin VanBuren 
><[email protected]> -----
>
>      Date: Mon, 4 Feb 2013 19:59:46 -0500
>       From: Collin VanBuren <[email protected]>
>       Reply-To: Collin VanBuren <[email protected]>
>       Subject: Bootstrapping with Semi-landmarks
>       To: "[email protected]" <[email protected]>
>
>Hello all,
>
>I am trying to determine if there are 
>significant differences between groups (of 
>unequal sample sizes) using an outline analysis 
>with sliding semi-landmarks. Two of these groups 
>are my 'known' groups, and I'm essentially 
>trying to determine to which of the 'known' 
>groups the other five groups most likely belong 
>(i.e., they are unknowns). According to what I'm 
>reading, the only (?) way to do this is to 
>bootstrap the partial Procrustes distances and 
>then use a Goodall's F-test to test for 
>significant differences. This method avoids the 
>issues semi-landmarks have with degrees of 
>freedom, and it is my understanding that I 
>cannot run typical statistical analyses on 
>semi-landmark datasets because of these issues. 
>
>However, I am having issues finding a program 
>with which to run this analysis. In IMP, it 
>seems a maximum of four groups are allowed, but 
>I have seven total (the other issue is that it 
>doesn't seem to be letting me open files, but 
>I'm still playing around with that). It doesn't 
>appear that MorphoJ or any of the R packages 
>I've looked at (mostly geomorph and shapes) work 
>well with semi-landmarks, at least not for what 
>I'm looking to do. Any advice on a way I could 
>perform this analysis or an alternative option?
>
>Thank you in advance for your help!
>Collin
>
>----- End forwarded message -----
>
>

----- End forwarded message -----

Re: Bootstrapping with Semi-landmarks

Reply via email to