Il 01/11/22 18:02, Han Xiao ha scritto:
Dear morpho people,
I am writing to ask a rarely discussed question, which is to test
associations between genomic data and shape variation.
Dear Han,
yes, this is a topic that is perhaps less frequently discussed than
others. As a participant to one of the editions of my geometric
morphometric course, you may recall we covered this general topic to
some extent.
To describe my system and bit first, I am working with four sympatric
fish morphs in a lake. I have both genetic data (SNP generated by
ddRADSeq, around 12000 SNPs) and shape data (landmarks and
semilandmarks-based GM for the head shape) of the same fish samples. The
genetics indicate that three morphs are genetically distinct, while one
is of hybridization origin between two morphs with different degrees of
introgression. I like to ask the question: for the three related morphs
(two parental morphs and one hybrid morph) there any correlation between
the degree of genetic introgression and the shape variation?
Presumably, if this is your main question and by "shape variation" you
mean "individual shape", you would be performing the analysis mainly on
the only introgressed morph (whose individuals I have to assume based on
your text have varying levels of introgression), using information from
the two "parental" morphs.
I was suggested to apply a 2b-PLS to test it. Then I searched for some
literature and find a few cases. However, the studies vary for the input
data of both genetics and morphometrics. For genetics, people have used
genetic distances (calculated as Fst/(1-Fst), Fst is a measurement of
genetic differentiation), Prevosti distance, allele frequencies (a few
microsatellites), and expression results (numeric and continuous). For
the shape data, people used Eucidean distances, GPA coordinates,
centroid sizes, etc.
About the genetic measures, if the question is about the degree of
introgression (of one morph into another when producing the third
morph), it is doubtful that any of the measures you mention would
adequately capture that. For instance, within your introgressed morph
genetic variation among individuals may not be produced exclusively by
varying levels of introgression. So FST would be a poor choice because
it would capture genetic variation produced by other causes (e.g.,
neutral variation). There are other reasons why FST may be a poor
choice, but let's keep it at that.
Perhaps a semi-decent solution to quantify the degree of introgression
would be to subset your SNP panel to only those SNPs (if any) which are
fixed between "parental" morphs (and which are biallelic in the
introgressed morph), code them to reflect their "polarity" (e.g., 0 the
allele in one parental morph, 1 the allele in the other parental morph)
rather than using the actual nucleotides, and then use the data scored
this way for your individuals from the introgressed morph to do tests
with morphology.
The above is just a very rough solution, with ample margins of
improvement depending on the details of your system (e.g., ongoing gene
flow between the two "parental" morphs, with most of alleles not being
fixed between them). But, as you may imagine, this goes well beyond this
brief reply and would require more in-depth knowledge of your specific
situation (notice how I had to make several assumptions about how
genetic variation is distributed among your morphs).
Notice also that if the level of introgression is all you care about
(regardless of which loci it comes from) you may obtain a much better
and "faster" (i.e., less work for you) solution by using individual
estimates of levels of admixture between morphs from one of the software
used for analysis of genetic admixture (which you have probably used
anyway).
So my questions are:
1. Do you all agree that sb-PLS should also make sense for such a
comparison?
PLS may be a good solution to identify how the shape and levels of
introgression co-vary. Tests based on a measure of association (e.g.,
Escoufier RV) may be used to test the null hypothesis that they are
independent.
If your estimate of level of introgression is univariate (notice that in
the rough solution I suggested above this may not be the case), you may
also consider general linear models (and associated tests of
significance) using the level of introgression as a predictor.
2. What you will suggest for the input files? (I do have some
considerations to discuss)
See above. I suppose the main issue is a bit beyond input files per se
and more about how you quantify/represent introgression.
3. Is there any other analysis you will recommend? >
I know normally people will use GWAS to search for associations,
however, I am looking for something that can tolerate a smaller sample
size (30 fish per morph).
This is absolutely correct. But, more fundamentally, GWAS' goal is quite
distinct from the hypothesis you want to test.
Also, the potential transgressive shape of
hybrids may be a confounding factor, especially there is different
allometry observed.
Yes. But transgressive segregation may not be a concern if you are just
interested in whether and how levels of introgression scored within a
single "introgressed" morph is associated to shape variation.
Best,
Carmelo
--
==================
Carmelo Fruciano
Italian National Research Council (CNR)
IRBIM Messina
http://www.fruciano.org/
==================
--
You received this message because you are subscribed to the Google Groups
"Morphmet" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/morphmet2/27f7c32c-0e0f-682d-0b95-bcb927a67380%40unict.it.