Dear Bruce, Besides any other problems I can see that you consider FA as a factor and that could be a problem with your data set.
Manuel 2014-07-30 7:10 GMT-06:00 Bruce Miller <batsnc...@gmail.com>: > Hi all, > > Sorry this is a bit long, but the explanation of what I want to do needs > to be clear to avoid issues such as this quote..."It is impossible to > speak in such a way that you cannot be misunderstood." Karl Popper. > > I am running linear regression models, but I am getting expected results. > I wonder what else I might try to derive an estimated value of bat > echolocation parameters based on forearm measurements. It is known that > the size of the bat is negatively related to the characteristic > frequency (Fc) of their echolocation calls (decades of my field work) . > So in general larger guys have lower frequency calls and smaller guys > have higher frequency calls. > > I have run the regressions based on the FA (valid forearm measurements) > and the known and valid Fc ranges for a dozen species or so and using > the lm models to "predict" Fc values for a few species that have FA > values but have not yet been recorded. Hence there are no valid > echolocation call parameters. R Code used is below discussion. > > I have valid ranges for the known species FA (forearm measurements) and > Fc(minimum) and Fc (maximum). So I do two separate runs with the data > using the lm model one with FA~Fcmin and one FA~Fcmax. > > The goal is to provide the predicted (estimated) values for the species > with known FA values but w/o verified Fc value ranges. > > My concern is that the predicted values returned are much lower than the > true values for the verified species. Therefore I am not confident the > predicted values for those w/o verified Fc ranges are useful. > > One very helpful person looked at one simple data set I sent and showed > that the statistical differences between the true values and predicted > were not significant. > > However Krebs' admonishment to students eons ago "Do not confuse > statistical significance with ecological significance" is true here. > The values of the predicted ranges are far lower than reality so the few > species that do not have field recorded Fc values are suspect. These > differences in predicted values from a true range will will make a > difference for potentially IDing the unknown calls. A difference of > 10kHz Fc generally suggests a different species, albeit some are much > closer and may only have a 5 kHz difference. > > I am looking at acoustic data sets of calls from South America and there > are many "sonospecies." > These are clearly separate species based on echolocation call parameters > that have yet had "faces & voices" matched. We know that call > parameters are diagnostic for families and genera even when the species > is unknown. It is then the Fc values that assist in identifying the > species within a cluster of calls from the same genus. > > Sample of R code used: > > Bats <- dget('C:/=Bat data working/Acoustic > Parameters/_Working/=Vespertilionidae/Bats.robj') > > model.lm <- lm(formula=Fc ~ as.factor(FA),data=Bats,na.action=na.omit) > > Anova(model.lm,type='II') Error in solve.default(L %*% V %*% t(L)) : > system is computationally singular: reciprocal condition number = 0 > > summary(model.lm) > > Call: > lm(formula = Fc ~ as.factor(FA), data = Bats, na.action = na.omit) > > Residuals: > ALL 5 residuals are 0: no residual degrees of freedom! > > Coefficients: > Estimate Std. Error t value Pr(>|t|) > (Intercept) 53.3 NA NA NA > as.factor(FA)34.4 -4.6 NA NA NA > as.factor(FA)35.4 2.3 NA NA NA > as.factor(FA)35.5 9.0 NA NA NA > as.factor(FA)40.5 -7.3 NA NA NA > > Residual standard error: NaN on 0 degrees of freedom > (2 observations deleted due to missingness) > Multiple R-squared: 1, Adjusted R-squared: NaN > F-statistic: NaN on 4 and 0 DF, p-value: NA > > > tmp<-predict(model.lm) > > Bats[names(tmp),"predicted"]<-tmp > > rm('tmp') > > rm('model.lm') > > > > >model.lm <- lm(formula= Fc~ FA,data=Bats,na.action=na.omit) > > >Anova(model.lm,type='II') > > >summary(model.lm) > > >tmp<-predict(model.lm,Bats) > > >Bats[names(tmp),"Predicted.Fc"]<-tmp > > >rm('tmp') > > >rm('model.lm') > > With the results it can be seen that the predicted Fc values on right > are not close to the true Fc values on left and then make me hesitant to > accept the 2 with NA predicted values. FYI Species are simple 6 letter > coded for genus and species. > Species FA Fcmin Fcmax FcMinpredic FcMaxpredic > Myoalb 35.3 45.7 48.7 51.73 55.26 > Myoata 37 NA NA 49.52 52.59 > Myokea 33.7 57.8 61.3 53.80 57.77 > Myonig 34.5 51.6 55.7 52.77 56.52 > Myooxy 40.5 45.7 47.6 44.98 47.09 > Myorip 36 53.3 57.5 50.82 54.16 > Myosim 38 NA NA 48.23 51.02 > > > Perhaps simple linear regression is not the method to use? > Thanks for any additional suggestions. > > Bruce > > [[alternative HTML version deleted]] > > _______________________________________________ > R-sig-ecology mailing list > R-sig-ecology@r-project.org > https://stat.ethz.ch/mailman/listinfo/r-sig-ecology > -- *Manuel SpÃnola, Ph.D.* Instituto Internacional en Conservación y Manejo de Vida Silvestre Universidad Nacional Apartado 1350-3000 Heredia COSTA RICA mspin...@una.ac.cr mspinol...@gmail.com Teléfono: (506) 2277-3598 Fax: (506) 2237-7036 Personal website: Lobito de rÃo <https://sites.google.com/site/lobitoderio/> Institutional website: ICOMVIS <http://www.icomvis.una.ac.cr/> [[alternative HTML version deleted]]
_______________________________________________ R-sig-ecology mailing list R-sig-ecology@r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-ecology