[adegenet-forum] Using PCA of SPCA in linear models with environmental data.
t.jombart at imperial.ac.uk
Thu Jul 12 14:35:02 CEST 2012
Yes, there have been quite a few methods developed since. A starting point would be:
Dray, S.; Legendre, P. & Peres-Neto, P. Spatial modelling: a comprehensive framework for principal coordinate analysis of neighbour matrices (PCNM). Ecological Modelling, 2006, 196, 483-493.
From: Hanan Sela [dooshra at gmail.com]
Sent: 12 July 2012 12:44
To: Jombart, Thibaut
Cc: adegenet-forum at lists.r-forge.r-project.org
Subject: Re: [adegenet-forum] Using PCA of SPCA in linear models with environmental data.
Thank you for the answer.
I want to test whether space (lat+lon) has a significant effect on the genetic structure. Therefore I would like to use spatial variables on the right side of the model. Can you suggest a better representation of the spatial structures than lat-lon?
On Thu, Jul 12, 2012 at 1:58 PM, Jombart, Thibaut <t.jombart at imperial.ac.uk> wrote:
This is a tricky question, and I don't think there is a single universal answer. Technically speaking, the only requirement is that your residuals are independent, so you need to make sure there is no spatial autocorrelation left in them. Otherwise, minimizing the sum of squared residuals is no longer the ML solution.
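One way to check for leftover spatial autocorrelation is Moran's I computed on the model residuals, with a permutation test for significance. The following is only a minimal sketch in plain Python, not adegenet code. It assumes inverse-distance weights and treats the coordinates as planar, which is a simplification for lat/lon. The function names are hypothetical, chosen for illustration.

```python
import math
import random


def inverse_distance_weights(coords):
    """Spatial weights w[i][j] = 1 / distance(i, j), with 0 on the diagonal.

    Assumption: coordinates are treated as planar (Euclidean distance).
    """
    n = len(coords)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d = math.dist(coords[i], coords[j])
                w[i][j] = 1.0 / d if d > 0 else 0.0
    return w


def morans_i(values, w):
    """Moran's I of `values` (e.g. model residuals) under weights `w`."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in w)  # sum of all weights
    num = sum(w[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / s0) * (num / den)


def morans_i_permutation_test(values, w, n_perm=999, seed=0):
    """One-sided permutation test for positive spatial autocorrelation.

    Returns the observed Moran's I and the proportion of permutations
    (counting the observed value itself) with an I at least as large.
    """
    rng = random.Random(seed)
    observed = morans_i(values, w)
    shuffled = list(values)
    count = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if morans_i(shuffled, w) >= observed:
            count += 1
    return observed, (count + 1) / (n_perm + 1)
```

A small p-value would suggest that the residuals are still spatially structured, so the independence requirement is not met. The choice of weights (inverse distance, nearest neighbours, a connection network) is itself a modelling decision and can change the result.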
The real problem relates to the interpretation, and to the assumptions implicitly made by the model. There are several reasons why spatial genetic patterns can occur. Your model has the form:
genetic pattern = lat+lon + environment + residuals
This means that, beyond linear spatial trends, genetic patterns are assumed to be due to the environment. It makes sense to treat spatial autocorrelation as a confounding factor to be removed from the analysis first. But lat+lon is often not enough to capture all spatial structures. In this respect, using PCs from PCA on the left side is probably better than sPCA (no need to seek spatial structures only to remove them afterwards).
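To make the form of the model concrete, here is a minimal ordinary least squares sketch in plain Python. It is not adegenet code, and it is not the analysis from this thread. It regresses one genetic axis (a PC score per sample) on lat, lon and a single environmental variable. All data values and names (pc1, env, fit_ols, solve_linear_system) are made up for illustration.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_ols(y, predictors):
    """Least squares fit of y on the predictor columns plus an intercept.

    Returns (coefficients, residuals); coefficients[0] is the intercept.
    """
    n = len(y)
    X = [[1.0] + [col[i] for col in predictors] for i in range(n)]
    p = len(X[0])
    xtx = [[sum(X[k][i] * X[k][j] for k in range(n)) for j in range(p)]
           for i in range(p)]
    xty = [sum(X[k][i] * y[k] for k in range(n)) for i in range(p)]
    beta = solve_linear_system(xtx, xty)
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in X]
    residuals = [yi - fi for yi, fi in zip(y, fitted)]
    return beta, residuals


# Made-up toy data: six sampling sites, one environmental variable.
lat = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
lon = [1.0, 0.0, 2.0, 1.0, 3.0, 0.0]
env = [0.2, 0.9, 0.1, 0.7, 0.4, 0.6]
# Hypothetical PC1 scores, built here as an exact linear combination.
pc1 = [2.0 + 0.5 * a - 1.0 * o + 3.0 * e for a, o, e in zip(lat, lon, env)]

# genetic pattern = lat + lon + environment + residuals
beta, residuals = fit_ols(pc1, [lat, lon, env])
```

The residuals returned here are what would need to be free of spatial autocorrelation for the fit to be trusted. In a real analysis the left side would be PC scores from a PCA of the genotypes, fitted one axis at a time.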
From: adegenet-forum-bounces at lists.r-forge.r-project.org [adegenet-forum-bounces at lists.r-forge.r-project.org] on behalf of Hanan Sela [dooshra at gmail.com]
Sent: 12 July 2012 07:34
To: adegenet-forum at lists.r-forge.r-project.org
Subject: [adegenet-forum] Using PCA of SPCA in linear models with environmental data.
I am trying to estimate the major factors affecting the spatial distribution of wild wheat genotypes. I am using a linear model where the first and second axes of the PCA or the SPCA are the dependent variables and the environmental variables are the predictors. Additionally, I am using the longitude and the latitude as predictors. Since there is a spatial reference on the left side of the formula, I was wondering if using SPCA on the right side will not be a problem.
Hanan Sela Ph.D.
Curator of the Lieberman Cereal Germplasm Bank
The Institute for Cereal Crops Improvement
P.O. Box 39040
Tel Aviv 69978
hans at tauex.tau.ac.il