<div dir="ltr"><div><div><div><div class="im"><div>Doctor Thibault and dear colleagues,<br><br></div></div>Deal:). I'll do my best.<br><br></div><div>About

 the PC's with biological meaning but not possessing  traits/phenotypic 

information. Later on, I'll explain to you why I think this"crazy" idea might be feasible, in this case.<br>

</div><div><br></div>Thank you once more for the helpful suggestions.<br><br></div>Cheers,<br></div>Manuela</div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-06-03 12:47 GMT+01:00 Manuela <span dir="ltr"><<a href="mailto:manuelacorreia2@gmail.com" target="_blank">manuelacorreia2@gmail.com</a>></span>:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div><div><div class=""><div>Doctor Thibault and dear colleagues,<br><br></div></div>Deal:). I'll do my best.<br>

<br></div><div>About the PC's with biological meaning but not having traits/phenotipic information. Later I'll explain to you the reason why I insist on using the softwares you have developed for PCA and sPCA  to go on with this "crazy" idea.<br>


</div><div><br></div>Thank you once more for the helpful suggestions.<br><br></div>Cheers,<br></div>Manuela</div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-06-03 10:26 GMT+01:00 Jombart, Thibaut <span dir="ltr"><<a href="mailto:t.jombart@imperial.ac.uk" target="_blank">t.jombart@imperial.ac.uk</a>></span>:<div>

<div class="h5"><br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>

Hi there,<br>

<br>

I would not recommend using all three phylogenetic reconstruction methods, even if with 19 SNPs there shouldn't be major differences. I covered the maximum parsimony for historical reasons, but I can't see it being useful here.<br>


<br>

Other clustering approaches sounds like a good idea. If you ever fancy documenting how to use them on genetic data in a small tutorial, I think that would be a very handy to others ;)<br>

<br>

As for your last question, it makes a lot of sense, but you will need external information for this. Eigenvalue selection procedures based on inertia will basically fail to detect the structures you talk about. So you will need to be able to test e.g. the correlation of your PCs to a set of traits, or their spatial distribution, etc.<br>


<br>

Cheers<br>

Thibaut<br>

<br>

<br>

________________________________________<br>

From: Manuela [<a href="mailto:manuelacorreia2@gmail.com" target="_blank">manuelacorreia2@gmail.com</a>]<br>

Sent: 03 June 2014 10:01<br>

To: Jombart, Thibaut<br>

Cc: <a href="mailto:adegenet-forum@lists.r-forge.r-project.org" target="_blank">adegenet-forum@lists.r-forge.r-project.org</a><br>

Subject: Re: [adegenet-forum] Monmonier algorithm and individual scores<br>

<div><br>

Doctor Thibault and dear colleagues,<br>

<br>

I would like to thank you for the valuable criticism you made in this output. The idea behind the IS was, solely, to have a first draft of the georeferenced clusters because in spatial clusters  I'm well-aware that several different genoypes at the same coordinates in species with a very low mobility or with no mobility could be a strong indication that the genetic variability is only due to environment while a great genetic diversity nearby may result from a short dispersal highly spatial correlated. To need of further confirmation by sPCA and/or clustering techniques.<br>


<br>

The identification of spatial clusters in PCA, particularly by sPCA is no doubt more realiable than with Monmonier algoritm in this case. But I'd rather try to study more deeply each one of the 3 different methods (distance based-methods, Parsymony and maximum Likelihood) proposed in your tutorial "Trees" just to check it in first place if they might be appropriate to this dataset, Secondly, if they would gave different information perhaps with higher resolution when compared to classic NJ Tree, after validation by bootstrap.  Eventually, if none is appropriate I always be able to rely on several clustering techniques more adequate for qualitative data, available at the "Cluster" package and to perform the validation by "cl Valid" following several criteria.<br>


<br>

>From a very simplistic point of view, PCA analysis (not scaled) might provides us with information of the genetic variability whereas sPCA about the significance of local and global structures. But, on the whole, the information provided by these two analysis: Moran's Index , variance and allele loadings, enable us to discriminate the loci more informative on genetic variability but not spatially structured from those whose variability its spatial structured. To be further confirmed through biplots.<br>


<br>

Another challenge ahead. To figure out the way to select the PC's having biological meaning and most probably not associated to the highest eigenvalues. Particularly, in the absence of traits or phenotype information.<br>


<br>

Please, feel free to make more comments or to give another suggestion(s).<br>

<br>

Cheers,<br>

Manuela<br>

<br>

<br>

</div>2014-06-02 17:20 GMT+01:00 Jombart, Thibaut <<a href="mailto:t.jombart@imperial.ac.uk" target="_blank">t.jombart@imperial.ac.uk</a><mailto:<a href="mailto:t.jombart@imperial.ac.uk" target="_blank">t.jombart@imperial.ac.uk</a>>>:<br>


<div>Hi Manuela,<br>

<br>

thanks for re-posting on the forum. In this case, it seems that locations are very aggregated - a lot of genotypes were sampled roughly at the same place. Monmonier is unlikely to do well under such circumstances. The algorithm is very sensitive to local differences, and these are unstable for this kind of spatial distribution. I would recommend other approaches. For instance, if you want to define spatial clusters, you could use a basic clustering algorithm based on the principal components of a PCA (if spatial structure is obvious) or sPCA (if not, but there is still a spatial structure). Assuming 'foo' is your analysis (PCA or sPCA), one example would be using something along the lines of:<br>


<br>

h1 <- hclust(dist(foo$li)^2)<br>

plot(h1)<br>

cutree(h1)<br>

<br>

Etc.<br>

Check ?hclust for different clustering methods.<br>

<br>

Cheers<br>

Thibaut<br>

<br>

<br>

________________________________________<br>

</div>From: <a href="mailto:adegenet-forum-bounces@lists.r-forge.r-project.org" target="_blank">adegenet-forum-bounces@lists.r-forge.r-project.org</a><mailto:<a href="mailto:adegenet-forum-bounces@lists.r-forge.r-project.org" target="_blank">adegenet-forum-bounces@lists.r-forge.r-project.org</a>> [<a href="mailto:adegenet-forum-bounces@lists.r-forge.r-project.org" target="_blank">adegenet-forum-bounces@lists.r-forge.r-project.org</a><mailto:<a href="mailto:adegenet-forum-bounces@lists.r-forge.r-project.org" target="_blank">adegenet-forum-bounces@lists.r-forge.r-project.org</a>>] on behalf of Manuela [<a href="mailto:manuelacorreia2@gmail.com" target="_blank">manuelacorreia2@gmail.com</a><mailto:<a href="mailto:manuelacorreia2@gmail.com" target="_blank">manuelacorreia2@gmail.com</a>>]<br>


<div>Sent: 31 May 2014 21:46<br>

</div>To: <a href="mailto:adegenet-forum@lists.r-forge.r-project.org" target="_blank">adegenet-forum@lists.r-forge.r-project.org</a><mailto:<a href="mailto:adegenet-forum@lists.r-forge.r-project.org" target="_blank">adegenet-forum@lists.r-forge.r-project.org</a>><br>


<div><div>Subject: [adegenet-forum] Monmonier algorithm and individual scores<br>

<br>

Dear colleagues of Adegenet forum,<br>

<br>

First of all I must congratulate Doctor Thimbault for the wonderful work he has been so far developed. And following his own suggestion I'm sharing with you a specific issue raised by the output generated by Monmonier algorithm used for boundary detection.<br>


I have a sample made of 170 individuals, collected on 9 different places and genotyped for 19 SNPs by Realtime PCR.<br>

Before I run this line on the R script I had to explain to you about each one of them:<br>

mon1<- monmonier(xy ,D, gab)<br>

<br>

xy – spatial coordinates UTM/Km) ;<br>

D – pairwise allele sharing distance (“Prabclus” package);<br>

gab <-chooseCN(xy,ask=FALSE,type=1)  (Delaunay Triangulation)<br>

<br>

plot(mon1,1:170,method=”greylevel”,add.arr=FALSE,bwd=6,col=”red”)<br>

>From the output produced, it can be clearly seen that there are 4 clusters of individuals having four scores (50,100,150,200). But, I can't find a way to have access to individual scores. As matter in fact, I consulted in detail all the arguments provided on Plot function but none of them seemed to me to be on the way I could extract the individuals scores (IS).<br>


I’m wondering if you could give me a hint about it. Any help will be appreciated.<br>

Kind regards,<br>

Manuela (Biochemist)<br>

<br>

</div></div></blockquote></div></div></div><br></div>

</blockquote></div><br></div>