<div dir="ltr">Hi there, <div><br></div><div>no, the position of the loci is not needed. This retains most contributing alleles, no matter what the loci are. </div><div><br></div><div>Best</div><div>Thibaut</div></div><div class="gmail_extra"><br clear="all"><div><div class="gmail_signature" data-smartmail="gmail_signature"><br>--<br>Dr Thibaut Jombart<br>Lecturer, Department of Infectious Disease Epidemiology, Imperial College London<br>Head of RECON: <a href="http://repidemicsconsortium.org" target="_blank">repidemicsconsortium.org</a><br>WHO Consultant - outbreak analysis<br><a href="http://sites.google.com/site/thibautjombart/" target="_blank">sites.google.com/site/thibautjombart/</a><br>Twitter: @TeebzR<br>+44(0)20 7594 3658</div></div>
<br><div class="gmail_quote">On 12 September 2017 at 22:19, Ngoc-Tien Tran <span dir="ltr"><<a href="mailto:Ngoc-Tien.Tran@insa-rennes.fr" target="_blank">Ngoc-Tien.Tran@insa-rennes.fr</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hello ,<br>
Thank you so much for your suggestion.<br>
However, this method only works if the position on the chromosomes are available for each locus. Am I correct ?<br>
<br>
Best wishes !<br>
Ngoc-Tien<br>
<br>
----- Mail original -----<br>
De: "thibautjombart" <<a href="mailto:thibautjombart@gmail.com">thibautjombart@gmail.com</a>><br>
À: "Ngoc-Tien Tran" <<a href="mailto:Ngoc-Tien.Tran@insa-rennes.fr">Ngoc-Tien.Tran@insa-rennes.fr</a><wbr>><br>
Cc: "adegenet-forum" <<a href="mailto:adegenet-forum@lists.r-forge.r-project.org">adegenet-forum@lists.r-forge.<wbr>r-project.org</a>><br>
Envoyé: Lundi 11 Septembre 2017 14:05:46<br>
Objet: Re: [adegenet-forum] Loci selection for SNP data<br>
<div class="HOEnZb"><div class="h5"><br>
Hello,<br>
<br>
there is no tool specifically dedicated to this, but you could run a PCA,<br>
retain axes corresponding to the xxx% of variance, and then keep only the<br>
alleles contributing most to these axes. Here's a quick example:<br>
<br>
> data(sim2pop)<br>
> pca1 <- dudi.pca(tab(sim2pop), scannf = FALSE, nf = 3)<br>
> contrib <- pca1$c1^2<br>
> which(contrib > 0.01, TRUE) # contrib > 1%<br>
row col<br>
L02.06 21 1<br>
L04.02 47 1<br>
L05.02 60 1<br>
L05.09 67 1<br>
L06.6 74 1<br>
L07.7 84 1<br>
L08.07 91 1<br>
L09.8 102 1<br>
L09.9 103 1<br>
L10.01 104 1<br>
L10.12 115 1<br>
<br>
[...]<br>
<br>
Best<br>
Thibaut<br>
<br>
<br>
--<br>
Dr Thibaut Jombart<br>
Lecturer, Department of Infectious Disease Epidemiology, Imperial College<br>
London<br>
Head of RECON: <a href="http://repidemicsconsortium.org" rel="noreferrer" target="_blank">repidemicsconsortium.org</a><br>
WHO Consultant - outbreak analysis<br>
<a href="http://sites.google.com/site/thibautjombart/" rel="noreferrer" target="_blank">sites.google.com/site/<wbr>thibautjombart/</a><br>
Twitter: @TeebzR<br>
<a href="tel:%2B44%280%2920%207594%203658" value="+442075943658">+44(0)20 7594 3658</a><br>
<br>
On 7 August 2017 at 13:45, Ngoc-Tien Tran <<a href="mailto:Ngoc-Tien.Tran@insa-rennes.fr">Ngoc-Tien.Tran@insa-rennes.fr</a><wbr>><br>
wrote:<br>
<br>
> Hello,<br>
> I'm working with SNP data with a large number of loci.<br>
> I would like to know if there is a tool in the package "adegenet" allowing<br>
> me to select just one smaller set of loci without loosing information.<br>
><br>
> Thanks for your help !<br>
> Best wishes,<br>
> Ngoc-Tien<br>
><br>
><br>
> --<br>
> ------------------------------<wbr>------------------------------<wbr>--<br>
> TRAN Ngoc-Tien<br>
> Département Génie Mathématiques/Analyse de Risque et Optimisation et<br>
> Modélisation.<br>
> INSA-Rennes<br>
> Tel : <a href="tel:%2B33.%280%297.51.57.96.53" value="+33751579653">+33.(0)7.51.57.96.53</a><br>
> @mail : <a href="mailto:ngoc-tien.tran@insa-rennes.fr">ngoc-tien.tran@insa-rennes.fr</a><br>
> ______________________________<wbr>_________________<br>
> adegenet-forum mailing list<br>
> <a href="mailto:adegenet-forum@lists.r-forge.r-project.org">adegenet-forum@lists.r-forge.<wbr>r-project.org</a><br>
> <a href="https://lists.r-forge.r-project.org/cgi-bin/mailman/" rel="noreferrer" target="_blank">https://lists.r-forge.r-<wbr>project.org/cgi-bin/mailman/</a><br>
> listinfo/adegenet-forum<br>
--<br>
------------------------------<wbr>------------------------------<wbr>--<br>
TRAN Ngoc-Tien<br>
Département Génie Mathématiques/Analyse de Risque et Optimisation et Modélisation.<br>
INSA-Rennes<br>
Tel : <a href="tel:%2B33.%280%297.51.57.96.53" value="+33751579653">+33.(0)7.51.57.96.53</a><br>
@mail : <a href="mailto:ngoc-tien.tran@insa-rennes.fr">ngoc-tien.tran@insa-rennes.fr</a><br>
</div></div></blockquote></div><br></div>