[adegenet-forum] Loci selection for SNP data

Thibaut Jombart thibautjombart at gmail.com
Mon Sep 11 14:05:46 CEST 2017


Hello,

there is no tool specifically dedicated to this, but you could run a PCA,
retain axes corresponding to the xxx% of variance, and then keep only the
alleles contributing most to these axes. Here's a quick example:

> data(sim2pop)
> pca1 <- dudi.pca(tab(sim2pop), scannf = FALSE, nf = 3)
> contrib <- pca1$c1^2
> which(contrib > 0.01, TRUE)  # contrib > 1%
       row col
L02.06  21   1
L04.02  47   1
L05.02  60   1
L05.09  67   1
L06.6   74   1
L07.7   84   1
L08.07  91   1
L09.8  102   1
L09.9  103   1
L10.01 104   1
L10.12 115   1

[...]

Best
Thibaut


--
Dr Thibaut Jombart
Lecturer, Department of Infectious Disease Epidemiology, Imperial College
London
Head of RECON: repidemicsconsortium.org
WHO Consultant - outbreak analysis
sites.google.com/site/thibautjombart/
Twitter: @TeebzR
+44(0)20 7594 3658

On 7 August 2017 at 13:45, Ngoc-Tien Tran <Ngoc-Tien.Tran at insa-rennes.fr>
wrote:

> Hello,
> I'm working with SNP data with a large number of loci.
> I would like to know if there is a tool in the package "adegenet" allowing
> me to select just one smaller set of loci without loosing information.
>
> Thanks for your help !
> Best wishes,
> Ngoc-Tien
>
>
> --
> --------------------------------------------------------------
> TRAN Ngoc-Tien
> Département Génie Mathématiques/Analyse de Risque et Optimisation et
> Modélisation.
> INSA-Rennes
> Tel : +33.(0)7.51.57.96.53
> @mail : ngoc-tien.tran at insa-rennes.fr
> _______________________________________________
> adegenet-forum mailing list
> adegenet-forum at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/
> listinfo/adegenet-forum
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/adegenet-forum/attachments/20170911/4bdd9e99/attachment.html>


More information about the adegenet-forum mailing list