[adegenet-forum] Analysing mixed ploidy datasets
Zhian Kamvar
zkamvar at gmail.com
Mon Feb 12 22:10:16 CET 2018
Hi Jeremy,
> I'm wondering if this the right way (statistically speaking) to analyse mixed ploidy datasets in R? Will my estimates of genetic diversity and structure be accurate?
I think the answer is... it depends. Meirmans, Liu, and Tienderen just came out with a review on this topic: https://doi.org/10.1093/jhered/esy006 <https://doi.org/10.1093/jhered/esy006>*. For multivariate analyses (sPCA, PCA, DAPC, etc), you will want to reduce the effect of polyploidy by converting your data to allele frequencies with:
tab(myData, freq = TRUE)
If you have an organism that changes ploidy by life stage, you may need to analyze them separately. Moreover, if you use Bruvo's distance, the choice of model is very important.
> I know there is the POLYSAT package, which seems to have been developed specifically for dealing with mixed ploidy datasets, however, I would rather stick to using adagenet and poppr, as I'm familiar with the functions and the structure of genind and genclone objects.
I know conversion is a PITA, but I would highly recommend using POLYSAT. There are far more tools to deal with polyploidy in that package that can complement any analyses you would perform with poppr or adegenet. A few years ago, I wrote a tiny function to help with conversion from genind to polysat data:
https://gist.github.com/zkamvar/aeaff83b9d126d55aade
In fact, Clarke and Schreier came out with a paper a year ago talking about how to resolve ambiguous ploidy, and it's only available in polysat: http://onlinelibrary.wiley.com/doi/10.1111/1755-0998.12639/abstract
Hope that helps,
Zhian
* The last figure in this paper will change slightly since they used an old version of poppr to calculate Bruvo's distance, which had a silent bug.
-----
Zhian N. Kamvar, Ph. D.
Postdoctoral Researcher (Everhart Lab)
Department of Plant Pathology
University of Nebraska-Lincoln
ORCID: 0000-0003-1458-7108
>
> Date: Fri, 9 Feb 2018 00:48:39 +0000
> From: JEREMY SAMUEL BENWELL-CLARKE <17197751 at students.latrobe.edu.au>
> To: "adegenet-forum at lists.r-forge.r-project.org"
> <adegenet-forum at lists.r-forge.r-project.org>
> Subject: [adegenet-forum] Analysing mixed ploidy datasets
> Message-ID:
> <SY3PR01MB15646BCB509043E46D83877FC7F20 at SY3PR01MB1564.ausprd01.prod.outlook.com>
>
> Content-Type: text/plain; charset="us-ascii"
>
> Hi everyone,
>
> I'm trying to analyse a mixed ploidy dataset. The majority of my samples are diploid but there are a few triploids too. To make ploidy even in my raw data matrix I added zeros to all my diploid samples, which makes them triploids. I then use the read.genalex function from the poppr package to read in my data setting ploidy=3. However, I don't want '0' to be recognised as an extra allele and I want the true diploid samples to be separate from the true triploid samples. Therefore, I use the recode_polyploids function from poppr and set newploidy=T. Here is my code:
>
> genclone<-read.genalex("C:/Users/...", ploidy = 3)
> genclone<-recode_polyploids(genclone, newploidy = T)
>
>
> I'm wondering if this the right way (statistically speaking) to analyse mixed ploidy datasets in R? Will my estimates of genetic diversity and structure be accurate?
> I know there is the POLYSAT package, which seems to have been developed specifically for dealing with mixed ploidy datasets, however, I would rather stick to using adagenet and poppr, as I'm familiar with the functions and the structure of genind and genclone objects.
>
> Any help would be much appreciated!
>
> Cheers,
>
> Jeremy
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <http://lists.r-forge.r-project.org/pipermail/adegenet-forum/attachments/20180209/7c415c08/attachment-0001.html>
>
> ------------------------------
>
> _______________________________________________
> adegenet-forum mailing list
> adegenet-forum at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/adegenet-forum
>
> End of adegenet-forum Digest, Vol 114, Issue 7
> **********************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/adegenet-forum/attachments/20180212/5a1b729c/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: Message signed with OpenPGP
URL: <http://lists.r-forge.r-project.org/pipermail/adegenet-forum/attachments/20180212/5a1b729c/attachment.sig>
More information about the adegenet-forum
mailing list