<div dir="ltr">Hi Andrea, <br><br>I'm afraid that without seeing the exact code you used to generate the results you have presented, it is a bit difficult to say for certain what the origin of your problem is. So please forgive me if the following suggestion misses the mark. (If so, can I ask you to reply with the functions and arguments you used to generate that output?)<div>

<br>I notice you've stated that your original data file is formatted using a "/" separator. One way of getting the df2genind output format you are experiencing is by neglecting to inform the df2genind function that you are using that separator. If you have not done so already, try adding the argument sep="/" to the list of arguments taken by df2genind. Let me know if that does the trick. If not, please post back with the code you are using and we can go from there. <br>

<br>Best, </div><div>Caitlin. </div></div><div class="gmail_extra"><br><br><div class="gmail_quote">On Tue, Jun 17, 2014 at 10:12 AM, Andrea Garavito <span dir="ltr"><<a href="mailto:neagef@gmail.com" target="_blank">neagef@gmail.com</a>></span> wrote:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div>Hi everybody!<br><br>I'm currently trying to do a PCA analysis using a SNP matrix from a diploid organism, most of them are bi-allelic.<br>

</div><div>Although the results that I obtain are logic in terms of previous knowledge of the groups, I'm confused with the genind object that I obtain, and I want to be sure about what's going on with the analysis.  <br>


</div><div>My data file is formatted using the nucleotides as alleles and a "/" separator, and missing data coded as "NA".<br></div>ind    mk1    mk2     <br>ind1  G/A    C/T       <br>ind2  G/G    C/T        <div>


After loading my data matrix with the df2genid function my data is stored as a matrix with for times the number of columns of the original file :<br><br>ind    mk1.A    mk1.G    mk1.A    mk1.G   mk2.C    mk2.T    mk2.C    mk2.T<br>


ind1    0.5           0.0         0            0.5         0.0         0.5         0.5         0<br></div><div>ind2    0.0           0.5         0            0.5         0.0         0.5         0.5         0<br><br></div>


<div>Is that correct? I thought I would get two columns per marker loci instead of 4.<br>From there I obtain doubled statistics for each one of the alleles. Since I don't know the phase, an A/G is the same as a G/A, so how can I have the unified stats for each allele? <br>


<br></div><div>Thank you for your answer<br><br></div><div>Best regards<span class="HOEnZb"><font color="#888888"><br></font></span></div><span class="HOEnZb"><font color="#888888"><div>Andrea<br></div></font></span></div>


<br>_______________________________________________<br>

adegenet-forum mailing list<br>

<a href="mailto:adegenet-forum@lists.r-forge.r-project.org">adegenet-forum@lists.r-forge.r-project.org</a><br>

<a href="https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/adegenet-forum" target="_blank">https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/adegenet-forum</a><br></blockquote></div><br></div>