Dear Thibaut and Adegenet users,<br><br>I have a polyploid dataset coded as binary (PA datatype) containing 297 individuals and 97 'loci' (microsatellite alleles). I've been implementing the find.clusters command, retaining 40 PCA axes to capture >95% of the variance. <br>
<br>The issue is that I get different 'best' solutions for the number of K clusters in different find.clusters runs, with a modal value of 9, but ranging from 6-12. Obviously the actual differences in BIC value are pretty small, but even when I designate a 'cut-off' (e.g. when the BIC value must decrease by at least 2 for the solution to be 'better' than the previous K), there is variation in the solution. <br>
<br>This variability is even higher when I choose fewer PCA axes to retain (e.g. retaining 80% of the variance), as would be expected, but even when I use 100 PCA axes (>>95% of variance), the value varies between 'runs'. <br>
<br>Has anyone else observed this - and do you have any advice?<br><br>Thanks for your help<br><br>Pip <br>