<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family: Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<pre style="font-family: Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);"><div style="margin: 0px; font-size: 12pt; font-family: Calibri, Helvetica, sans-serif; background-color: rgb(255, 255, 255); color: rgb(0, 0, 0);">Hi all! </div><div style="margin: 0px; font-size: 12pt; font-family: Calibri, Helvetica, sans-serif; background-color: rgb(255, 255, 255); color: rgb(0, 0, 0);">This is a repost of an earlier inquiry from late last year. Still having the same issue and looking for help...<br><div>I have a data set of 19 microsatellite loci about 1000 individuals. Results from STRUCTURE suggest 4 populations with sample sizes ranging from 130-468 (pairwise Fst range from 0.02-0.06). I wanted to run DAPC on the same data to see what happens. When I do find.clusters the BIC graph is jagged and inconsistent when I re-run it (gives lowest BIC typically for somewhere between 5-10 clusters, and the differences in BIC between them are sometimes very small and sometimes not). Regardless of the number of clusters I choose to continue with, when I do cross validation (n.pca.max=200, training.set=0.9, n.rep = 50) the number of PCs with the lowest RMSE is always very near my n.pca.max (usually 180, so I am not getting a nice arch in my graph). Furthermore, the mean successful assignment rate increases with the number of PCA axes retained, typically reaching in the high 80's or even 90's around 180 PCs. This seems fishy to me, but I am not sure why. I've never before, either prior to xval implementation or using xval with other data sets, ever kept anywhere near the max number of PC's - either because I didn't want to over-fit the data or more recently because of xval recommendation. I guess that is why I am wary to do this now. But am I wrong to distrust the xval results, or is this perhaps an indication that my data isn't powerful enough? <br></div><div><br></div><div>Not surprisingly, there is a noticeable difference is cluster number and individuals assigned to each cluster depending on how many PC's I retain for the DAPC analysis. <br></div><div><br></div><div>Any help/insight is greatly appreciated, thank you!<br></div><span>Nikki</span><br></div></pre>
<br>
</div>
</body>
</html>