<div dir="ltr">My data are structured as follows<div>C1              C2                     C3              C4                etc.</div><div><br></div><div>Population   Individual           Locus1        Locus2         etc. </div><div>PopA           C0001               0205           1326             etc.</div><div>PopA           C0002</div><div>etc</div><div>etc</div><div>etc</div><div>PopB           X0001</div><div>PopB           X0002</div><div>etc.</div><div>.</div><div>.</div><div>.</div><div>Missing data coded as NA</div><div><br></div><div>Issued the following commands</div><div><br></div><div>scotdata <- read.table("Scotland_adegenet_no_river_names.txt", header=TRUE, sep="\t", quote="\"", colClasses="character", stringsAsFactors=FALSE)</div><div><br></div><div>scotind <- df2genind(scotdata[,-(1:2)], sep="", ind.names=scotdata$Individual, pop=scotdata$Population, missing="NA", ploidy=2, type="codom", ncode=4)</div><div><br></div><div><br></div><div>This converted it to a genind format and issuing 'scotind' gave the correct number of loci, individuals, etc. </div><div><br></div><div>However, x <- summary(scotind) gave exactly 10 alleles per locus and massive differences between Hexp and Hobs. I have since made a modified genepop file which reads in correctly and gives the correct number of alleles, Hexp, Hobs, etc. (however see my other post for DAPC issues!). So I can go with this for now, but routinely would find the dataframe format above easier to work with given large datasets. </div><div><br></div><div>Cheers,</div><div>Mark</div></div>