[GenABEL-dev] extracting results from OmicABEL runs - something is wrong
Yurii Aulchenko
yurii.aulchenko at gmail.com
Sat Jun 29 22:13:25 CEST 2013
Dear Sodbo, Diego,
It looks like either 'reshuffle' or CLACK-GWAS do something wrong - and I
actually think this is reshuffle (see my tests below). Basically, I run
analysis on 'big' genotype set (gtDataChr1) and a thinned one, where I take
every 100th marker only (gtDataChr1small). When I extract results with
reshuffle, I see a marker with chi2>25 in small, but not in the original
big data set!
You can find the test-files in my public dropbox folder (I am going to send
the link to you separately; if anyone else is interested in testing, let me
know); the commands I used for testing are provided below.
best, and let mw know,
YA
Running analysis:
CLAK-GWAS -cov covData -phi gKinId -pheno phDataChr1 -snp gtDataChr1small
-out BAllSmall -var eigen -nths 4
reshuffle BAllSmall --chi=25
[identifies one SNP-trait pair, rs3738547-GI_7657133.S, with chi2>25]
CLAK-GWAS -cov covData -phi gKinId -pheno phDataChr1 -snp gtDataChr1 -out
BAll -var eigen -nths 4
reshuffle BAll --chi=25
[no SNP-trait pairs identified at chi2>25]
To make sure that actually the data are right, I ran the following tests in
R making sure that the SNP is present in both big and thinned data set
> library(DatABEL)
DatABEL v.0.9-4 (March 12, 2013) loaded
> g <- databel("gtDataChr1")
> gs <- databel("gtDataChr1small")
> p <- databel("phDataChr1")
> snp <- as(g[,"rs3738547"],"vector")
> snps <- as(gs[,"rs3738547"],"vector")
> ph <- as(p[,"GI_7657133.S"],"vector")
> summary(lm(ph~snp))
Call:
lm(formula = ph ~ snp)
Residuals:
Min 1Q Median 3Q Max
-0.33692 -0.04485 0.00333 0.05632 0.13978
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.009116 0.005337 1.708 0.0892 .
snp -0.091529 0.015967 -5.732 3.53e-08 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 0.07357 on 204 degrees of freedom
Multiple R-squared: 0.1387, Adjusted R-squared: 0.1345
F-statistic: 32.86 on 1 and 204 DF, p-value: 3.527e-08
> summary(lm(ph~snps))
Call:
lm(formula = ph ~ snps)
Residuals:
Min 1Q Median 3Q Max
-0.33692 -0.04485 0.00333 0.05632 0.13978
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.009116 0.005337 1.708 0.0892 .
snps -0.091529 0.015967 -5.732 3.53e-08 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 0.07357 on 204 degrees of freedom
Multiple R-squared: 0.1387, Adjusted R-squared: 0.1345
F-statistic: 32.86 on 1 and 204 DF, p-value: 3.527e-08
And this is why I blame this on reshuffle:
yuryaulchenko$ perl extractCell.pl BAll 1029 115001
0.00911574810743332 -0.0915286764502525 0.00573702435940504
0.0171629786491394 -2.74352405540412e-05
YAMacBookPro:analysisOfHapMapeQTL yuryaulchenko$ perl extractCell.pl
BAllSmall 1029 1151
0.00911574810743332 -0.0915286764502525 0.00573702435940504
0.0171629786491394 -2.74352405540412e-05
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/genabel-devel/attachments/20130629/79a3f9ae/attachment.html>
More information about the genabel-devel
mailing list