[Gtdb-commits] r32 - in pkg/gt.db: . man
noreply at r-forge.r-project.org
noreply at r-forge.r-project.org
Wed Feb 17 20:02:32 CET 2010
Author: dahinds
Date: 2010-02-17 20:02:32 +0100 (Wed, 17 Feb 2010)
New Revision: 32
Added:
pkg/gt.db/man/load.affy.chp.data.Rd
pkg/gt.db/man/read.affy.anno.Rd
Modified:
pkg/gt.db/INDEX
Log:
Added help pages for read.affy.anno(), load.affy.chp.data().
Modified: pkg/gt.db/INDEX
===================================================================
--- pkg/gt.db/INDEX 2010-02-17 02:12:37 UTC (rev 31)
+++ pkg/gt.db/INDEX 2010-02-17 19:02:32 UTC (rev 32)
@@ -47,6 +47,7 @@
ld.gt.data Compute Pairwise Linkage Disequilibrium
ld.plot Pairwise Linkage Disequilibrium Plot
ld.prune Prune SNP List to Limit Linkage Disequilibrium
+load.affy.chp.data Import Affymetrix CHP Genotype Data
ls.assay List Assay Definitions
ls.assay.position List Assay Positions
ls.dataset List Genotype Datasets
@@ -92,6 +93,7 @@
qqprcomp Quantile-Quantile Plots of PCA Loadings
qqpval Quantile-Quantile Plots for P Values
qqthin Sparse Normal Quantile-Quantile Plots
+read.affy.anno Import Affymetrix NetAffx Annotation Data
reshape.gt.data Reshape Genotype Data
revcomp Reverse Complement DNA Sequences
score.and.store Test SNPs for Association and Store Results
Added: pkg/gt.db/man/load.affy.chp.data.Rd
===================================================================
--- pkg/gt.db/man/load.affy.chp.data.Rd (rev 0)
+++ pkg/gt.db/man/load.affy.chp.data.Rd 2010-02-17 19:02:32 UTC (rev 32)
@@ -0,0 +1,63 @@
+%
+% Copyright (C) 2010, 23andMe, Inc.
+%
+% Written by David A. Hinds <dhinds at sonic.net>
+%
+% This is free software; you can redistribute it and/or modify it
+% under the terms of the GNU General Public License as published by
+% the Free Software Foundation; either version 3 of the license, or
+% (at your option) any later version.
+%
+% This program is distributed in the hope that it will be useful,
+% but WITHOUT ANY WARRANTY; without even the implied warranty of
+% MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
+% GNU General Public License for more details.
+%
+% You should have received a copy of the GNU General Public License
+% along with this program. If not, see <http://www.gnu.org/licenses/>
+%
+\name{load.affy.chp.data}
+\alias{load.affy.chp.data}
+\title{Import Affymetrix CHP Genotype Data}
+\description{
+ Import Affymetrix CHP genotype data files into the database, including
+ genotypes, quality scores, and underlying measurements.
+}
+\usage{
+load.affy.chp.data(dataset, anno, files, progress=TRUE)
+}
+\arguments{
+ \item{dataset}{the unique identifier of the dataset to receive the
+ genotype data.}
+ \item{anno}{an annotation data frame from \code{\link{load.affy.chp.data}}.}
+ \item{files}{a vector of all the text CHP files to be imported.}
+ \item{progress}{logical: indicates whether to report progress during
+ the database load.}
+}
+\details{
+ The input files should be tab-delimited text versions of CHP files as
+ created by \code{apt-chp-to-txt} in the Affymetrix Power Tools.
+
+ Loading CHP data is a two stage process. In the first stage, we
+ reorganize the input data into new files that represent all samples
+ across smaller sets of SNPs. In the second stage, we load data from
+ these files into the database. The two stages effectively allow us to
+ perform an \dQuote{out-of-core} transpose of the genotype matrix,
+ because we typically will not be able to hold an entire genotype
+ dataset for a large study in working memory. The temporary files
+ require about 20\% of the space of the original text CHP files. They
+ are created in the current directory and deleted at the end of the
+ import.
+
+ When creating a dataset to receive CHP data, be sure to specify
+ \code{raw.format='chpdata'}.
+}
+\value{
+ The number of assays for which data was loaded.
+}
+\seealso{
+ \code{\link{read.affy.anno}},
+ \code{\link{mk.dataset}},
+ \code{\link{mk.assay.data}}.
+}
+\keyword{IO}
Added: pkg/gt.db/man/read.affy.anno.Rd
===================================================================
--- pkg/gt.db/man/read.affy.anno.Rd (rev 0)
+++ pkg/gt.db/man/read.affy.anno.Rd 2010-02-17 19:02:32 UTC (rev 32)
@@ -0,0 +1,69 @@
+%
+% Copyright (C) 2010, 23andMe, Inc.
+%
+% Written by David A. Hinds <dhinds at sonic.net>
+%
+% This is free software; you can redistribute it and/or modify it
+% under the terms of the GNU General Public License as published by
+% the Free Software Foundation; either version 3 of the license, or
+% (at your option) any later version.
+%
+% This program is distributed in the hope that it will be useful,
+% but WITHOUT ANY WARRANTY; without even the implied warranty of
+% MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
+% GNU General Public License for more details.
+%
+% You should have received a copy of the GNU General Public License
+% along with this program. If not, see <http://www.gnu.org/licenses/>
+%
+\name{read.affy.anno}
+\alias{read.affy.anno}
+\alias{load.affy.platform}
+\alias{load.affy.mapping}
+\title{Import Affymetrix NetAffx Annotation Data}
+\description{
+ The \code{read.affy.anno} function reads the contents of an Affymetrix
+ annotation CSV file into a data frame; \code{load.affy.platform} loads
+ assay definitions (flanks, alleles) into the database; and
+ \code{load.affy.mapping} loads assay positions and dbSNP information.
+}
+\usage{
+read.affy.anno(file)
+load.affy.platform(anno, description, progress=TRUE)
+load.affy.mapping(anno, progress=TRUE)
+}
+\arguments{
+ \item{file}{the file name of the annotation file.}
+ \item{anno}{an annotation data frame from \code{read.affy.anno}.}
+ \item{description}{a free-text description of the platform.}
+ \item{progress}{logical: indicates whether to report progress during
+ the database load.}
+}
+\details{
+ \code{load.affy.platform} creates and populates a new platform, using
+ the chip name from the annotation file header as the platform name.
+
+ \code{load.affy.mapping} creates and populates a new mapping, using
+ the version of the NetAffx release as the mapping name.
+}
+\value{
+ For \code{read.affy.anno}, a data frame representing a subset of
+ annotations suitable for importing into GT.DB, with key=value
+ attributes from the file header stored as attributes of the result.
+
+ For \code{load.affy.platform} and \code{load.affy.mapping}, the result
+ is the number of assays processed.
+}
+\seealso{
+ \code{\link{load.affy.chp.data}},
+ \code{\link{mk.platform}}, \code{\link{mk.assay}},
+ \code{\link{mk.mapping}}, \code{\link{mk.assay.position}}.
+}
+\examples{
+\dontrun{
+anno <- read.affy.anno('Axiom_GW_Hu_SNP.r2.na30.annot.csv')
+load.affy.platform(anno, 'Affymetrix Axiom 600K')
+load.affy.mapping(anno)
+}
+}
+\keyword{IO}
More information about the Gtdb-commits
mailing list