[Seqinr-forum] HELP for using R for codon usage data analysis
Kinshuk Nayak
kinshuk_nayak at yahoo.co.in
Tue Mar 13 10:14:28 CET 2012
Sir,
I am a student in bioinformaics.I am interested to use R 2.14.1 for codon usage analysis like example given at http://pbil.univ-lyon1.fr/datasets/charif04/. Here the example is customised for three genomes.I would like to run it for my sequence data/ genome. I have filtered set of sequences having length >300bp and stored in fasta format(mygenome1.fasta) in the folder in which R2.14.1 is installed. I have edited R script as follows.But it is showing error message during running the script.Could you please check and correct it.
SCRIPT
library(ade4)
library(seqinr)
sessionInfo()
mygenome1 <-read.fasta("mygenome1.fasta",seqonly=T)
seqmygenome1 <- lapply(mygenome1$req, getSequence)
####################################################################
#
# From sequences, build dataframes with codon usage for each sequence.
#
####################################################################
mkdata <- function(seqs)
{
tab <- sapply(seqs, uco)
tab <- as.data.frame(tab)
return( tab )
}
tabmygenome1 <- mkdata(seqmygenome1)
####################################################################
#
# Run correspondence analysis on merged dataset:
#
####################################################################
tab <- cbind(tabmygenome1)
names(tab) <- 1:ncol(tab)
coa <- dudi.coa(tab, scan = FALSE, nf = 2)
####################################################################
#
# Run synonymous codon usage analysis:
#
####################################################################
facaa <- as.factor(aaa(translate(sapply(rownames(tab),s2c))))
scua <- wca(coa, facaa, scan = FALSE, nf = 2)
####################################################################
#
# Plot first factorial map:
#
####################################################################
facsp <- as.factor(rep(c("mygenome1"),
c(ncol(tabmygenome1)))
s.class(scua$co, fac = facsp, cstar = 0, label = "",
col = c("green"), cell=0, cpoint=0.8,
sub="First factorial map for synonymous codon usage in Tri-Tryp")
s.label(scua$li, add.plot = TRUE, clab = 0.75)
legend( x = 0.1, y = -0.4, pch = 19, col = c("green"),
legend = c(expression(italic("mygenome1")),
xjust = 0, cex = 0.8)
####################################################################
#
# END
#
####################################################################
AFTER RUNNING
> library(ade4)
Attaching package: ‘ade4’
The following object(s) are masked from ‘package:base’:
within
> library(seqinr)
> sessionInfo()
R version 2.14.1 (2011-12-22)
Platform: i386-pc-mingw32/i386 (32-bit)
locale:
[1] LC_COLLATE=English_United States.1252
[2] LC_CTYPE=English_United States.1252
[3] LC_MONETARY=English_United States.1252
[4] LC_NUMERIC=C
[5] LC_TIME=English_United States.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] seqinr_3.0-6 ade4_1.4-17
> mygenome1 <-read.fasta("mygenome1.fasta",seqonly=T)
Error in file(con, "r") : cannot open the connection
In addition: Warning message:
In file(con, "r") :
cannot open file 'mygenome1.fasta': No such file or directory
Thanking you
Kinshuk Chandra Nayak
Lab. Tech & In-Charge IPR Cell
Institute of Life Sciences
Department of Biotechnology,Govt India
Bhubaneswar-751023,
India
Ph-0674-2300137,2301460,2301476,2301219 (EXTN:279)
Mail:
kinshuk at ils.res.in
kinshukils at hotmail.com
kinshuk_nayak at yahoo.co.in
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/seqinr-forum/attachments/20120313/0d8bf498/attachment.html>
More information about the Seqinr-forum
mailing list