[Seqinr-forum] Retrieve Gene Sequence from Ensembl
Jean lobry
lobry at biomserv.univ-lyon1.fr
Sat Oct 11 19:27:14 CEST 2008
Dear all,
How can i get all the human gene sequence from Ensembl?
How can i get the sequence for a certain gene? e.g. 'ENSG00000141510'
Best Regards,
Rocky
Rocky,
Try this as a starter:
library(seqinr)
choosebank("ensembl")
query("all","T=CDS", virtual = TRUE)
all$nelem # 892572 CDS
query("list1","all AND K=ENSG00000141510")
list1$nelem # 3 sequences because of alternative splicing
myseqs <- getSequence(list1)
myprots <- getTrans(myseqs)
dotPlot(myprots[[1]], myprots[[2]])
See also chapter 6 "importing zlib-compressed sequences" for
an example with 78573 human CDS (not supported on windows).
Best,
Z.
--
Jean R. Lobry (lobry at biomserv.univ-lyon1.fr)
Laboratoire BBE-CNRS-UMR-5558, Univ. C. Bernard - LYON I,
43 Bd 11/11/1918, F-69622 VILLEURBANNE CEDEX, FRANCE
allo : +33 472 43 27 56 fax : +33 472 43 13 88
http://pbil.univ-lyon1.fr/members/lobry/
More information about the Seqinr-forum
mailing list