[Seqinr-forum] Retrieve Gene Sequence from Ensembl

Jean lobry lobry at biomserv.univ-lyon1.fr
Sat Oct 11 19:27:14 CEST 2008



Dear all,

How can i get all the human gene sequence from Ensembl?

How can i get the sequence for a certain gene? e.g. 'ENSG00000141510'

Best Regards,

Rocky


Rocky,

Try this as a starter:

library(seqinr)
choosebank("ensembl")
query("all","T=CDS", virtual = TRUE)
all$nelem # 892572 CDS
query("list1","all AND K=ENSG00000141510")
list1$nelem # 3 sequences because of alternative splicing
myseqs <- getSequence(list1)
myprots <- getTrans(myseqs)
dotPlot(myprots[[1]], myprots[[2]])

See also chapter 6 "importing zlib-compressed sequences" for
an example with 78573 human CDS (not supported on windows).

Best,

Z.

-- 
Jean R. Lobry            (lobry at biomserv.univ-lyon1.fr)
Laboratoire BBE-CNRS-UMR-5558, Univ. C. Bernard - LYON I,
43 Bd 11/11/1918, F-69622 VILLEURBANNE CEDEX, FRANCE
allo  : +33 472 43 27 56     fax    : +33 472 43 13 88
http://pbil.univ-lyon1.fr/members/lobry/



More information about the Seqinr-forum mailing list