[Seqinr-forum] Error in oriloc for many sequences

Jean Lobry jean.lobry at univ-lyon1.fr
Tue Jul 25 08:31:51 CEST 2017


Dear Matthew,

is there something after the ORIGIN qualifier in your
genbank file? Try:

 > tail(readLines("U00096.gb"))
[1] "                     /db_xref=\"ASAP:ABE-0014442\""
[2] "                     /db_xref=\"UniProtKB/Swiss-Prot:P37005\""
[3] "                     /db_xref=\"EcoGene:EG12309\""
[4] "ORIGIN      "
[5] "//"
[6] ""

If you have the same result it means that your genbank file
contains only the features and not the sequence itself.

Best,

JLO

Le 25/07/2017 à 03:07, Matthew Mark Parks a écrit :
> Indeed, this stems from incorrect reading of the fasta file of the 
> Genbank record (GBK).  "lseq" has value e.g. 2 somehow, when it should 
> be ~ 4.5 Million.
> 
> ------------------------------------------------------------------------
> *From:* seqinr-forum-bounces at lists.r-forge.r-project.org 
> <seqinr-forum-bounces at lists.r-forge.r-project.org> on behalf of Matthew 
> Mark Parks <map2085 at med.cornell.edu>
> *Sent:* Monday, July 24, 2017 8:01:40 PM
> *To:* seqinr-forum at lists.r-forge.r-project.org
> *Subject:* [Seqinr-forum] Error in oriloc for many sequences
> 
> Dear SeqinR developers,
> 
> 
> The function "oriloc" returns the following error:
> 
> 
>     ERROR : wrong sign in 'by' argument
> 
> 
> For many records from Genbank, even for very common, well documented 
> organisms.
> 
> 
> For example:
> 
>     library(seqinr)
> 
>     oriloc(gbk = "U00096.gb" )
> 
>     ERROR : wrong sign in 'by' argument
> 
> This occurs for many other Genbank records as well, including: 
> CP007592.1, AMKD01000101.1, JXMX01000001.1
> 
> 
> Thank you,
> 
> Matthew
> 
> 
> 
> 
> 
> _______________________________________________
> Seqinr-forum mailing list
> Seqinr-forum at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/seqinr-forum
> 



More information about the Seqinr-forum mailing list