[Phylobase-devl] characters not being read
Ben Bolker
bolker at ufl.edu
Thu Aug 5 16:00:06 CEST 2010
François Michonneau wrote:
> Hi again Conrad,
>
> In the file you sent me there were no state labels (just
> charstatelabels), so the warning is normal.
>
> For the underscore issue, there isn't much we can do about. NCL
> converts underscores to spaces because there is no way to
> differentiate them in newick strings for instance. Then when the name
> is assigned to the column name in R, the space is converted to a dot.
>
> Cheers,
> -- François
If we wanted to we could (a) convert spaces back to underscores
ourselves after NCL import (b) disable conversion of spaces to dots by
using check.names=FALSE in the data.frame() call.
I don't know whether either of these is useful in general, or if we
just need to 'fix' this in the documentation.
Opinions?
Ben
>
> On Wed, Aug 4, 2010 at 22:31, Conrad Stack <stack at psu.edu> wrote:
>> Looks great. Two things: it looks like the warning message about 'state
>> labels' fires even when labels are found (only when DATATYPE=STANDARD) and,
>> perhaps relatedly, underscores in CHARSTATELABELS are automatically
>> converted to periods.
>> Thanks!
>> -Conrad
>>
>>
>>
>> On Wed, Aug 4, 2010 at 12:53 PM, François Michonneau
>> <francois.michonneau at gmail.com> wrote:
>>> Hi,
>>>
>>> My last commit should fix the issue Conrad mentioned.
>>>
>>> The problem came from the fact that the nexus file didn't have state
>>> labels and the default option in readNexus was to return them. It
>>> ended up attaching empty labels to the the 0 and 1 which resulted in
>>> the empty column observed. I modified readNexus to (1) return a
>>> warning when state labels are missing and return.labels is TRUE which
>>> leads to ignoring the return.labels option.
>>>
>>> Let me know if you find any other bugs.
>>>
>>> Cheers,
>>> -- François
>>>
>>> On Tue, Aug 3, 2010 at 17:42, Conrad Stack <stack at psu.edu> wrote:
>>>> I just upgraded to phylobase 0.6.1 and found that the data in my
>>>> CHARACTERS
>>>> blocks are no longer being read in (they show up as blanks in the
>>>> returned
>>>> phylo4d object).
>>>> this is from the geospiza.nex dataset:
>>>> BEGIN CHARACTERS;
>>>> DIMENSIONS NCHAR=1;
>>>> FORMAT DATATYPE = STANDARD GAP = - MISSING = ? SYMBOLS = " 0 1";
>>>> MATRIX
>>>> fuliginosa 1
>>>> fortis 1
>>>> magnirostris 0
>>>> conirostris 0
>>>> scandens 0
>>>> difficilis 1
>>>> pallida 0
>>>> parvulus 0
>>>> psittacula 0
>>>> pauper 0
>>>> Platyspiza 0
>>>> fusca 0
>>>> Pinaroloxias 0
>>>> ;
>>>> END;
>>>> When it reads in, it shows up like this:
>>>> readNexus("geospiza.nex")
>>>> label node ancestor edge.length node.type standard_char
>>>> 1 fuliginosa 1 22 0.05500 tip
>>>> 2 fortis 2 22 0.05500 tip
>>>> 3 magnirostris 3 21 0.11000 tip
>>>> 4 conirostris 4 20 0.18333 tip
>>>> 5 scandens 5 19 0.19250 tip
>>>> 6 difficilis 6 18 0.22800 tip
>>>> 7 pallida 7 23 0.08667 tip
>>>> 8 parvulus 8 25 0.02000 tip
>>>> 9 psittacula 9 25 0.02000 tip
>>>> 10 pauper 10 24 0.03500 tip
>>>> 11 Platyspiza 11 16 0.46550 tip
>>>> 12 fusca 12 15 0.53409 tip
>>>> 13 Pinaroloxias 13 14 0.58333 tip
>>>> 14 <NA> 14 0 0.29744 root <NA>
>>>> 15 <NA> 15 14 0.04924 internal <NA>
>>>> 16 <NA> 16 15 0.06859 internal <NA>
>>>> .......cont....
>>>> Any thoughts? (I'm using windows 7 32-bit with R 2.11, but the error
>>>> occurs
>>>> on my Mac leopard install as well.
>>>>
>>>>
>>>> --
>>>> Conrad Stack
>>>> -----------------------
>>>> PSU Department of Biology
>>>> 208 Mueller Lab
>>>> University Park, PA 16802
>>>> cell: 814.409.8310
>>>> email: conrad.stack at gmail.com
>>>>
>>>> _______________________________________________
>>>> Phylobase-devl mailing list
>>>> Phylobase-devl at lists.r-forge.r-project.org
>>>>
>>>> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/phylobase-devl
>>>>
>>>>
>>
>>
>> --
>> Conrad Stack
>> -----------------------
>> PSU Department of Biology
>> 208 Mueller Lab
>> University Park, PA 16802
>> cell: 814.409.8310
>> email: conrad.stack at gmail.com
>>
> _______________________________________________
> Phylobase-devl mailing list
> Phylobase-devl at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/phylobase-devl
--
Ben Bolker
*** NEW E-MAIL ADDRESSES:
*** bbolker at gmail.com , bolker at mcmaster.ca
http://www.math.mcmaster.ca/~bolker
GPG key: http://www.math.mcmaster.ca/~bolker/benbolker-publickey.asc
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 261 bytes
Desc: OpenPGP digital signature
URL: <http://lists.r-forge.r-project.org/pipermail/phylobase-devl/attachments/20100805/ecbf6387/attachment.pgp>
More information about the Phylobase-devl
mailing list