[datatable-help] fread() and UTF-8 support

Matthew Dowle mdowle at mdowle.plus.com
Sun Sep 15 10:34:29 CEST 2013


OK, can you file this as a feature request, please?  Thanks.

Matthew

On 14/09/13 21:47, MICHELE DE MEO wrote:
>
> I think the ability to specify the encoding would be very useful, 
> as you can when using the function 'file' with read.table.
>
> Michele
>
> On 14 Sep 2013 22:33, "Matthew Dowle" <mdowle at mdowle.plus.com 
> <mailto:mdowle at mdowle.plus.com>> wrote:
>
>
>     Sorry again - nope hadn't given UTF-8 any thought.
>
>     Matthew
>
>     On 14/09/13 20:57, Harish wrote:
>>     Does fread() support UTF-8?  I have a text file that contains
>>     mostly Latin-1 characters but is encoded as UTF-8. When I load
>>     the data, the first column name has a few extra characters at
>>     the beginning ("id"), but I do not get this when I convert the
>>     same file to ANSI format using Windows Notepad.
>>
>>     I am guessing that UTF-8 encoding puts a few extra characters at
>>     the beginning of the text file to indicate that it is a UTF-8
>>     encoding, and fread() is reading them literally as part of the
>>     first column name.
>>
>>     Thanks for the clarification.
>>
>>     Regards,
>>     Harish
>>
>>
>>     _______________________________________________
>>     datatable-help mailing list
>>     datatable-help at lists.r-forge.r-project.org  <mailto:datatable-help at lists.r-forge.r-project.org>
>>     https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help
>
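
Harish's guess can be checked with a small base R sketch. A UTF-8 file saved by Windows Notepad typically starts with the three-byte byte-order mark (BOM) EF BB BF, and a reader that does not skip those bytes will treat them as part of the first column name. The file contents and variable names below are made up for illustration. The "UTF-8-BOM" encoding name is accepted by file() in R 3.0.0 and later, which is the route Michele mentions with read.table. This is only a workaround sketch under those assumptions, not a statement of what fread() itself does.

```r
# Hypothetical test file: a UTF-8 CSV that starts with a BOM (EF BB BF),
# similar to what Windows Notepad writes when saving as UTF-8.
path <- tempfile(fileext = ".csv")
con <- file(path, "wb")
writeBin(as.raw(c(0xEF, 0xBB, 0xBF)), con)   # the byte-order mark
writeBin(charToRaw("id,name\n1,a\n"), con)   # the actual data
close(con)

# Look at the first three bytes to confirm the BOM is there.
bom <- readBin(path, what = "raw", n = 3)
print(bom)

# Base R route: open the file through file() with encoding "UTF-8-BOM"
# so the BOM is dropped before read.table parses the header.
df <- read.table(file(path, encoding = "UTF-8-BOM"),
                 header = TRUE, sep = ",")
print(names(df))
```

If the first column name still carries stray leading characters after reading a file some other way, checking the first three bytes as above is a quick way to tell whether a BOM is the cause.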
