[datatable-help] fread() and UTF-8 support

Matthew Dowle mdowle at mdowle.plus.com
Sat Sep 14 22:33:42 CEST 2013


Sorry again - nope hadn't given UTF-8 any thought.

Matthew

On 14/09/13 20:57, Harish wrote:
> Does fread() support UTF-8?  I got a text file that is mostly Latin-1 
> characters but encoded as UTF-8.  When I load the data, the first 
> column name has a few extra characters in the beginning ("id"), but 
> I do not get this when I convert the same file to ANSI format using 
> Windows Notepad.
>
> I am guessing that UTF-8 encoding puts a few extra characters in the 
> beginning of the text file to indicate that it is an UTF-8 encoding, 
> and fread() is reading that literally as the first column name.
>
> Thanks for the clarification.
>
> Regards,
> Harish
>
>
> _______________________________________________
> datatable-help mailing list
> datatable-help at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/datatable-help/attachments/20130914/30a0a15b/attachment.html>


More information about the datatable-help mailing list