[datatable-help] fread() and UTF-8 support

MICHELE DE MEO micheledemeo at gmail.com
Sat Sep 14 22:47:26 CEST 2013


I think it would be very useful to be able to specify the encoding, as you
can when you use the function 'file' with read.table.
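
A minimal sketch of what that looks like with read.table today, assuming
R 3.0.0 or later (where file() accepts "UTF-8-BOM" for reading). The
temporary file and its column names are made up for illustration; this shows
the read.table route only, not anything fread() does.

```r
# Write a small CSV that starts with the UTF-8 byte order mark (EF BB BF).
# The file contents are hypothetical, for illustration only.
path <- tempfile(fileext = ".csv")
con <- file(path, open = "wb")
writeBin(as.raw(c(0xEF, 0xBB, 0xBF)), con)
writeBin(charToRaw("id,name\n1,a\n2,b\n"), con)
close(con)

# Pass read.table a connection that declares the encoding.
# "UTF-8-BOM" makes the connection drop a leading BOM if one is present,
# so the BOM bytes do not end up in the first column name.
df <- read.table(file(path, encoding = "UTF-8-BOM"),
                 header = TRUE, sep = ",", stringsAsFactors = FALSE)
print(names(df))

unlink(path)
```

read.table opens and closes the connection itself here; passing
fileEncoding = "UTF-8-BOM" directly to read.table should be equivalent.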

Michele
On 14 Sep 2013 22:33, "Matthew Dowle" <mdowle at mdowle.plus.com> wrote:

>
> Sorry again - nope hadn't given UTF-8 any thought.
>
> Matthew
>
> On 14/09/13 20:57, Harish wrote:
>
>  Does fread() support UTF-8?  I got a text file that is mostly Latin-1
> characters but encoded as UTF-8.  When I load the data, the first column
> name has a few extra characters in the beginning ("id"), but I do not
> get this when I convert the same file to ANSI format using Windows Notepad.
>
>  I am guessing that UTF-8 encoding puts a few extra characters at the
> beginning of the text file to indicate that it is a UTF-8 encoding, and
> fread() is reading them literally as part of the first column name.
>
>  Thanks for the clarification.
>
>  Regards,
> Harish
>
>
> _______________________________________________
> datatable-help mailing list
> datatable-help at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help
>
>