<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix"><br>
Sorry again - nope hadn't given UTF-8 any thought.<br>
<br>
Matthew<br>
<br>
On 14/09/13 20:57, Harish wrote:<br>
</div>
<blockquote
cite="mid:1379188675.10910.YahooMailNeo@web120205.mail.ne1.yahoo.com"
type="cite">
<div style="color:#000; background-color:#fff; font-family:times
new roman, new york, times, serif;font-size:12pt">
<div>Does fread() support UTF-8? I got a text file that is
mostly Latin-1 characters but encoded as UTF-8. When I load
the data, the first column name has a few extra characters in
the beginning ("id"), but I do not get this when I convert
the same file to ANSI format using Windows Notepad.</div>
<div><br>
</div>
<div>I am guessing that UTF-8 encoding puts a few extra
characters in the beginning of the text file to indicate that
it is an UTF-8 encoding, and fread() is reading that literally
as the first column name.</div>
<div><br>
</div>
<div>Thanks for the clarification.</div>
<div><br>
</div>
<div>Regards,</div>
<div>Harish</div>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
datatable-help mailing list
<a class="moz-txt-link-abbreviated" href="mailto:datatable-help@lists.r-forge.r-project.org">datatable-help@lists.r-forge.r-project.org</a>
<a class="moz-txt-link-freetext" href="https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help">https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help</a></pre>
</blockquote>
<br>
</body>
</html>