M
Matt Baker
Guest
Hi gaursaab, ISO8859-1 is a single-byte character set. en.wikipedia.org/.../IEC_8859-1 UTF-8 is a variable-byte character set en.wikipedia.org/.../UTF-8 There is no way to represent Chinese multi-byte characters in a single-byte code page. You need to change the internal code page for the session to use a multi-byte code page in order to work with the characters in that file properly. Also, be careful not to accidentally store any of those strings in a database that isn't configured with a code page capable of handling them. In an international world, you cannot get away with using ISO 8859-1. It is too limited. Your default selection should be UTF-8. For fuller explanation of why...read on: www.w3.org/.../qa-choosing-encodings The full manifesto: http://utf8everywhere.org/
Continue reading...
Continue reading...