[Progress Communities] [Progress OpenEdge ABL] Forum Post: RE: Issue while reading Chinese character from xml file

Status
Not open for further replies.
M

Matt Baker

Guest
Hi gaursaab, ISO8859-1 is a single-byte character set. en.wikipedia.org/.../IEC_8859-1 UTF-8 is a variable-byte character set en.wikipedia.org/.../UTF-8 There is no way to represent Chinese multi-byte characters in a single-byte code page. You need to change the internal code page for the session to use a multi-byte code page in order to work with the characters in that file properly. Also, be careful not to accidentally store any of those strings in a database that isn't configured with a code page capable of handling them. In an international world, you cannot get away with using ISO 8859-1. It is too limited. Your default selection should be UTF-8. For fuller explanation of why...read on: www.w3.org/.../qa-choosing-encodings The full manifesto: http://utf8everywhere.org/

Continue reading...
 
Status
Not open for further replies.
Top