[Progress Communities] [Progress OpenEdge ABL] Forum Post: RE: CODEPAGE question

Status
Not open for further replies.
S

slacroixak

Guest
I am a little bit confused with the chosen sample 'A' as it is actually encoded with one single byte in UTF-8, like all characters in the ASCII set (below 128). The 8 f UTF-8 means it can go down to 8 bits. In UTF-8, extended characters (those above 127 in single byte encodings) are encoded with 2, 3 or 4 bytes (so 16 to 32 bits) => en.wikipedia.org/.../UTF-8 Said differently, strings made with only ASCII chars (below 128) should be encoded the same in all single byte codepages as well as in UTF-8 There should be differences only if extended characters are involved (like letter with accents, etc...)

Continue reading...
 
Status
Not open for further replies.
Top