Riverside Software
Guest
1) The ASC function doesn't work on whole strings, only on one character at a time, so you have to loop over the string with something like:

    DEFINE VARIABLE k AS INTEGER NO-UNDO.
    DO k = 1 TO LENGTH('国王豪华房', 'CHARACTER'):
        MESSAGE ASC(SUBSTRING('国王豪华房', k, 1, 'CHARACTER')).
    END.

ASC gives you the integer representation of the character in the internal codepage (-cpinternal), so you get a different value in UTF-8, UCS-2 and Big-5, for example. Using UTF-8, the loop gives: 15047613 15175307 15249834 15043982 15108287

The second value, 15175307, is 0xE78E8B, which is the UTF-8 encoding of code point U+738B. That code point has different representations in other charsets: www.fileformat.info/.../charset_support.htm

2) and 3) There are lots of Unicode code points for CJK; see en.wikipedia.org/.../Plane_(Unicode) and search for CJK. But if you're looking for a way to verify that a text is written in Chinese, I'm not sure that checking every single character is a good solution.
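If you still want to compare the ASC values against Unicode ranges, the UTF-8 integers from point 1 can be turned back into code points with plain integer arithmetic. The following is only an untested sketch. It assumes the session runs with -cpinternal UTF-8 and that the source file is saved as UTF-8, and it only handles 3-byte sequences. Utf8AscToCodePoint and IsBasicCjk are hypothetical names, not built-ins, and the range tested is just the basic CJK Unified Ideographs block (U+4E00 to U+9FFF), not every CJK block.

```abl
/* Hypothetical helper: converts the integer returned by ASC under
   -cpinternal UTF-8 into a Unicode code point. Handles 3-byte
   sequences only; returns ? for anything else. */
FUNCTION Utf8AscToCodePoint RETURNS INTEGER (INPUT piAsc AS INTEGER):
    DEFINE VARIABLE iByte1 AS INTEGER NO-UNDO.
    DEFINE VARIABLE iByte2 AS INTEGER NO-UNDO.
    DEFINE VARIABLE iByte3 AS INTEGER NO-UNDO.

    /* Split the packed value into its three bytes */
    ASSIGN
        iByte1 = INTEGER(TRUNCATE(piAsc / 65536, 0))
        iByte2 = INTEGER(TRUNCATE(piAsc / 256, 0)) MODULO 256
        iByte3 = piAsc MODULO 256.

    /* Lead byte of a 3-byte sequence is 0xE0..0xEF (224..239) */
    IF iByte1 < 224 OR iByte1 > 239 THEN RETURN ?.

    /* Keep 4 payload bits of the lead byte, 6 of each continuation byte */
    RETURN (iByte1 MODULO 16) * 4096
         + (iByte2 MODULO 64) * 64
         + (iByte3 MODULO 64).
END FUNCTION.

/* Hypothetical helper: TRUE if the code point is in the basic
   CJK Unified Ideographs block, U+4E00 (19968) to U+9FFF (40959). */
FUNCTION IsBasicCjk RETURNS LOGICAL (INPUT piCodePoint AS INTEGER):
    IF piCodePoint = ? THEN RETURN FALSE.
    RETURN piCodePoint >= 19968 AND piCodePoint <= 40959.
END FUNCTION.

DEFINE VARIABLE cText AS CHARACTER NO-UNDO INITIAL '国王豪华房'.
DEFINE VARIABLE k     AS INTEGER   NO-UNDO.
DEFINE VARIABLE iCp   AS INTEGER   NO-UNDO.

DO k = 1 TO LENGTH(cText, 'CHARACTER'):
    iCp = Utf8AscToCodePoint(ASC(SUBSTRING(cText, k, 1, 'CHARACTER'))).
    MESSAGE iCp IsBasicCjk(iCp).
END.
```

For the second character, 15175307 splits into bytes 231, 142 and 139, which gives 7 * 4096 + 14 * 64 + 11 = 29579, that is U+738B. Keep in mind that this block is shared by Chinese, Japanese and Korean text, so a per-character range test cannot tell you that a text is Chinese.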