Purpose: This page is a PC utility for finding the Unicode hex codes, and the equivalent decimal ampersand codes (HTML numeric character references such as &#20140;), of non-ASCII (non-Roman or accented) characters. (That works for a lot more than Chinese, but my interest is Chinese, so that is where it has been tested.)
Instructions: From any source, paste one or more characters into the top box, then click "Process." Hex and decimal equivalents will appear for all characters (except spaces). The last box can be used to copy a value and paste it directly into an HTML editor. This page does NOT correctly process standard 7-bit ASCII characters (AaBbCc &c.), which can be used directly and do not require conversion to ampersand codes.
(Spaces between characters generate "NaN" errors in the Decimal line. I can't figure that out, but I don't use it for anything, so I have abandoned the effort.)
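For readers who want to check a value by hand, the conversion described above can be sketched in a few lines of Python. This is only an illustration under my own assumptions, not the script behind this page: the function name char_refs and the four-digit hex padding are made up for the example, and spaces are simply skipped so that they never reach the decimal step.

```python
def char_refs(text):
    """Return (character, hex code, decimal ampersand code) for each
    non-space character in text.

    Hypothetical helper for illustration; not the code used by this page.
    """
    rows = []
    for ch in text:
        if ch.isspace():
            # Skip spaces entirely rather than trying to convert them.
            continue
        code_point = ord(ch)                 # Unicode code point as an integer
        hex_code = f"{code_point:04X}"       # e.g. 4EAC (padded to 4 digits)
        decimal_ref = f"&#{code_point};"     # e.g. &#20140;
        rows.append((ch, hex_code, decimal_ref))
    return rows


if __name__ == "__main__":
    for ch, hex_code, decimal_ref in char_refs("京 ê Я"):
        print(ch, hex_code, decimal_ref)
```

With the input "京 ê Я", the sketch reports 4EAC and &#20140; for 京, 00EA and &#234; for ê, and 042F and &#1071; for Я.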
Here are some sample characters to play with:
Extended Roman: ê ü â Ì Ÿ ¼ ¾ ¿ ¡
Chinese: 京 仅 尽 径 惊 琎 痉 紧 经 警 谨 鲸
Esperanto: Ĉ ĉ Ĝ ĝ Ĥ ĥ Ĵ ĵ Ŝ ŝ Ŭ ŭ
Russian: Я говорю по-русски.
Greek: Αυτου οι θανατον μητσομαι