Summary of CJK (Chinese, Japanese and Korean unified ideograph) characters in Unicode
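CJK stands for Chinese, Japanese and Korean. CJK Unified Ideographs (中日韩统一表意文字) assign a single code point to ideographs from Chinese, Japanese, Korean and Vietnamese that share the same origin and meaning and have the same or only slightly different shapes. Most of them are Chinese characters (hanzi), but the set also includes hanzi-like characters such as Japanese kokuji, Korea-specific hanja and Vietnamese chữ Nôm. The set is very large. Most of its characters can be typed directly with a Chinese input method, so there is usually no need to look up a Unicode code point to display them. The related blocks, however, also contain radicals, punctuation marks and special symbols that sometimes have to be obtained through their Unicode code points.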
1. CJK radicals (CJK Radicals Supplement):
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\u2E8 | ⺀ | ⺁ | ⺂ | ⺃ | ⺄ | ⺅ | ⺆ | ⺇ | ⺈ | ⺉ | ⺊ | ⺋ | ⺌ | ⺍ | ⺎ | ⺏ |
\u2E9 | ⺐ | ⺑ | ⺒ | ⺓ | ⺔ | ⺕ | ⺖ | ⺗ | ⺘ | ⺙ |  | ⺛ | ⺜ | ⺝ | ⺞ | ⺟ |
\u2EA | ⺠ | ⺡ | ⺢ | ⺣ | ⺤ | ⺥ | ⺦ | ⺧ | ⺨ | ⺩ | ⺪ | ⺫ | ⺬ | ⺭ | ⺮ | ⺯ |
\u2EB | ⺰ | ⺱ | ⺲ | ⺳ | ⺴ | ⺵ | ⺶ | ⺷ | ⺸ | ⺹ | ⺺ | ⺻ | ⺼ | ⺽ | ⺾ | ⺿ |
\u2EC | ⻀ | ⻁ | ⻂ | ⻃ | ⻄ | ⻅ | ⻆ | ⻇ | ⻈ | ⻉ | ⻊ | ⻋ | ⻌ | ⻍ | ⻎ | ⻏ |
\u2ED | ⻐ | ⻑ | ⻒ | ⻓ | ⻔ | ⻕ | ⻖ | ⻗ | ⻘ | ⻙ | ⻚ | ⻛ | ⻜ | ⻝ | ⻞ | ⻟ |
\u2EE | ⻠ | ⻡ | ⻢ | ⻣ | ⻤ | ⻥ | ⻦ | ⻧ | ⻨ | ⻩ | ⻪ | ⻫ | ⻬ | ⻭ | ⻮ | ⻯ |
\u2EF | ⻰ | ⻱ | ⻲ | ⻳ |  |  |  |  |  |  |  |  |  |  |  |  |
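A table like the one above can be regenerated from the code points alone. The following is a minimal Python sketch, not the method used to produce this article: the function names `block_rows` and `format_table` are made up for illustration, and which cells come out empty depends on the Unicode version bundled with the Python build.

```python
import unicodedata


def block_rows(start, end):
    """Yield (row_label, cells) for each 16-code-point row covering start..end."""
    for base in range(start & ~0xF, end + 1, 16):
        cells = []
        for cp in range(base, base + 16):
            ch = chr(cp)
            # General category "Cn" marks a code point that is unassigned in
            # the Unicode data shipped with this Python; show it as an empty cell.
            cells.append("" if unicodedata.category(ch) == "Cn" else ch)
        # The row label is the code point without its last hex digit, e.g. \u2E8.
        yield f"\\u{base >> 4:X}", cells


def format_table(start, end):
    """Return the block as pipe-delimited text, one row of 16 cells per line."""
    lines = ["Unicode | " + " | ".join("0123456789ABCDEF") + " |"]
    for label, cells in block_rows(start, end):
        lines.append(label + " | " + " | ".join(cells) + " |")
    return "\n".join(lines)


# CJK Radicals Supplement occupies U+2E80..U+2EFF; print(radicals) displays it.
radicals = format_table(0x2E80, 0x2EFF)
```

Passing another range, for example `format_table(0x31C0, 0x31EF)` for CJK Strokes, gives the corresponding table for that block.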
2. CJK punctuation (CJK Symbols and Punctuation):
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\u300 | 　 | 、 | 。 | 〃 | 〄 | 々 | 〆 | 〇 | 〈 | 〉 | 《 | 》 | 「 | 」 | 『 | 』 |
\u301 | 【 | 】 | 〒 | 〓 | 〔 | 〕 | 〖 | 〗 | 〘 | 〙 | 〚 | 〛 | 〜 | 〝 | 〞 | 〟 |
\u302 | 〠 | 〡 | 〢 | 〣 | 〤 | 〥 | 〦 | 〧 | 〨 | 〩 | 〪 | 〫 | 〬 | 〭 | 〮 | 〯 |
\u303 | 〰 | 〱 | 〲 | 〳 | 〴 | 〵 | 〶 | 〷 | 〸 | 〹 | 〺 | 〻 | 〼 | 〽 | 〾 | 〿 |
3. CJK Strokes:
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\u31C | ㇀ | ㇁ | ㇂ | ㇃ | ㇄ | ㇅ | ㇆ | ㇇ | ㇈ | ㇉ | ㇊ | ㇋ | ㇌ | ㇍ | ㇎ | ㇏ |
\u31D | ㇐ | ㇑ | ㇒ | ㇓ | ㇔ | ㇕ | ㇖ | ㇗ | ㇘ | ㇙ | ㇚ | ㇛ | ㇜ | ㇝ | ㇞ | ㇟ |
\u31E | ㇠ | ㇡ | ㇢ | ㇣ |  |  |  |  |  |  |  |  |  |  |  |  |
4. CJK compatibility characters (CJK Compatibility):
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\u335 | ㍐ | ㍑ | ㍒ | ㍓ | ㍔ | ㍕ | ㍖ | ㍗ | ㍘ | ㍙ | ㍚ | ㍛ | ㍜ | ㍝ | ㍞ | ㍟ |
\u336 | ㍠ | ㍡ | ㍢ | ㍣ | ㍤ | ㍥ | ㍦ | ㍧ | ㍨ | ㍩ | ㍪ | ㍫ | ㍬ | ㍭ | ㍮ | ㍯ |
\u337 | ㍰ | ㍱ | ㍲ | ㍳ | ㍴ | ㍵ | ㍶ | ㍷ | ㍸ | ㍹ | ㍺ | ㍻ | ㍼ | ㍽ | ㍾ | ㍿ |
\u338 | ㎀ | ㎁ | ㎂ | ㎃ | ㎄ | ㎅ | ㎆ | ㎇ | ㎈ | ㎉ | ㎊ | ㎋ | ㎌ | ㎍ | ㎎ | ㎏ |
\u339 | ㎐ | ㎑ | ㎒ | ㎓ | ㎔ | ㎕ | ㎖ | ㎗ | ㎘ | ㎙ | ㎚ | ㎛ | ㎜ | ㎝ | ㎞ | ㎟ |
\u33A | ㎠ | ㎡ | ㎢ | ㎣ | ㎤ | ㎥ | ㎦ | ㎧ | ㎨ | ㎩ | ㎪ | ㎫ | ㎬ | ㎭ | ㎮ | ㎯ |
\u33B | ㎰ | ㎱ | ㎲ | ㎳ | ㎴ | ㎵ | ㎶ | ㎷ | ㎸ | ㎹ | ㎺ | ㎻ | ㎼ | ㎽ | ㎾ | ㎿ |
\u33C | ㏀ | ㏁ | ㏂ | ㏃ | ㏄ | ㏅ | ㏆ | ㏇ | ㏈ | ㏉ | ㏊ | ㏋ | ㏌ | ㏍ | ㏎ | ㏏ |
\u33D | ㏐ | ㏑ | ㏒ | ㏓ | ㏔ | ㏕ | ㏖ | ㏗ | ㏘ | ㏙ | ㏚ | ㏛ | ㏜ | ㏝ | ㏞ | ㏟ |
\u33E | ㏠ | ㏡ | ㏢ | ㏣ | ㏤ | ㏥ | ㏦ | ㏧ | ㏨ | ㏩ | ㏪ | ㏫ | ㏬ | ㏭ | ㏮ | ㏯ |
\u33F | ㏰ | ㏱ | ㏲ | ㏳ | ㏴ | ㏵ | ㏶ | ㏷ | ㏸ | ㏹ | ㏺ | ㏻ | ㏼ | ㏽ | ㏾ | ㏿ |
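Many characters in this block are single-cell versions of ordinary letter or kana sequences, and Unicode records that relationship as a compatibility decomposition. The sketch below is an illustrative aside rather than part of the original article. It uses Python's standard `unicodedata.normalize` with the NFKC form to expand a few of the squared unit symbols from the table; the helper name `compat_expansion` is made up for this example.

```python
import unicodedata


def compat_expansion(ch):
    """Return the NFKC (compatibility) expansion of a single character."""
    return unicodedata.normalize("NFKC", ch)


# U+338F (square kg), U+33A1 (square m squared) and U+33C4 (square cc),
# each mapped to its plain-text equivalent.
samples = {ch: compat_expansion(ch) for ch in "\u338F\u33A1\u33C4"}
```

This can be handy when text containing these symbols has to be searched or compared as plain text.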
5. The 64 Yijing hexagrams (Yijing Hexagram Symbols):
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\u4DC | ䷀ | ䷁ | ䷂ | ䷃ | ䷄ | ䷅ | ䷆ | ䷇ | ䷈ | ䷉ | ䷊ | ䷋ | ䷌ | ䷍ | ䷎ | ䷏ |
\u4DD | ䷐ | ䷑ | ䷒ | ䷓ | ䷔ | ䷕ | ䷖ | ䷗ | ䷘ | ䷙ | ䷚ | ䷛ | ䷜ | ䷝ | ䷞ | ䷟ |
\u4DE | ䷠ | ䷡ | ䷢ | ䷣ | ䷤ | ䷥ | ䷦ | ䷧ | ䷨ | ䷩ | ䷪ | ䷫ | ䷬ | ䷭ | ䷮ | ䷯ |
\u4DF | ䷰ | ䷱ | ䷲ | ䷳ | ䷴ | ䷵ | ䷶ | ䷷ | ䷸ | ䷹ | ䷺ | ䷻ | ䷼ | ䷽ | ䷾ | ䷿ |
6. CJK Compatibility Forms:
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\uFE3 | ︰ | ︱ | ︲ | ︳ | ︴ | ︵ | ︶ | ︷ | ︸ | ︹ | ︺ | ︻ | ︼ | ︽ | ︾ | ︿ |
\uFE4 | ﹀ | ﹁ | ﹂ | ﹃ | ﹄ | ﹅ | ﹆ | ﹇ | ﹈ | ﹉ | ﹊ | ﹋ | ﹌ | ﹍ | ﹎ | ﹏ |
7. Vertical Forms:
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\uFE1 | ︐ | ︑ | ︒ | ︓ | ︔ | ︕ | ︖ | ︗ | ︘ | ︙ |  |  |  |  |  |  |
8. Small Form Variants:
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\uFE5 | ﹐ | ﹑ | ﹒ |  | ﹔ | ﹕ | ﹖ | ﹗ | ﹘ | ﹙ | ﹚ | ﹛ | ﹜ | ﹝ | ﹞ | ﹟ |
\uFE6 | ﹠ | ﹡ | ﹢ | ﹣ | ﹤ | ﹥ | ﹦ |  | ﹨ | ﹩ | ﹪ | ﹫ |  |  |  |  |
9. Halfwidth and Fullwidth Forms:
Unicode | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
\uFF0 |  | ！ | ＂ | ＃ | ＄ | ％ | ＆ | ＇ | （ | ） | ＊ | ＋ | ， | － | ． | ／ |
\uFF1 | ０ | １ | ２ | ３ | ４ | ５ | ６ | ７ | ８ | ９ | ： | ； | ＜ | ＝ | ＞ | ？ |
\uFF2 | ＠ | Ａ | Ｂ | Ｃ | Ｄ | Ｅ | Ｆ | Ｇ | Ｈ | Ｉ | Ｊ | Ｋ | Ｌ | Ｍ | Ｎ | Ｏ |
\uFF3 | Ｐ | Ｑ | Ｒ | Ｓ | Ｔ | Ｕ | Ｖ | Ｗ | Ｘ | Ｙ | Ｚ | ［ | ＼ | ］ | ＾ | ＿ |
\uFF4 | ｀ | ａ | ｂ | ｃ | ｄ | ｅ | ｆ | ｇ | ｈ | ｉ | ｊ | ｋ | ｌ | ｍ | ｎ | ｏ |
\uFF5 | ｐ | ｑ | ｒ | ｓ | ｔ | ｕ | ｖ | ｗ | ｘ | ｙ | ｚ | ｛ | ｜ | ｝ | ～ | ｟ |
\uFFE | ￠ | ￡ | ￢ | ￣ | ￤ | ￥ | ￦ |  | ￨ | ￩ | ￪ | ￫ | ￬ | ￭ | ￮ |  |
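The fullwidth forms U+FF01 to U+FF5E line up one-to-one with printable ASCII U+0021 to U+007E, so the two differ by a constant offset of 0xFEE0. The following Python sketch relies on that offset to convert in both directions. The function names are illustrative, and mapping the ASCII space to the ideographic space U+3000 is a common convention assumed here, not something the table itself states.

```python
FULLWIDTH_OFFSET = 0xFEE0  # U+FF01 minus U+0021
IDEOGRAPHIC_SPACE = "\u3000"


def to_fullwidth(text):
    """Map printable ASCII to fullwidth forms; leave other characters alone."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0x21 <= cp <= 0x7E:
            out.append(chr(cp + FULLWIDTH_OFFSET))
        elif ch == " ":
            out.append(IDEOGRAPHIC_SPACE)  # assumed convention for the space
        else:
            out.append(ch)
    return "".join(out)


def to_halfwidth(text):
    """Inverse of to_fullwidth for U+FF01..U+FF5E and U+3000."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0xFF01 <= cp <= 0xFF5E:
            out.append(chr(cp - FULLWIDTH_OFFSET))
        elif ch == IDEOGRAPHIC_SPACE:
            out.append(" ")
        else:
            out.append(ch)
    return "".join(out)
```

The symbols in the \uFFE row do not follow this offset and would need an explicit lookup table, which the sketch leaves out.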
10. CJK Unified Ideographs: 4E00--9FEA
11. CJK Unified Ideographs Extension A: 3400--4DB5
12. CJK Compatibility Ideographs: F900--FAD9
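The three ranges above lend themselves to a simple range check. The Python sketch below is an illustration only: the name `cjk_block` is hypothetical, the end points are the ones given in the list (they reflect the characters assigned when the list was compiled, and later Unicode versions assign more), and Extension A is taken to start at U+3400 as defined by the Unicode Standard.

```python
# (block name, first code point, last code point) as given in the list above.
CJK_RANGES = [
    ("CJK Unified Ideographs", 0x4E00, 0x9FEA),
    ("CJK Unified Ideographs Extension A", 0x3400, 0x4DB5),
    ("CJK Compatibility Ideographs", 0xF900, 0xFAD9),
]


def cjk_block(ch):
    """Return the name of the listed range containing ch, or None."""
    cp = ord(ch)
    for name, first, last in CJK_RANGES:
        if first <= cp <= last:
            return name
    return None
```

For example, `cjk_block("中")` falls in the first range, while a hexagram such as U+4DC0 lies just outside Extension A and yields `None`.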
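The CJK character set is huge, so the full listing is omitted here and only some of the special characters are shown. Some of them may not display correctly because the installed fonts have no glyphs for them. To use a Unicode character in HTML, write "&#xhhhh;", where hhhh is the four-digit hexadecimal Unicode code point, or convert the code point to decimal and write "&#dddd;". In JavaScript, use the form "\uhhhh". In CSS, use the form "\hhhh".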
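As a rough illustration of these notations, the Python sketch below builds each escape form from a character. The function names are made up for this example. The `\u{...}` branch for code points above U+FFFF is the ES2015 JavaScript syntax and goes beyond the four-digit form described here.

```python
def html_hex(ch):
    """HTML hexadecimal character reference, e.g. &#x3001;"""
    return f"&#x{ord(ch):X};"


def html_dec(ch):
    """HTML decimal character reference, e.g. &#12289;"""
    return f"&#{ord(ch)};"


def js_escape(ch):
    """JavaScript string escape: \\uhhhh, or \\u{h...} above U+FFFF (ES2015)."""
    cp = ord(ch)
    if cp > 0xFFFF:
        return f"\\u{{{cp:X}}}"
    return f"\\u{cp:04X}"


def css_escape(ch):
    """CSS escape: a backslash followed by the hex code point.

    If the next character in the stylesheet is a hex digit or whitespace,
    a single space must follow the escape to terminate it.
    """
    return f"\\{ord(ch):X}"
```

For the ideographic comma 、 (U+3001), these return `&#x3001;`, `&#12289;`, `\u3001` and `\3001` respectively.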