DEV Community

Zhangwuji
Zhangwuji

Posted on

Unicode ASCII the history of character set

1.
one byte have eight bit;one bit is 0 or 1. is basical unit;

computer developed in Us。So one bytes could stand for all alphabet of english and common character。they make a character set called ASCll, one character corresponded to one byte。
对应
with computer developping more and more popular,more countries lauange wanted to be joined;china make character set call GBK that chinese character occupied 2 byte.the situation is so complicated and unmananged becauseof many contries has own character set。 in order to
solve this problem apple company invented unicode table initially。

in order to compatible ASCll and the charactor of ASCll occupied one bytes;

But how to clearly know how many bytes one character corresponds to?
they invented utf-8 encode and decode plan。
it specified In a byte if the leading bytes are 111.it indicates three bytes sequence。
if one bytes
0 ....
if two bytes
110 ...... 10.....

if three bytes
110 ...... 10..... 10 ....
... correspond true code of unicode set

Top comments (0)