On this page about Unicode:
Unicode is an industry standard designed to allow text and symbols from all of the writing systems of the world to be consistently represented and manipulated by computers. Developed in tandem with the Universal Character Set standard and published in book form as The Unicode Standard, Unicode consists of a character repertoire, an encoding methodology and set of standard character encodings, a set of code charts for visual reference, an enumeration of character properties such as upper and lower case, a set of reference data computer files, and rules for normalization, decomposition, collation and rendering.
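As a rough illustration of the character properties and normalization rules mentioned above, the following sketch uses Python's standard `unicodedata` module. The helper name `describe` and the sample characters are chosen here for illustration only and do not come from the standard itself.

```python
import unicodedata

def describe(ch):
    """Return the Unicode name and general category of a single character."""
    return unicodedata.name(ch), unicodedata.category(ch)

# Case is a character property: "Lu" marks an uppercase letter, "Ll" a lowercase one.
print(describe("A"))
print(describe("a"))

# Normalization: "e" followed by COMBINING ACUTE ACCENT (U+0301) is composed
# by NFC into the single code point U+00E9, and NFD decomposes it again.
decomposed = "e\u0301"
composed = unicodedata.normalize("NFC", decomposed)
print(len(decomposed), len(composed))
```

Both strings render as the same accented letter, which is why comparisons of user-visible text are usually done after normalizing both sides to the same form.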
The Unicode Consortium, the non-profit organization that coordinates Unicode's development, has the ambitious goal of eventually replacing existing character encoding schemes with Unicode and its standard Unicode Transformation Format (UTF) schemes, as many of the existing schemes are limited in size and scope and are incompatible with multilingual environments. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including XML, the Java programming language and modern operating systems.
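To make the UTF schemes a little more concrete, here is a minimal sketch that encodes the same character in three transformation formats using Python's built-in codecs. The function name `utf_forms` and the choice of the euro sign (U+20AC) are assumptions made for this example.

```python
def utf_forms(ch):
    """Return the hex byte sequence of `ch` in UTF-8, UTF-16 and UTF-32.

    The big-endian variants are used so that no byte order mark is emitted.
    """
    return {name: ch.encode(name).hex() for name in ("utf-8", "utf-16-be", "utf-32-be")}

# One code point, three different byte sequences.
for name, hex_bytes in utf_forms("\u20ac").items():
    print(name, hex_bytes)
```

The code point is the same in every case; only the way it is serialized into bytes differs between the formats.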
How to say "Unicode" in other languages:
(Chinese) | Unicode |
(Japanese) | Unicode |
(German) | Unicode |
(Spanish) | Unicode |
(French) | Unicode |
(Italian) | Unicode |
The ConScript Unicode Registry is a volunteer project to coordinate the assignment of code points in the Unicode private use area for the encoding of artificial scripts. It was founded by John Cowan and is maintained by John Cowan and Michael Everson. It is not an official project of the Unicode...
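As a small sketch of what "private use" means in practice, the following checks whether a code point belongs to a private use area by looking at its general category (`Co`) with Python's `unicodedata` module. The helper name `is_private_use` is hypothetical, and the code says nothing about which scripts the registry assigns to which code points.

```python
import unicodedata

def is_private_use(code_point):
    """Return True if the code point has general category Co (private use)."""
    return unicodedata.category(chr(code_point)) == "Co"

# U+E000..U+F8FF is the private use area of the Basic Multilingual Plane;
# planes 15 and 16 provide two further, much larger private use areas.
for cp in (0x0041, 0xE000, 0xF8FF, 0xF0000):
    print(f"U+{cp:04X}", is_private_use(cp))
```

Because the standard assigns no meaning to these code points, any interpretation of them depends on a private agreement such as the one the registry coordinates.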
Standard Compression Scheme for Unicode
The Standard Compression Scheme for Unicode (SCSU) is a Unicode Technical Standard to reduce the number of bytes needed...
...
The Kanji 犬. Unicode character number: hex 72AC / decimal 29356. On readings: ken. Kun readings (hiragana suffix in brackets): inu. Meaning(s): Japanese ken, inu; English dog...
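The hexadecimal and decimal character numbers quoted for this kanji are two spellings of the same code point, which can be checked with a short sketch using Python's built-in `ord`, `chr` and `format`. The variable names are chosen for this example only.

```python
kanji = "\u72ac"  # the kanji for "dog", written as an escape to avoid encoding issues

code_point = ord(kanji)             # numeric code point of the character
as_hex = format(code_point, "X")    # same number in uppercase hexadecimal

print(code_point, as_hex)

# Going the other way: a hexadecimal string back to the character.
same_kanji = chr(int("72AC", 16))
print(same_kanji == kanji)
```

In Unicode notation this character is conventionally written as U+72AC.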
SCSU can mean several things: St. Cloud State University; South Carolina State University; Southern Connecticut State University; the Standard Compression Scheme for Unicode...
...
GB 18030 is a mainland Chinese computer character encoding. It contains all the Chinese, Japanese and Korean characters in the Unicode 3.0 standard. The SimSun 18030 font enables the display of the GB 18030 characters, which include all the characters in Unicode 2.1 plus new characters found in...
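As a hedged sketch of how GB 18030 relates to Unicode, the following uses Python's built-in `gb18030` codec to encode a few sample characters and decode them again. The helper name `gb18030_bytes` and the sample characters are assumptions made for illustration.

```python
def gb18030_bytes(text):
    """Encode a string with Python's built-in GB 18030 codec."""
    return text.encode("gb18030")

# An ASCII letter, a common Chinese character, and a supplementary-plane code point.
samples = ["A", "\u72ac", "\U00010000"]

for ch in samples:
    encoded = gb18030_bytes(ch)
    # GB 18030 is a variable-width encoding: sequences are 1, 2 or 4 bytes long.
    print(f"U+{ord(ch):04X}", len(encoded), encoded.hex())
    # Decoding restores the original character.
    assert encoded.decode("gb18030") == ch
```

The round trip succeeds for each sample, which reflects that the encoding is designed to map to and from Unicode code points.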
...
Punycode, defined in RFC 3492, is a self-proclaimed "Bootstring encoding" of Unicode strings into the limited...
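A minimal sketch of Punycode in practice, using the `punycode` codec that ships with Python: a Unicode string is converted to an ASCII-only form and back. The helper names `to_punycode` and `from_punycode` and the sample word are chosen for this example.

```python
def to_punycode(label):
    """Encode a Unicode string as ASCII-only Punycode."""
    return label.encode("punycode").decode("ascii")

def from_punycode(encoded):
    """Decode a Punycode string back to the original Unicode string."""
    return encoded.encode("ascii").decode("punycode")

word = "b\u00fccher"  # "bücher"
encoded = to_punycode(word)
print(encoded)                  # bcher-kva
print(from_punycode(encoded) == word)
```

The ASCII letters are copied through unchanged before the final hyphen, and the suffix after it encodes which non-ASCII characters were removed and where they belong.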