U

UAAG · UBL · UCD · UCI · UCLP · UCS · UCS-2 · UCS-2E · UCS-4 · UDDI · UDF · UDF Bridge · UDI · UDP · UDP-Lite · UFS · UI · UIML · UMD · UML · UMTS · UNC · UN/CEFACT · UNIMARC · UPC · UPML · UPP · URI · URL · URN · USB · USMARC · USSD · UT · UT0 · UT1 · UT2 · UTC · UTF · UTF-1 · UTF-16 · UTF-16BE · UTF-16LE · UTF-2 · UTF-32 · UTF-32BE · UTF-32LE · UTF-7 · UTF-8 · UUCP · UUID · UWCC · UXGA · Unicode · Unix · Usenet News

Webdret.net/glossary

UCS (Universal Multiple-Octet Coded Character Set)

UCS standardized in ISO 10646 integrates all previous internationally/nationally agreed character sets into a single code set. UCS is based on 4-octet (32-bit) coding scheme known as the "canonical form" (UCS-4), but a 2-octet (16-bit) form (UCS-2) is used for the BMP, where octets 1 and 2 are assumed to be 00 00. The code set is split into 128 "groups" of "planes" containing 256 "rows" with 256 "cells" for characters. Each character is addressed using multiple octets, the third (in UCS-2 the first) of which identifies the row containing the character and the fourth (in UCS-2 the second) its cell number. The first 127 characters of the BMP used for 16-bit code interchange are those of ASCII. The characters forming the second half of the first row are those used in ISO 8859-1.

Type Associations

Associations

Mentioned in...

UTF · Unicode

Bibliographic References

Additional Information

  • Topic Creation: 2000-06-08; HTML Creation: 2012-01-22, 07:00:09
  • Comments? Corrections? Updates? Please send Email!

1 2 3 4 5 6 7 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z