| UnicodeUnicode defines a 31-bit character set. Unicode is closely aligned with UCS. The most commonly used characters, including all those found in older encoding standards, have been placed in one of the first 65534 positions (0x0000 to 0xFFFD). This 16-bit subset is called the BMP or "Plane 0". The characters that were later added outside the 16-bit BMP are mostly for specialist applications such as historic scripts and scientific notation. New characters are still being added on a continuous basis, but the existing characters will not be changed any more and are stable. Unicode assigns to each character not only a code number but also an official name. A hexadecimal number that represents a Unicode or UCS value is commonly preceded by "U+" as in U+0041 for the character "Latin capital letter A". The Unicode characters U+0000 to U+007F are identical to those in ASCII, and the range U+0000 to U+00FF is identical to ISO 8859-1. Type Associations- Topic(s) from which this Topic is derived:
Associations- Unicode
contains
- Unicode
is based on
- Unicode
is used as a base by
- Unicode
is informatively described at
Mentioned in...Character Set · CRVX · OpenType · UTF-32 Bibliographic References- Unicode Consortium, The Unicode Standard: Version 4.0, Addison Wesley, Reading, Massachusetts, August 2003, 0321185781 [1]
- Unicode Consortium, The Unicode Standard: Version 3.0, Addison Wesley, Reading, Massachusetts, February 2000, 0201616335 [1]
- Unicode Consortium, The Unicode Standard: Version 2.0, Addison Wesley, Reading, Massachusetts, August 1996, 0201483459 [1]
- Unicode Consortium, The Unicode Standard: Worldwide Character Encoding, Addison Wesley, Reading, Massachusetts, October 1991, 0201567881 [1]
- Martin J. Dürst, Asmus Freytag, Unicode in XML and other Markup Languages, World Wide Web Consortium, Note NOTE-unicode-xml-20070516, May 2007 [0.9]
- François Yergeau, Martin J. Dürst, Richard Ishida, Addison P. Phillips, Misha Wolf, Tex Texin, Character Model for the World Wide Web 1.0: Normalization, World Wide Web Consortium, Working Draft charmod-norm-20051027, October 2005 [0.9]
- Martin J. Dürst, François Yergeau, Richard Ishida, Misha Wolf, Tex Texin, Character Model for the World Wide Web 1.0: Fundamentals, World Wide Web Consortium, Recommendation REC-charmod-20050215, February 2005 [0.9]
- Martin J. Dürst, François Yergeau, Richard Ishida, Misha Wolf, Tex Texin, Character Model for the World Wide Web 1.0: Resource Identifiers, World Wide Web Consortium, Proposed Recommendation CR-charmod-resid-20041122, November 2004 [0.9]
- Martin J. Dürst, François Yergeau, Richard Ishida, Misha Wolf, Tex Texin, Character Model for the World Wide Web 1.0, World Wide Web Consortium, Working Draft WD-charmod-20030822, August 2003 [0.9]
- David Carlisle, Patrick Ion, XML Entity definitions for Characters, World Wide Web Consortium, Working Draft WD-xml-entity-names-20080721, July 2008 [0.8]
- Proceedings of the Twenty-fourth Internationalization and Unicode Conference, Atlanta, Georgia, September 2003 [0.8]
- Proceedings of the Nineteenth Internationalization and Unicode Conference, San Jose, California, September 2001 [0.8]
- David Goldsmith, Mark Davis, UTF-7: A Mail-Safe Transformation Format of Unicode, Internet RFC 2152, May 1997 [0.8]
- Cliff Schmidt, XML and Unicode Normalization, Proceedings of XML Europe 2003, London, UK, May 2003 [0.7]
Additional Information- Topic Creation: 2000-06-07, Modification Date: 2003-01-23; HTML Creation: 2008-08-31, 07:00:32
- Comments? Corrections? Updates? Please send Email!
|