Introduction to Computer Data Representation

Characters and Text

Author(s): Peter Fenwick

Pp: 133-157 (25)

DOI: 10.2174/9781608058822114010011

* (Excluding Mailing and Handling)


Characters are the most-visible aspect of computing. This chapter outlines the development of the EBCDIC codes (from card code), and ASCII (initially from paper tape), and the extension of these codes to include a full range of alphabets, to give UNICODE. Other topics include the collection of characters into text strings, and especially the problems of transmitting binary data over systems designed for handling text. Thus it describes UTF-8 and UTF-7 coding, as well as "punycode", for encoding Internet domain names with arbitrary alphabets.

Keywords: EBCDIC, ASCII, text strings, EBCDIC-ASCII incompatibilities, ASCII extension to UNICODE, UCS-2, UCS-4. MIME en- coding, UTF-7. UTF-8, Punycode.

Related Journals
Related Books
© 2024 Bentham Science Publishers | Privacy Policy