Computers process only binary — every character, whether a letter, digit, or punctuation mark, must be stored as a number. Character encoding is the system that assigns a unique number (a code point) to each character, allowing software to consistently translate between binary values and readable text.
Without an agreed standard, text created on one system would display as gibberish on another. A file of English text is just a sequence of bytes; the encoding standard tells the software which character each byte represents.
Two encoding standards are required by the AQA GCSE spec:
| Standard | Bits per character | Characters covered |
|---|
| ASCII (7-bit) | 7 (stored in 8) | 128 characters (0–127) |
| Unicode | Variable | Over 140,000 characters |
A character encoding is a lookup table: a number maps to a character, and that number is what the computer stores in binary.