4.1.10
Character Sets
Character Sets
Character Sets
Text data is made up of characters. Character sets allow us to store characters digitally.
![Illustrative background for Character sets](https://image-v2.cdn.app.senecalearning.com/courseImages/biology/OCR Gateway/5.1.5/coding ,h_400,q_80,w_640.jpg)
![Illustrative background for Character sets ?? "content](https://image-v2.cdn.app.senecalearning.com/courseImages/biology/OCR Gateway/5.1.5/coding ,h_400,q_80,w_640.jpg)
Character sets
Character sets
- Text data is made up of characters.
- Each character is assigned its own character code.
- A character set is a collection of all the characters that a computer recognises, along with their binary codes.
![Illustrative background for What's in a character set?](https://image-v2.cdn.app.senecalearning.com/2018-07/95d17dd9-261f-4238-9065-f411ea8c7210/code-Data-Computer-Wifi-Internet,h_400,q_80,w_640.jpg)
![Illustrative background for What's in a character set? ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-07/95d17dd9-261f-4238-9065-f411ea8c7210/code-Data-Computer-Wifi-Internet,h_400,q_80,w_640.jpg)
What's in a character set?
What's in a character set?
- Character sets include:
- Alphanumeric characters e.g. letters, numbers, and symbols.
- Special characters e.g. new line.
![Illustrative background for Examples of character sets](https://image-v2.cdn.app.senecalearning.com/2018-08/0c868665-5d32-4e57-b2ef-eac6b395424a/monitor-933392_1920,h_400,q_80,w_640.jpg)
![Illustrative background for Examples of character sets ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/0c868665-5d32-4e57-b2ef-eac6b395424a/monitor-933392_1920,h_400,q_80,w_640.jpg)
Examples of character sets
Examples of character sets
- There are two main character sets in use:
- American Standard Code for Information Interchange.
- Unicode.
American Standard Code for Information Interchange
American Standard Code for Information Interchange
The American Standard Code for Information Interchange (ASCII) character set is the most common character set.
![Illustrative background for ASCII](https://image-v2.cdn.app.senecalearning.com/2018-08/aacaeef8-fc6d-4984-99cc-be44f0b44e55/Hexadecimal-numbers-code-,h_400,q_80,w_640.jpg)
![Illustrative background for ASCII ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/aacaeef8-fc6d-4984-99cc-be44f0b44e55/Hexadecimal-numbers-code-,h_400,q_80,w_640.jpg)
ASCII
ASCII
- Each character in ASCII is represented by a seven-bit binary code.
- That means there is a maximum of 128 characters.
- ASCII includes all commonly used letters and symbols in the English language.
![Illustrative background for 7-bit letters?](https://image-v2.cdn.app.senecalearning.com/2018-08/3431afcc-ad71-4c9e-b5af-4c8be1bed7f9/binary-code-,h_400,q_80,w_640.jpg)
![Illustrative background for 7-bit letters? ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/3431afcc-ad71-4c9e-b5af-4c8be1bed7f9/binary-code-,h_400,q_80,w_640.jpg)
7-bit letters?
7-bit letters?
- Each letter is represented by seven bits.
- This is useful because when used in an 8-bit system, the extra bit can be used as a check digit.
![Illustrative background for Limitations of ASCII](https://image-v2.cdn.app.senecalearning.com/2018-08/9b3e51b8-472b-44a3-adbe-62a3e2df8363/unicode-code-binary-,h_400,q_80,w_640.jpg)
![Illustrative background for Limitations of ASCII ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/9b3e51b8-472b-44a3-adbe-62a3e2df8363/unicode-code-binary-,h_400,q_80,w_640.jpg)
Limitations of ASCII
Limitations of ASCII
- 128 characters is perfectly fine for the English language. But it does not leave space for characters from other languages.
- An extended ASCII set was released which used all eight bits, but it was still not enough.
- This led to the release of Unicode.
Unicode
Unicode
Unicode is a character set which was released because of the need to standardise character sets internationally.
![Illustrative background for Unicode](https://image-v2.cdn.app.senecalearning.com/2018-08/aacaeef8-fc6d-4984-99cc-be44f0b44e55/Hexadecimal-numbers-code-,h_400,q_80,w_640.jpg)
![Illustrative background for Unicode ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/aacaeef8-fc6d-4984-99cc-be44f0b44e55/Hexadecimal-numbers-code-,h_400,q_80,w_640.jpg)
Unicode
Unicode
- Unicode aims to represent every possible character in the world.
- The most common form of Unicode is UTF-8 and uses between eight and 32 bit binary codes to represent each character.
![Illustrative background for Compatability with ASCII](https://image-v2.cdn.app.senecalearning.com/2018-08/61d41c25-a8c0-4b32-b101-5487ab592e7a/computer-code-,h_400,q_80,w_640.jpg)
![Illustrative background for Compatability with ASCII ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/61d41c25-a8c0-4b32-b101-5487ab592e7a/computer-code-,h_400,q_80,w_640.jpg)
Compatability with ASCII
Compatability with ASCII
- The first 256 characters in Unicode are identical to extended ASCII, which makes it backwards compatible with documents encoded using older character sets.
![Illustrative background for Types of characters](https://image-v2.cdn.app.senecalearning.com/2018-08/997c682a-908c-474d-81ae-a3e57e9fd7fd/emoji-,h_400,q_80,w_640.jpg)
![Illustrative background for Types of characters ?? "content](https://image-v2.cdn.app.senecalearning.com/2018-08/997c682a-908c-474d-81ae-a3e57e9fd7fd/emoji-,h_400,q_80,w_640.jpg)
Types of characters
Types of characters
- Unicode represents characters from all major alphabets of the world.
- Unicode is also used to represent emojis!
1Components of a Computer
1.1Structure & Function of the Processor
1.2Types of Processors
2Software & Software Development
2.1Systems Software
2.2Applications Generation
2.3Software Development
3Exchanging Data
3.1Compression, Encryption & Hashing
3.3Networks
4Data Types, Data Structures & Algorithms
4.1Data Types
5Legal, Moral, Cultural & Ethical Issues
5.1Computing Related Legislation
6Elements of Computational Thinking
6.1Thinking Abstractly
6.2Thinking Procedurally
6.3Thinking Logically
7Problem Solving & Programming
7.1Programming Techniques
7.2Programming Construction
Jump to other topics
1Components of a Computer
1.1Structure & Function of the Processor
1.2Types of Processors
2Software & Software Development
2.1Systems Software
2.2Applications Generation
2.3Software Development
3Exchanging Data
3.1Compression, Encryption & Hashing
3.3Networks
4Data Types, Data Structures & Algorithms
4.1Data Types
5Legal, Moral, Cultural & Ethical Issues
5.1Computing Related Legislation
6Elements of Computational Thinking
6.1Thinking Abstractly
6.2Thinking Procedurally
6.3Thinking Logically
7Problem Solving & Programming
7.1Programming Techniques
7.2Programming Construction
![Go student ad image](/en-GB/revision-notes/_next/image?url=%2Fen-GB%2Frevision-notes%2Fimages%2Fgo-student-uk-ad.jpg&w=640&q=100)
Unlock your full potential with GoStudent tutoring
Affordable 1:1 tutoring from the comfort of your home
Tutors are matched to your specific learning needs
30+ school subjects covered