For the use of Indian scripts for data entry, which encoding scheme is

For the use of Indian scripts for data entry, which encoding scheme is used on most modern computer systems and smartphones?

ASCII
EBCDIC
Unicode
ISO/IEC 646
This question was previously asked in
UPSC CISF-AC-EXE – 2019
– ASCII (American Standard Code for Information Interchange) is a character encoding standard for electronic communication. It uses 7 bits and can represent 128 characters, primarily the English alphabet, numbers, and some control characters. It is insufficient for representing characters from other languages, including Indian scripts.
– EBCDIC (Extended Binary Coded Decimal Interchange Code) is another character encoding used primarily on IBM mainframe systems. It is also limited and cannot widely represent the diverse characters of Indian scripts.
– ISO/IEC 646 is a standard based on ASCII. It also lacks the capacity for representing a wide range of international characters.
– Unicode is a universal character encoding standard that aims to represent every character from every writing system in the world, including all major Indian scripts like Devanagari, Bengali, Tamil, Telugu, etc. Modern computer systems and smartphones widely use Unicode (e.g., UTF-8, UTF-16) to handle text in multiple languages.
Therefore, Unicode is the encoding scheme used on most modern computer systems and smartphones for the use of Indian scripts.
– Character encoding standards map characters to numerical values for digital representation.
– Older standards like ASCII and EBCDIC have limited character sets.
– Unicode is a modern, comprehensive standard designed to support characters from all languages globally.
Unicode uses different encoding forms, such as UTF-8, UTF-16, and UTF-32, to represent code points. UTF-8 is the most common encoding on the web and in many operating systems due to its variable-length encoding which is backward compatible with ASCII.