Conversational modules
Programmable Channels
Platform functionality
Business segments
Industry verticals
Department
Our services
Solutions for telecoms
The Infobip advantage for telcos
See why leading telecoms around the world choose Infobip to transform their network
Telecom CPaaS partnerships
Create new B2B revenue streams with our omnichannel communications platform
Telecom core & security
Anam Protect Firewall
Secure your network from SMS and Voice fraud with our firewall that protects 120+ operators
SMS Firewall
Ensure all A2P SMS traffic is properly charged and eliminate revenue leakage with our SMS Firewall
Community & Resources
Knowledge hub
Title
What is Unicode?
The Universal Coded Character Set (UCS) or Unicode is an industry standard allowing computers to consistently represent and manipulate text expressed in any of the world’s writing systems.
As an SMS MT feature, it allows any content to be sent within a message.
What is the difference between Unicode and ASCII?
Unicode and ASCII are computer systems’ two most used character encoding schemes. While Unicode is used to exchange, process, and store text data in any language, ASCII is used to represent text in the form of symbols, numbers, and characters.
How many characters are in Unicode?
As of Version 15.0, Unicode contains 149,186 characters, but the answer is more complicated than that because of all the different kinds of characters that people might be interested in counting.
What is the difference between Unicode and ISO/IEC 10646?
Unicode and ISO/IEC 10646 contain the same characters at the exact location, but Unicode has rules and specifications outside the scope of ISO/IEC 10646. In other words, Unicode offers an extensive set of functional character specifications, data, algorithms, and background material that are not part of ISO/IEC 10646.
How does Unicode look like?
Unicode uses two encoding formats, 8-bit and 16-bit. The default encoding format is 16-bit, where each character is 16 bits wide (2 bytes), with the form usually shown as U+hhhh, where hhhh is the character’s hexadecimal code point.
For example, “Infobip” written in Unicode looks like: U+0049 U+006E U+0066 U+006F U+0062 U+0069 U+0070.