8 min read
2026-02-06
Unicode is an international coding standard that assigns a unique number (code point) to every character in every language in the world.
A code point is a numeric character identifier in the format U+XXXX. For example:
| Symbol | Code point | Description |
|---|---|---|
| A | U+0041 | Latin A |
| I | U+042F | Cyrillic I |
| € | U+20AC | Euro sign |
| 😀 | U+1F600 | Smiley |
Enter a character to search for its code point
Or enter the code point to find the character
Browse character blocks by category
Copy symbols to the clipboard
Unicode is organized into blocks: Basic Latin, Cyrillic, CJK Ideographs, Emoji and others. Each block contains symbols for a specific purpose.
Unicode defines a set of characters, and encodings (UTF-8, UTF-16, UTF-32) determine how those characters are stored in memory.
The standard is constantly expanding. Each new version adds symbols: emoji, historical writing, mathematical symbols.
See also: HTML encoding, URL encoding, Binary text