Unicode Character Inspector
5 code points · 6 UTF-16 code units · 11 UTF-8 bytes
| Char | Code point | Decimal | HTML entity | UTF-8 | UTF-16 | Name | Copy |
|---|---|---|---|---|---|---|---|
| H | U+0048 | 72 | H | 48 | 0048 | — | |
| é | U+00E9 | 233 | é | C3 A9 | 00E9 | — | |
| 😀 | U+1F600 | 128512 | 😀 | F0 9F 98 80 | D83D DE00 | — | |
| · | U+200B | 8203 | ​ | E2 80 8B | 200B | Zero Width Space | |
| A | U+0041 | 65 | A | 41 | 0041 | — |
See exactly what is in a string
Text pasted from documents, chat, and spreadsheets routinely carries invisible or look-alike characters — non-breaking spaces, zero-width joiners, smart quotes, combining marks — that silently break diffs, regexes, and CSV or JSON parsing. This inspector breaks your input into one row per Unicode code point and reveals each character's code point, decimal value, HTML numeric entity, and UTF-8 and UTF-16 byte encodings.
Astral characters and most emoji are a single code point even though they take two UTF-16 code units, and combining marks count as their own code points — which is why the code-point, UTF-16, and UTF-8 counts in the summary can all differ. Invisible and non-printing characters show as a · placeholder with their name so they never collapse into a blank cell.