Unicode Character Inspector

Text to inspect

5 code points · 6 UTF-16 code units · 11 UTF-8 bytes

Char	Code point	Decimal	HTML entity	UTF-8	UTF-16	Name
H	U+0048	72	H	48	0048	—
é	U+00E9	233	é	C3 A9	00E9	—
😀	U+1F600	128512	😀	F0 9F 98 80	D83D DE00	—
·	U+200B	8203		E2 80 8B	200B	Zero Width Space
A	U+0041	65	A	41	0041	—

Inspected locally in your browser. Your text stays only on this page.

See exactly what is in a string

Text pasted from documents, chat, and spreadsheets routinely carries invisible or look-alike characters — non-breaking spaces, zero-width joiners, smart quotes, combining marks — that silently break diffs, regexes, and CSV or JSON parsing. This inspector breaks your input into one row per Unicode code point and reveals each character's code point, decimal value, HTML numeric entity, and UTF-8 and UTF-16 byte encodings.

Astral characters and most emoji are a single code point even though they take two UTF-16 code units, and combining marks count as their own code points — which is why the code-point, UTF-16, and UTF-8 counts in the summary can all differ. Invisible and non-printing characters show as a · placeholder with their name so they never collapse into a blank cell.