UTF-8 Tool Suite
Explore and deploy over 70 specialized serverless utilities to encode, decode, validate, and repair UTF-8 byte streams instantly.
UTF-8 to Bytes
Convert UTF-8 text into its raw byte values.
Bytes to UTF-8
Convert raw byte values back into UTF-8 text.
UTF-8 to Binary
Convert UTF-8 bytes into binary digits.
Binary to UTF-8
Convert binary digits back into UTF-8 text.
UTF-8 to Octal
Convert UTF-8 bytes into octal values.
Octal to UTF-8
Convert octal values back into UTF-8 text.
UTF-8 to Decimal
Convert UTF-8 bytes into decimal values.
Decimal to UTF-8
Convert decimal byte values back into UTF-8 text.
UTF-8 to Hex
Convert UTF-8 bytes into hexadecimal values.
Hex to UTF-8
Convert hexadecimal byte values back into UTF-8 text.
UTF-8 to Arbitrary Base
Convert UTF-8 bytes into numbers of any base up to 36.
Arbitrary Base to UTF-8
Convert numbers of any base back into UTF-8 text.
UTF-8 to UTF-16
Re-encode UTF-8 text as UTF-16 code units.
UTF-16 to UTF-8
Re-encode UTF-16 data as UTF-8 text.
UTF-8 to UTF-32
Re-encode UTF-8 text as UTF-32 code units.
UTF-32 to UTF-8
Re-encode UTF-32 data as UTF-8 text.
UTF-16LE to UTF-16BE
Swap UTF-16 between little-endian and big-endian.
UTF-32LE to UTF-32BE
Swap UTF-32 between little-endian and big-endian.
UTF-8 to UCS-2
Re-encode UTF-8 text as UCS-2 code units.
UCS-2 to UTF-8
Re-encode UCS-2 data as UTF-8 text.
Detect Endianness
Detect the byte order of an encoded stream.
Swap Endianness
Reverse the byte order of multi-byte units.
UTF-16 Surrogate Bytes
Show the surrogate-pair bytes of UTF-16 characters.
Encoding Detect
Heuristically detect the encoding of the input.
Add BOM
Prepend a byte order mark for UTF-8, UTF-16, or UTF-32.
Remove BOM
Strip a leading byte order mark.
Detect BOM
Identify the byte order mark and its encoding.
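As an illustration of what BOM detection involves, here is a minimal Python sketch. It is not the implementation behind the tools listed here; the name `detect_bom` is hypothetical. It relies only on the BOM constants in the standard `codecs` module. The UTF-32 marks are checked first because the UTF-32LE BOM begins with the same two bytes as the UTF-16LE BOM.

```python
import codecs

# Longest marks first: BOM_UTF32_LE (FF FE 00 00) starts with BOM_UTF16_LE (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def detect_bom(data: bytes):
    """Return (encoding_name, bom_length) for a leading BOM, or None if absent."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name, len(bom)
    return None
```

Stripping the mark then amounts to slicing off the reported number of bytes. Note that a stream starting with FF FE 00 00 could in principle also be UTF-16LE text beginning with a NUL character, so this sketch treats the result as a best guess.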
Byte Length per Char
Show how many UTF-8 bytes each character uses.
Leading Byte Analysis
Break down characters by sequence length: 1, 2, 3, or 4 bytes.
Split Byte Sequences
Group bytes into their per-character UTF-8 sequences.
Highlight Continuation Bytes
Mark leading versus continuation bytes in the stream.
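The distinction between leading and continuation bytes comes from the high bits of each byte. The following Python sketch shows one way to classify a byte by its bit pattern; `classify_byte` and its labels are hypothetical names chosen for illustration, not part of these tools.

```python
def classify_byte(b: int) -> str:
    """Classify one byte (0-255) by its UTF-8 bit pattern."""
    if b < 0x80:
        return "ascii"         # 0xxxxxxx: a complete 1-byte character
    if b < 0xC0:
        return "continuation"  # 10xxxxxx: follows a leading byte
    if b < 0xE0:
        return "lead-2"        # 110xxxxx: starts a 2-byte sequence
    if b < 0xF0:
        return "lead-3"        # 1110xxxx: starts a 3-byte sequence
    if b < 0xF8:
        return "lead-4"        # 11110xxx: starts a 4-byte sequence
    return "invalid"           # 11111xxx: never appears in UTF-8


# Example: "é" encodes to the two bytes C3 A9.
labels = [classify_byte(b) for b in "é".encode("utf-8")]
```

This looks at bit patterns only. A full validator would also reject 0xC0, 0xC1, and 0xF5 to 0xF7, which match a leading-byte pattern but can never occur in well-formed UTF-8.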
Byte Order Visualizer
Visualize how bytes are ordered for each character.
UTF-8 to Base64
Encode UTF-8 text into Base64.
Base64 to UTF-8
Decode Base64 back into UTF-8 text.
UTF-8 URL Encode
Percent-encode UTF-8 text byte by byte.
UTF-8 URL Decode
Decode percent-encoded UTF-8 back into text.
UTF-8 to Data URI
Wrap UTF-8 text as a data URI.
Data URI to UTF-8
Extract UTF-8 text from a data URI.
UTF-8 to HTML Bytes
Convert UTF-8 bytes into HTML numeric entities.
Percent Bytes to UTF-8
Convert raw percent-encoded bytes back into UTF-8.
UTF-8 to Quoted-Printable
Encode UTF-8 text as quoted-printable.
Quoted-Printable to UTF-8
Decode quoted-printable back into UTF-8 text.
Validate UTF-8
Check a byte stream for UTF-8 errors and report positions.
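For readers who want to see what position reporting can look like, here is a small Python sketch that leans on the standard decoder. The function name `first_utf8_error` is hypothetical, and the sketch reports only the first error rather than every one.

```python
def first_utf8_error(data: bytes):
    """Return (byte_offset, reason) for the first UTF-8 error, or None if valid."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as exc:
        # exc.start is the offset of the first offending byte.
        return exc.start, exc.reason
    return None
```

Python's strict UTF-8 decoder also rejects overlong forms and encoded surrogates, so those are reported as errors here too.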
Fix Mojibake
Repair garbled text produced when UTF-8 bytes were decoded with the wrong encoding.
Detect Double Encoding
Detect text that was UTF-8 encoded twice.
Fix Double-Encoded UTF-8
Undo a double UTF-8 encoding.
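A common cause of both mojibake and double encoding is UTF-8 bytes that were decoded as Windows-1252 and then saved as UTF-8 again. The Python sketch below reverses exactly that one case. It is an assumption that cp1252 was the wrong encoding involved; other pairs need a different codec, and `fix_double_encoded` is a hypothetical name.

```python
def fix_double_encoded(text: str) -> str:
    """Undo one round of 'UTF-8 bytes misread as cp1252'; return text unchanged on failure."""
    try:
        # Recover the original bytes, then decode them as the UTF-8 they really were.
        return text.encode("cp1252").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # The text does not fit this pattern, so leave it alone.
        return text
```

Returning the input unchanged on failure keeps the function safe to apply to text that was never damaged, such as a correctly decoded "café".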
Find Invalid Bytes
Locate bytes that are not valid in UTF-8.
Overlong Detector
Detect overlong UTF-8 encodings.
Strip Invalid UTF-8
Remove invalid bytes from the stream.
Replace Invalid with U+FFFD
Replace invalid bytes with the replacement character U+FFFD.
Roundtrip Check
Verify text survives an encode-decode roundtrip.
Latin-1 to UTF-8
Re-encode ISO-8859-1 or cp1252 text as UTF-8.
UTF-8 Byte Count
Count the total UTF-8 bytes in the text.
Byte vs Char Count
Compare the byte count against the character count.
Size Estimator
Estimate the size of the text in each encoding.
Count Multibyte Chars
Count how many characters take more than one byte.
Random UTF-8
Generate random multi-byte UTF-8 text.
Byte-Class Histogram
Show a histogram of 1-, 2-, 3-, and 4-byte characters.
UTF-8 to Code Points
Extract Unicode code points from UTF-8 text.
Code Points to UTF-8
Build UTF-8 text from a list of code points.
Mojibake Reverse Lab
Try many encoding pairs and rank candidate mojibake fixes.
Encoding Confidence Meter
Guess the encoding with a confidence score per candidate.
UTF-8 Byte Diff
Compare two strings byte by byte and highlight differences.
Truncation-Safe Cut
Cut text to a byte limit without breaking a multi-byte character.
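The idea behind a truncation-safe cut can be sketched in a few lines of Python: cut at the byte limit, then back up while the byte at the cut point is a continuation byte. The function name `truncate_utf8` is hypothetical and this is only one possible approach.

```python
def truncate_utf8(text: str, max_bytes: int) -> str:
    """Cut text so its UTF-8 encoding fits in max_bytes without splitting a character."""
    data = text.encode("utf-8")
    if len(data) <= max_bytes:
        return text
    cut = max(max_bytes, 0)
    # If the first dropped byte is a continuation byte (10xxxxxx), the cut falls
    # inside a character, so move back to that character's leading byte.
    while cut > 0 and (data[cut] & 0xC0) == 0x80:
        cut -= 1
    return data[:cut].decode("utf-8")
```

This protects code points only. A sequence of several code points that renders as one symbol, such as a base letter plus a combining accent or a multi-part emoji, can still be split.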
BOM Audit
Run a full BOM and encoding audit on the content.
Emoji Byte Cost
Show the UTF-8 byte cost of each emoji and the total.
UTF-8 vs UTF-16 Size
Compare storage size across UTF-8, UTF-16, and UTF-32.
Byte Decoding Preview
Preview how the bytes look under each candidate encoding.
Normalization Byte Impact
Show byte-size change after NFC, NFD, NFKC, and NFKD.
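To illustrate why normalization changes byte size, the Python sketch below measures the UTF-8 length of a string under each of the four forms using the standard `unicodedata` module. The helper name `normalized_sizes` is hypothetical.

```python
import unicodedata


def normalized_sizes(text: str) -> dict:
    """Map each normalization form to the UTF-8 byte length of the normalized text."""
    forms = ("NFC", "NFD", "NFKC", "NFKD")
    return {form: len(unicodedata.normalize(form, text).encode("utf-8")) for form in forms}


# "é" is 2 bytes when composed (U+00E9) and 3 bytes when decomposed
# into "e" plus the combining acute accent U+0301.
sizes = normalized_sizes("é")
```

Decomposed forms are usually the same size or larger, because each combining mark is encoded as its own multi-byte sequence.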
Percent Encoding Minimizer
Percent-encode only the bytes that strictly require it.