Question 1

What is URL encoding (percent-encoding) and when do I need it?

Accepted Answer

URL encoding represents characters that have a special meaning inside a URL — or that aren't allowed at all — as a three-character %HH escape, where HH is the byte's hex value. You need it whenever a value travels through a URL: a search query containing '&' or '=', a path segment with a space, a redirect parameter that itself contains another URL, or a form field submitted with a GET. Without it, the URL parser would split your value at the wrong place.

Question 2

What's the difference between encodeURIComponent and encodeURI?

Accepted Answer

encodeURIComponent escapes everything except the 66 RFC 3986 unreserved characters (A-Z, a-z, 0-9, '-', '.', '_', '~'). It is for a single component value — a query value, a path segment, a fragment. encodeURI also preserves the reserved characters ': / ? # [ ] @ ! $ & ' ( ) * + , ; ='. It is for a whole URI that is already structured — you only want to escape illegal characters (spaces, control characters, non-ASCII). The two produce identical output for purely-unreserved input.

Question 3

Why does a space become '%20' here but '+' in some other tools?

Accepted Answer

Both are correct in their own contexts. RFC 3986 percent-encoding always represents a space as %20. The 'application/x-www-form-urlencoded' serializer defined in the WHATWG URL spec — used by HTML forms submitted with GET, by URLSearchParams, and by most server-side query parsers — instead uses '+'. The Form profile on this page outputs '+' for spaces; the Component and Whole-URI profiles output %20. Pick by where the value is going.

Question 4

Does this tool handle Sinhala, Tamil, and emoji correctly?

Accepted Answer

Yes. Every input is converted to UTF-8 bytes by the browser's TextEncoder before percent-encoding, then each non-unreserved byte is emitted as %HH. The decoder reverses both legs: it percent-decodes to bytes, then runs a UTF-8 decoder. Sinhala ශ්‍රී ලංකා, Tamil தமிழ், combining marks, zero-width joiners, and emoji round-trip without loss — try the sample text in the encode tab.

Question 5

Why did my decode fail with 'malformed percent-escape'?

Accepted Answer

Every '%' in a percent-encoded string must be followed by exactly two hex digits. The error fires when that rule is broken — for example, '%2World' (only one hex digit after %) or '100%' (no hex digits at the end). Two common causes: a literal '%' from a non-encoded source was pasted in, or characters were trimmed off the end of the string. Either re-encode the original input, or escape any literal '%' as %25 before decoding.

Question 6

Is it safe to double-encode? When would I do that?

Accepted Answer

Double-encoding is rarely what you want. It happens when a value containing percent-escapes is run through an encoder a second time — each '%' becomes '%25'. Real cases for it: passing an already-encoded URL as a query value to another service (then that service decodes once, you decode again on receipt), and storing user-entered text that may contain literal '%' in an interpolated template. The page shows a yellow hint when the input on the encode tab already contains a %HH pattern.

Question 7

Does this tool send my URL to a server?

Accepted Answer

No. Encoding, decoding, and the optional URL parsing all run as pure JavaScript inside your browser tab. Open the Network panel in DevTools and you'll see it stay silent while you work. The page is safe for unreleased URLs, signed tokens, internal API keys, and any private payload.

Question 8

Which characters are 'unreserved' and pass through unchanged?

Accepted Answer

RFC 3986 §2.3 defines 66 unreserved characters: the 26 uppercase letters, the 26 lowercase letters, the 10 digits, and the four marks '-', '.', '_', '~'. The encoder leaves all of these alone under every profile. The 'reserved' characters (':', '/', '?', '#', '[', ']', '@', '!', '$', '&', "'", '(', ')', '*', '+', ',', ';', '=') have a special meaning in some parts of a URL; encodeURIComponent escapes them, encodeURI preserves them.

Question 9

Why is the encoded length sometimes 4× the input?

Accepted Answer

Because each non-ASCII character is encoded as its full UTF-8 byte sequence, with each byte taking three output characters (%HH). A common Latin Extended character ('é', 2 bytes UTF-8) becomes 6 chars. A Sinhala letter (typically 3 bytes UTF-8) becomes 9 chars. A 4-byte emoji becomes 12 chars. The output counter in the result panel shows the exact ratio for what you pasted, so you can decide whether a shorter alternative (base64url, or a server-side ID lookup) makes more sense.

Question 10

What does the 'Verified · round-trip' badge mean?

Accepted Answer

Every page load runs a built-in test: the tool encodes a probe string containing ASCII reserved characters, a Sinhala word, an emoji, and a literal '+' through each of the three profiles (component, whole URI, form), then decodes the output and confirms it matches the original. When all three legs pass, the badge is green. If you ever see it flip to red on a real input, please email me — that is a regression worth fixing.

URL Encoder & Decoder

How it works

Worked examples

Frequently asked questions

Sources & references

Related tools

Slug Generator

JSON Formatter

Base64

Comments & feedback