URL Encoder / Decoder

Q: What is percent-encoding?

A scheme defined in RFC 3986 that replaces each byte of an unsafe character in a URL with % followed by the two hexadecimal digits of that byte. For example, a space encodes to %20.

Q: When does a URL break?

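When a character with structural meaning (&, ?, =, #) appears unencoded inside a parameter value, or when the URL contains characters that are not allowed at all, such as spaces or non-ASCII characters like accented letters and ñ. The server or browser may truncate or misparse the URL at the first problem character.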

Q: encodeURI or encodeURIComponent?

encodeURI encodes a complete URL while preserving its structure (it leaves characters such as / ? & = # alone). encodeURIComponent encodes a single value going inside a parameter: it escapes everything except letters, digits, and - _ . ! ~ * ' ( ). For query string values you almost always want encodeURIComponent.

What percent-encoding is and why it exists

URLs were standardized in 1994 (RFC 1738) with a restricted alphabet: ASCII letters, digits, and a handful of special characters that carry structural meaning (/, ?, &, =, #). Anything else (a space, an accented letter, an emoji) must be represented some other way so that browser and server transmit it without ambiguity. That mechanism is percent-encoding (a.k.a. URL encoding), now defined in RFC 3986.

The basic rule

Each unsafe byte in a URL is replaced by % followed by the two hex digits representing that byte. A space (byte 0x20) becomes %20. An ñ in UTF-8 is two bytes (0xC3 0xB1) and encodes to %C3%B1. Encoding operates on bytes, not characters, so the result depends on the character encoding used to turn text into bytes. UTF-8 is the de facto standard on the web, and it is what encodeURI and encodeURIComponent always use.
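The byte-level rule can be sketched in a few lines. The helper name percentEncodeAllBytes is hypothetical, and it deliberately encodes every byte (a real encoder leaves unreserved characters alone); it only illustrates how the % sequences are derived from UTF-8 bytes.

```javascript
// Illustrative only: percent-encode EVERY byte of the UTF-8 form of a string.
// Real encoders skip unreserved characters such as letters and digits.
function percentEncodeAllBytes(text) {
  const bytes = new TextEncoder().encode(text); // text -> UTF-8 bytes
  return Array.from(
    bytes,
    (b) => "%" + b.toString(16).toUpperCase().padStart(2, "0")
  ).join("");
}

const encodedSpace = percentEncodeAllBytes(" "); // "%20"
const encodedEnye = percentEncodeAllBytes("ñ");  // "%C3%B1" (two UTF-8 bytes)
const builtinEnye = encodeURIComponent("ñ");     // "%C3%B1", same bytes

console.log(encodedSpace, encodedEnye, builtinEnye);
```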

encodeURI vs encodeURIComponent

encodeURI: respects URL structure. It does not encode / ? & = # : (nor ; , @ + $). Use it when you have a complete URL containing spaces or non-ASCII characters that must remain navigable.
encodeURIComponent: encodes aggressively. It also encodes / ? & = # : and leaves only letters, digits, and - _ . ! ~ * ' ( ) untouched. Use it for values going inside a parameter (after =) that must not be confused with structure.
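A short comparison of the two functions on the same kind of input. The URL and values below are made up for illustration.

```javascript
// A parameter value that contains structural characters.
const rawValue = "coffee & cake/tea?";
// A whole URL with spaces and accents in the path (hypothetical address).
const rawUrl = "https://example.com/menú del día?q=a&b=c#top";

// encodeURIComponent escapes &, / and ? so they cannot be read as structure.
const valueEncoded = encodeURIComponent(rawValue);
// "coffee%20%26%20cake%2Ftea%3F"

// encodeURI escapes only the space and the accented letters; / ? & = # stay.
const urlEncoded = encodeURI(rawUrl);
// "https://example.com/men%C3%BA%20del%20d%C3%ADa?q=a&b=c#top"

console.log(valueEncoded);
console.log(urlEncoded);
```

Running encodeURIComponent on the whole URL would also escape the :// and every /, which is why it is reserved for individual values.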

Typical scenarios

Building URLs dynamically. If you concatenate ?q= with a user's search term, that term must go through encodeURIComponent. If the user typed "coffee & cake", an unencoded & looks like the start of a new parameter to whatever parses the query string, and the space is not valid in a URL at all.
Calling REST APIs. Same idea: any value traveling in a query string must be encoded.
Decoding what you receive. When you read req.query in a backend, frameworks usually decode for you, but if you handle raw URLs you must decode manually.
UTM and tracking. If a campaign uses accents in the medium or campaign field, encode them; otherwise analytics tools may garble or drop them.
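The scenarios above can be sketched as follows. The example.com endpoint and the parameter values are assumptions for illustration; URL and URLSearchParams are the standard web APIs, also available in Node.js.

```javascript
const searchTerm = "coffee & cake";

// Manual concatenation: the value goes through encodeURIComponent.
const manualUrl =
  "https://example.com/search?q=" + encodeURIComponent(searchTerm);
// "https://example.com/search?q=coffee%20%26%20cake"

// Letting URLSearchParams encode for you (note: it writes spaces as +).
const builtUrl = new URL("https://example.com/search");
builtUrl.searchParams.set("q", searchTerm);
builtUrl.searchParams.set("utm_campaign", "promoción");
// builtUrl.href:
// "https://example.com/search?q=coffee+%26+cake&utm_campaign=promoci%C3%B3n"

// Decoding a raw query string by hand when no framework does it.
const received = new URLSearchParams("q=coffee%20%26%20cake");
const decodedTerm = received.get("q"); // "coffee & cake"

console.log(manualUrl, builtUrl.href, decodedTerm);
```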

Reserved characters

RFC 3986 defines two groups. Reserved characters carry structural meaning: : / ? # [ ] @ ! $ & ' ( ) * + , ; =. Unreserved characters are always safe: letters, digits, - _ . ~. Everything else must be encoded. encodeURIComponent encodes every reserved character except ! ' ( ) *, which it leaves as they are; encodeURI leaves all reserved characters alone except [ and ], which it encodes.
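One way to check how the built-in functions treat the RFC 3986 reserved set is to run each character through them. The helper encodeRfc3986Component is a hypothetical name for a stricter variant that also escapes ! ' ( ) *, for systems that expect every reserved character to be encoded.

```javascript
const reservedChars = ":/?#[]@!$&'()*+,;=";

// Reserved characters that each function leaves unencoded.
const keptByComponent = [...reservedChars]
  .filter((c) => encodeURIComponent(c) === c)
  .join(""); // "!'()*"
const keptByEncodeURI = [...reservedChars]
  .filter((c) => encodeURI(c) === c)
  .join(""); // ":/?#@!$&'()*+,;="  (only [ and ] are encoded)

// Hypothetical stricter helper: also escape the five characters
// that encodeURIComponent leaves alone.
function encodeRfc3986Component(value) {
  return encodeURIComponent(value).replace(
    /[!'()*]/g,
    (c) => "%" + c.charCodeAt(0).toString(16).toUpperCase()
  );
}

console.log(keptByComponent, keptByEncodeURI);
console.log(encodeRfc3986Component("a(b)!")); // "a%28b%29%21"
```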

Common mistakes

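Double-encoding. Encoding an already encoded string escapes the % itself, so %20 becomes %2520. If the server decodes only once, you end up with a literal %20 in your data.
Forgetting + in query strings. Some systems encode a space as + (a legacy of application/x-www-form-urlencoded). decodeURIComponent does not turn + back into a space, and a literal + in a value must be sent as %2B. Know which convention you're dealing with.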
Encoding domains. Hostnames with non-ASCII characters use IDN (punycode), not percent-encoding. café.com becomes xn--caf-dma.com, not caf%C3%A9.com.
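The three mistakes above can be reproduced with the standard functions. The variable names are illustrative, and the hostname conversion assumes a runtime whose URL parser implements IDN handling (browsers and current Node.js do).

```javascript
// Double-encoding: the % of %20 is itself escaped as %25.
const encodedOnce = encodeURIComponent("a b");        // "a%20b"
const encodedTwice = encodeURIComponent(encodedOnce); // "a%2520b"
const decodedOnce = decodeURIComponent(encodedTwice); // "a%20b" (literal %20)

// The + convention: decodeURIComponent keeps +, URLSearchParams reads it
// as a space.
const plusKept = decodeURIComponent("coffee+cake"); // "coffee+cake"
const plusAsSpace = new URLSearchParams("q=coffee+cake").get("q"); // "coffee cake"
const literalPlus = encodeURIComponent("1+1"); // "1%2B1"

// Domains: the URL parser applies punycode to the hostname,
// not percent-encoding. "\u00e9" is the letter é.
const idnHost = new URL("https://caf\u00e9.com/").hostname; // "xn--caf-dma.com"

console.log(encodedTwice, decodedOnce, plusKept, plusAsSpace, literalPlus, idnHost);
```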

FAQ

What is percent-encoding?

An RFC 3986 scheme that replaces unsafe characters with % followed by two hex digits. A space encodes to %20.

When does a URL break?

When it contains unencoded structural characters (&, ?, =, #) inside a parameter value, or spaces and non-ASCII characters, which are never allowed unencoded.

encodeURI or encodeURIComponent?

encodeURI for whole URLs. encodeURIComponent for parameter values. Almost always encodeURIComponent for query strings.