Reverse Proxies & API Gateways

Every system needs a front door — one place the outside world talks to, behind which your real servers hide. That front door does jobs that every request needs but that you don’t want scattered across every service: terminating encryption, checking who’s calling, deciding which backend handles the request. The blocks that play this role are the reverse proxy and, its souped-up cousin, the API gateway. They’re the last block in this part because they tie the others together: a gateway is a place where routing, caching, load balancing, and security all converge.

Forward vs reverse proxy: who is hidden

Both are intermediaries, but they hide opposite ends of the conversation:

   FORWARD PROXY  — sits in front of CLIENTS, hides the client from the server
        [ clients ] ──► [forward proxy] ──► internet ──► server
        (corporate web filter, VPN egress; the server sees the proxy, not you)

   REVERSE PROXY  — sits in front of SERVERS, hides the servers from the client
        client ──► internet ──► [reverse proxy] ──► [ your servers ]
        (the client sees one address; it never learns which backend served it)

A forward proxy acts on behalf of clients — it’s what a company puts between its employees and the internet (filtering, caching, anonymizing). A reverse proxy acts on behalf of servers — the client thinks it’s talking to one machine, but the proxy is quietly distributing requests across a fleet behind it. The rest of this page is about the reverse direction, because that’s where system design lives: it’s the public face of your backend.

What a reverse proxy does

At its plainest, a reverse proxy receives client requests and forwards them to backend servers, returning their responses. Even this simple role earns its keep:

TLS termination — handle HTTPS encryption/decryption once at the edge, so your backends speak plain HTTP internally and don’t each carry the crypto burden or the certificates.
Load balancing — spread requests across backends (this is exactly the L7 load balancer from earlier; the roles overlap heavily).
Caching — serve cached responses without bothering the backend at all.
Compression & buffering — gzip responses, absorb slow clients so backends aren’t tied up.
Hiding topology — clients see one address; you can rewire, add, or remove backends freely.

The API gateway: a reverse proxy with opinions

An API gateway is a reverse proxy that takes on application-level responsibilities — the cross-cutting concerns every API needs. Instead of each service implementing auth, rate limiting, and logging separately (and inconsistently), you implement them once, at the gateway, in front of everything. This is especially central in microservices, where the gateway is the single, coherent entry point to a sprawl of internal services.

   client ──► [ API GATEWAY ] ──► service A (orders)
                  │  • TLS termination          ──► service B (users)
                  │  • authentication & authz    ──► service C (catalog)
                  │  • rate limiting / quotas
                  │  • routing (path/host → service)
                  │  • request/response aggregation
                  │  • logging, metrics, tracing
                  └────────────────────────────────►

What it centralizes:

Authentication & authorization — verify the token / API key once, at the door, so backends can trust that whoever reached them is already authenticated.
Rate limiting & quotas — protect everything behind it from abuse and overload in one place. This pairs directly with Rate Limiting, which the gateway is the natural home for.
Routing — map a public path or host to the right internal service (/orders/* → orders service), so the messy internal map never leaks to clients.
Aggregation — combine several backend calls into one client response, so a mobile app makes one request instead of five (sometimes split out as a “backend-for-frontend”).
Observability — uniform logging, metrics, and request tracing for every call, in one spot.

The cost: a single point of failure (and a fat one)

Everything funnels through the gateway, which means the gateway is your availability and, often, your latency:

Single point of failure. If it’s down, everything is down — no service is reachable. So a gateway must be run redundantly (multiple instances behind a load balancer / anycast IP), which reintroduces the very infrastructure it sits in front of.
Performance bottleneck. Every request pays the gateway’s processing cost; a slow gateway slows the whole system.
Operational coupling. It becomes a place where many teams’ concerns collide. Pile too much logic into it and it turns into a fragile, hard-to-change chokepoint — a distributed monolith’s front desk.

What does a gateway buy us, and what does it cost? It buys one consistent, secure, observable front door and dramatically simpler backend services. It costs you a critical component that must be made highly available, a potential bottleneck, and the discipline to keep it thin — routing and cross-cutting concerns, not business logic.

How it ties the part together

Notice how much of this part converges here: the gateway terminates TLS, does L7 load balancing, caches responses, enforces rate limits, and routes to the right service — which might then drop work onto a message queue and read from the right database. DNS got the user to the gateway’s address; the gateway gets the request to the right place. The toolkit is complete — and the gateway is where the pieces meet.

Check your understanding

Forward vs reverse proxy: in each, which party is hidden, and who is the proxy acting on behalf of?
List three jobs a plain reverse proxy does, and explain why TLS termination at the edge is valuable.
What does an API gateway add beyond a reverse proxy? Why is implementing auth once at the gateway better than in each service?
Why is a gateway a single point of failure, and what must you do to keep that from being fatal?
Explain the failure mode of putting business logic into the gateway, and the rule that prevents it.