API Gateway

Intro

An API gateway is a single entry point between external clients and a set of backend services. It centralizes cross-cutting concerns such as request routing, authentication and authorization enforcement, rate limiting, TLS termination, and traffic policies, so individual services do not have to re-implement them. That gives you one place to enforce consistency and security, and it keeps clients simpler because they no longer need to call many services directly. Reach for it when microservices serve multiple consumer types, or when you want a BFF (Backend for Frontend) approach in which each client family gets an API surface tailored to its needs.

In the .NET ecosystem, a common approach is to run a reverse proxy as the gateway at the system edge and keep business behavior inside the domain services.

Core Responsibilities

flowchart LR
    Client[Client Apps] --> Gateway[API Gateway]
    Gateway --> SvcA[Service A]
    Gateway --> SvcB[Service B]
    Gateway --> SvcC[Service C]

Request routing: Map incoming paths, headers, hostnames, or methods to the right downstream service.
Authentication and authorization: Validate tokens at the edge and enforce coarse-grained access policy before forwarding.
Rate limiting and quotas: Protect services from abusive or accidental traffic spikes.
Request and response transformation: Normalize payload shape, hide internal endpoint changes, or project data for specific clients.
Load balancing: Distribute requests across service instances using health-aware selection.
Circuit breaking and resiliency policies: Fail fast when a downstream is unhealthy and apply retries or fallback only where safe.
TLS termination: Offload certificate handling and HTTPS policy enforcement from every backend service.
Observability: Emit centralized logs, traces, metrics, and correlation IDs for end-to-end troubleshooting.
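
The circuit-breaking responsibility can be illustrated with a small state machine. The sketch below is a minimal, hand-rolled example: the class name SimpleCircuitBreaker, the threshold, and the cool-down are all hypothetical, and it is not thread-safe. A production gateway would more likely rely on an established resilience library than on code like this.

```csharp
using System;

// Hypothetical, minimal circuit breaker: opens after N consecutive failures,
// rejects calls while open, and allows a trial call once the cool-down elapses.
public sealed class SimpleCircuitBreaker
{
    private readonly int _failureThreshold;
    private readonly TimeSpan _openDuration;
    private readonly Func<DateTimeOffset> _clock;
    private int _consecutiveFailures;
    private DateTimeOffset? _openedAt;

    public SimpleCircuitBreaker(int failureThreshold, TimeSpan openDuration,
                                Func<DateTimeOffset>? clock = null)
    {
        _failureThreshold = failureThreshold;
        _openDuration = openDuration;
        _clock = clock ?? (() => DateTimeOffset.UtcNow); // injectable for tests
    }

    // True while the breaker is open and the cool-down has not yet elapsed.
    public bool IsOpen => _openedAt is { } openedAt && _clock() - openedAt < _openDuration;

    public T Execute<T>(Func<T> downstreamCall)
    {
        if (IsOpen)
            throw new InvalidOperationException("Circuit open: failing fast.");

        try
        {
            T result = downstreamCall();
            _consecutiveFailures = 0;   // a success closes the circuit
            _openedAt = null;
            return result;
        }
        catch
        {
            _consecutiveFailures++;
            if (_consecutiveFailures >= _failureThreshold)
                _openedAt = _clock();   // open (or re-open) and restart the cool-down
            throw;
        }
    }
}
```

While the breaker is open, callers get an immediate exception instead of waiting on an unhealthy downstream, which is the "fail fast" behavior described in the list above.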

Patterns

Gateway Routing

Use the gateway as the policy and routing edge. Clients call one host, and route rules dispatch traffic to internal services.

When it works best:

Many services are private on internal networks.
You need consistent auth and throttling policy.
You want controlled API evolution at the boundary.

Gateway Aggregation

The gateway composes a single response from multiple service calls to reduce client round trips.

Concrete example:

Mobile app needs order summary page.
Gateway calls Orders, Payments, and Shipping services.
Gateway returns one payload tuned for the mobile screen.

Use carefully: aggregation should only compose responses, not make domain decisions. Keep it thin and response-oriented.
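
A minimal sketch of the order summary example, assuming each downstream call can be represented as an async delegate. The OrderSummary record, the OrderSummaryAggregator class, and the string-valued statuses are hypothetical; in a real gateway the delegates would wrap HTTP calls to the Orders, Payments, and Shipping services.

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical response shape tuned for the mobile order summary screen.
public sealed record OrderSummary(string OrderStatus, string PaymentStatus, string ShippingStatus);

public static class OrderSummaryAggregator
{
    // Each delegate stands in for a call to one downstream service.
    public static async Task<OrderSummary> GetAsync(
        string orderId,
        Func<string, CancellationToken, Task<string>> getOrderStatus,
        Func<string, CancellationToken, Task<string>> getPaymentStatus,
        Func<string, CancellationToken, Task<string>> getShippingStatus,
        TimeSpan timeout)
    {
        // One shared deadline for the whole fan-out.
        using var cts = new CancellationTokenSource(timeout);

        // Start all three calls before awaiting any, so they run concurrently.
        Task<string> order = getOrderStatus(orderId, cts.Token);
        Task<string> payment = getPaymentStatus(orderId, cts.Token);
        Task<string> shipping = getShippingStatus(orderId, cts.Token);

        await Task.WhenAll(order, payment, shipping);

        // Composition only: no business decisions are made here.
        return new OrderSummary(await order, await payment, await shipping);
    }
}
```

Running the calls concurrently keeps the aggregate latency close to that of the slowest downstream call rather than the sum of all three.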

Gateway Offloading

The gateway handles edge concerns such as TLS, compression, CORS, header normalization, and request size limits.

Benefit:

Service teams focus on domain behavior.
Security and policy changes roll out in one place.
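
As an illustration, several of these edge concerns map onto standard ASP.NET Core middleware placed in front of the proxy. This is a sketch under assumptions: the CORS origin, the policy name "web-clients", and the 1 MiB body limit are hypothetical values, and the "ReverseProxy" configuration section is assumed to exist.

```csharp
using Microsoft.AspNetCore.Builder;
using Microsoft.AspNetCore.Hosting;
using Microsoft.Extensions.DependencyInjection;

var builder = WebApplication.CreateBuilder(args);

// Cap request bodies at the edge (1 MiB is an arbitrary example value).
builder.WebHost.ConfigureKestrel(options =>
    options.Limits.MaxRequestBodySize = 1 * 1024 * 1024);

// Response compression; by default this does not apply to HTTPS responses.
builder.Services.AddResponseCompression();

builder.Services.AddCors(options =>
    options.AddPolicy("web-clients", policy => policy
        .WithOrigins("https://app.example.com")   // hypothetical front-end origin
        .AllowAnyHeader()
        .AllowAnyMethod()));

builder.Services
    .AddReverseProxy()
    .LoadFromConfig(builder.Configuration.GetSection("ReverseProxy"));

var app = builder.Build();

app.UseHsts();                 // send the HTTPS policy header
app.UseHttpsRedirection();     // redirect plain HTTP to HTTPS
app.UseResponseCompression();
app.UseCors("web-clients");

app.MapReverseProxy();

app.Run();
```

Because these settings live in the gateway's pipeline, changing a limit or a CORS origin is a single deployment rather than a change to every backend service.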

BFF (Backend for Frontend)

Separate gateways or gateway routes per client type (web, mobile, partner API) when each has different payload, latency, or auth requirements.

Why this is useful:

Web clients might need richer payloads.
Mobile clients might need smaller aggregated responses.
Partner APIs often need stricter contract stability and separate throttling.
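
One way to express "gateway routes per client type" is host-based matching in YARP, with each client family routed to its own BFF backend. The sketch below assumes YARP 2.x (for LoadFromMemory); all host names, route IDs, cluster IDs, and addresses are hypothetical.

```csharp
using System.Collections.Generic;
using Microsoft.AspNetCore.Builder;
using Microsoft.Extensions.DependencyInjection;
using Yarp.ReverseProxy.Configuration;

var builder = WebApplication.CreateBuilder(args);

// One route per client family, distinguished by the host the client calls.
var routes = new[]
{
    new RouteConfig
    {
        RouteId = "web-bff-route",
        ClusterId = "web-bff-cluster",
        Match = new RouteMatch { Hosts = new[] { "web.api.example.com" }, Path = "{**catch-all}" }
    },
    new RouteConfig
    {
        RouteId = "mobile-bff-route",
        ClusterId = "mobile-bff-cluster",
        Match = new RouteMatch { Hosts = new[] { "mobile.api.example.com" }, Path = "{**catch-all}" }
    }
};

// Each cluster points at the BFF service that shapes responses for that client.
var clusters = new[]
{
    new ClusterConfig
    {
        ClusterId = "web-bff-cluster",
        Destinations = new Dictionary<string, DestinationConfig>
        {
            ["web-bff-d1"] = new DestinationConfig { Address = "https://web-bff.internal/" }
        }
    },
    new ClusterConfig
    {
        ClusterId = "mobile-bff-cluster",
        Destinations = new Dictionary<string, DestinationConfig>
        {
            ["mobile-bff-d1"] = new DestinationConfig { Address = "https://mobile-bff.internal/" }
        }
    }
};

builder.Services.AddReverseProxy().LoadFromMemory(routes, clusters);

var app = builder.Build();
app.MapReverseProxy();
app.Run();
```

The same split could be made with separate gateway deployments instead of separate routes; which one fits depends on how far team ownership and traffic profiles diverge.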

.NET Implementation (YARP)

For .NET, YARP (Yet Another Reverse Proxy) is a Microsoft-maintained reverse proxy library (Yarp.ReverseProxy) that you can use as the core of an API gateway.

Minimal appsettings.json routing and cluster example:

{
  "ReverseProxy": {
    "Routes": {
      "orders-route": {
        "ClusterId": "orders-cluster",
        "Match": {
          "Path": "/api/orders/{**catch-all}"
        }
      },
      "catalog-route": {
        "ClusterId": "catalog-cluster",
        "Match": {
          "Path": "/api/catalog/{**catch-all}"
        }
      }
    },
    "Clusters": {
      "orders-cluster": {
        "Destinations": {
          "orders-d1": {
            "Address": "https://orders-service.internal/"
          }
        }
      },
      "catalog-cluster": {
        "Destinations": {
          "catalog-d1": {
            "Address": "https://catalog-service.internal/"
          }
        }
      }
    }
  }
}

Minimal registration in ASP.NET Core:

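// Requires the Yarp.ReverseProxy NuGet package.
var builder = WebApplication.CreateBuilder(args);

// Register the proxy and load routes and clusters from the "ReverseProxy" section.
builder.Services
    .AddReverseProxy()
    .LoadFromConfig(builder.Configuration.GetSection("ReverseProxy"));

var app = builder.Build();

// Expose the configured routes as endpoints.
app.MapReverseProxy();

app.Run();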

YARP composes well with ASP.NET Core middleware and observability tooling. Ocelot is a known alternative and can be a pragmatic fit in teams already invested in its ecosystem.
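
As a sketch of that composition, the standard authentication, authorization, and rate limiting middleware can be applied to the proxy endpoints. It assumes .NET 7 or later and the Microsoft.AspNetCore.Authentication.JwtBearer package; the authority URL, audience, policy names, and limits are hypothetical.

```csharp
using System;
using Microsoft.AspNetCore.Authentication.JwtBearer;
using Microsoft.AspNetCore.Builder;
using Microsoft.AspNetCore.Http;
using Microsoft.AspNetCore.RateLimiting;
using Microsoft.Extensions.DependencyInjection;

var builder = WebApplication.CreateBuilder(args);

// Validate bearer tokens at the edge.
builder.Services
    .AddAuthentication(JwtBearerDefaults.AuthenticationScheme)
    .AddJwtBearer(options =>
    {
        options.Authority = "https://login.example.com/"; // hypothetical identity provider
        options.Audience = "api-gateway";                  // hypothetical audience
    });

// Coarse-grained policy: any authenticated caller may pass.
builder.Services.AddAuthorization(options =>
    options.AddPolicy("authenticated", policy => policy.RequireAuthenticatedUser()));

builder.Services.AddRateLimiter(options =>
{
    options.RejectionStatusCode = StatusCodes.Status429TooManyRequests;

    // A single shared window for every caller; per-client limits would need
    // a partitioned policy instead.
    options.AddFixedWindowLimiter("edge-default", limiter =>
    {
        limiter.PermitLimit = 100;
        limiter.Window = TimeSpan.FromMinutes(1);
    });
});

builder.Services
    .AddReverseProxy()
    .LoadFromConfig(builder.Configuration.GetSection("ReverseProxy"));

var app = builder.Build();

app.UseAuthentication();
app.UseAuthorization();
app.UseRateLimiter();

// Apply both policies to every proxied route.
app.MapReverseProxy()
    .RequireAuthorization("authenticated")
    .RequireRateLimiting("edge-default");

app.Run();
```

Fine-grained authorization decisions still belong in the downstream services; the gateway only rejects requests that are clearly unauthenticated or over limit.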

Gateway vs Service Mesh

An API gateway and a service mesh handle different traffic directions and are often used together.

Gateway (north-south): Handles client-to-system traffic, public API exposure, edge auth, and external policy enforcement.
Service mesh (east-west): Handles service-to-service traffic inside the platform, including mTLS, retries, traffic shifting, and per-service telemetry.

Rule of thumb:

Put internet-facing boundary policy in the gateway.
Put internal service communication policy in the mesh.

Tradeoffs

Direct client to services vs gateway: Direct calls remove one network hop but increase client complexity and force every service to duplicate policy enforcement.
Single gateway vs BFF gateways: Single gateway is simpler to operate; BFF improves client optimization and team autonomy at the cost of more moving parts.
Centralized transformation vs service-owned contracts: Gateway transformations can shield clients from churn, but too much translation can hide unhealthy service boundaries.

Pitfalls

Gateway becomes a monolithic bottleneck
- What goes wrong: every change flows through one oversized gateway, and outages impact all consumers.
- Why it happens: uncontrolled feature growth and weak horizontal scaling strategy.
- How to prevent/detect: keep gateway stateless, scale out aggressively, split by bounded context or BFF when ownership and traffic diverge.
Business logic creeps into the gateway
- What goes wrong: domain rules are duplicated at the edge, causing inconsistent behavior and hard-to-test flows.
- Why it happens: aggregation code gradually turns into orchestration and then decision logic.
- How to prevent/detect: enforce a boundary rule that gateway owns transport and policy only; domain invariants stay in services.
Extra latency from the additional hop
- What goes wrong: p95 and p99 latency increase, especially under fan-out aggregation.
- Why it happens: more network hops, serialization work, and downstream dependency chains.
- How to prevent/detect: measure end-to-end traces, cap fan-out depth, use parallel downstream calls, and cache only where freshness allows.
Configuration sprawl with many routes
- What goes wrong: route conflicts, accidental exposure, and hard-to-review config changes.
- Why it happens: rapid service growth without governance for route naming and ownership.
- How to prevent/detect: define route conventions, enforce config validation in CI, and assign clear ownership per route group.
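
The "config validation in CI" step for route sprawl could start as small as a duplicate-path check. The sketch below is an assumption-laden example: RouteConfigLint and FindDuplicatePaths are hypothetical names, it expects the appsettings.json shape shown earlier, and it assumes every route declares Match.Path. It does not detect overlapping patterns, only identical ones.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text.Json;

public static class RouteConfigLint
{
    // Returns one message per Match.Path that is claimed by more than one route.
    public static IReadOnlyList<string> FindDuplicatePaths(string appsettingsJson)
    {
        using JsonDocument doc = JsonDocument.Parse(appsettingsJson);
        JsonElement routes = doc.RootElement.GetProperty("ReverseProxy").GetProperty("Routes");

        // Group route IDs by their path pattern, ignoring case.
        var byPath = routes.EnumerateObject()
            .Select(r => (RouteId: r.Name,
                          Path: r.Value.GetProperty("Match").GetProperty("Path").GetString() ?? ""))
            .GroupBy(r => r.Path, StringComparer.OrdinalIgnoreCase);

        var problems = new List<string>();
        foreach (var group in byPath)
        {
            if (group.Count() < 2) continue;   // path is owned by exactly one route
            string owners = string.Join(", ", group.Select(r => r.RouteId));
            problems.Add($"Path '{group.Key}' is claimed by: {owners}");
        }
        return problems;
    }
}
```

A CI job could run this against the gateway's configuration file and fail the build when the returned list is not empty.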

Questions

How do you design gateway aggregation endpoints for client efficiency, and what do you explicitly keep out of the gateway?

Expected answer:

Use gateway routing plus targeted aggregation endpoints for mobile-specific read models.
Keep gateway concerns to auth, throttling, routing, transformation, and observability.
Keep business rules, transactions, and domain invariants inside backend services.
Add correlation IDs and tracing across fan-out calls.
Why this matters: This tests whether you can design for client efficiency without turning the gateway into a distributed monolith.

When would you choose a single API gateway versus a BFF approach?

Expected answer:

Choose a single gateway for simpler systems with similar client needs and shared release cadence.
Choose BFF when web/mobile/partner clients have materially different payload, auth, or latency profiles.
Consider team ownership boundaries and deployment autonomy.
Evaluate operational cost versus client performance and change isolation.
Why this matters: This reveals whether you can apply tradeoffs, not just name patterns.

Explain API Gateway vs Service Mesh in one architecture and where each policy belongs.

Expected answer:

Gateway owns north-south concerns: edge auth, TLS termination, external rate limits, API surface control.
Mesh owns east-west concerns: mTLS, retries, traffic policy, and internal telemetry.
They complement each other rather than compete.
Why this matters: Interviewers want clear boundary thinking for platform design decisions.

References

API Gateway pattern (Azure Architecture Center) — pattern description covering routing, aggregation, and offloading cross-cutting concerns.
YARP documentation — official getting-started guide for Microsoft's YARP reverse proxy library for .NET.
YARP GitHub repository — source code, samples, and issue tracker for the YARP project.
Ocelot documentation — configuration reference for the Ocelot .NET API gateway including routing, authentication, and rate limiting.
Microservices.io — API Gateway pattern (Chris Richardson) — pattern catalog entry covering API gateway vs BFF, forces, and consequences in microservices architectures.
