What's the difference between concurrency and arrival rate?

Question

Accepted Answer

Concurrency (closed model) holds N users active at once — if the system slows, fewer requests per second. Arrival rate (open model) injects R requests per second regardless of latency — if the system slows, queue grows. Real production traffic behaves like an open model. Closed-model (concurrency-based) load: you specify N virtual users, each loops "send request, wait for response, sleep, repeat." If response time doubles, the same N users send half as many requests per second. Throughput is bounded by the system's speed. This is what JMeter Thread Groups and k6 vus do by default. Open-model (arrival-rate-based) load: you specify R requests per second injected into the system, regardless of how long each takes. If response time doubles, requests pile up — queue depth grows, latency tail explodes, errors arrive. This is what k6 constant-arrival-rate or Gatling constantUsersPerSec do. Why it matters: production traffic is open. Real users don't politely wait for one slow response before

What's the difference between concurrency and arrival rate?

// EXAMPLE

// WHAT INTERVIEWERS LOOK FOR

// COMMON PITFALL

What's the difference between concurrency and arrival rate?

Short answer

Detail

// EXAMPLE

// WHAT INTERVIEWERS LOOK FOR

// COMMON PITFALL