Traffic Shifting/Splitting
Envoy’s router can split traffic to a route in a virtual host across two or more upstream clusters. There are two common use cases.
1. Version upgrades: traffic to a route is shifted gradually from one cluster to another. The traffic shifting section describes this scenario in more detail.
2. A/B testing or multivariate testing: two or more versions
of
the same service are tested simultaneously. The traffic to the route has to
be split between clusters running different versions of the same
service. The
traffic splitting
section describes this scenario in more detail.
Traffic shifting between two upstreams
The runtime object
in the route configuration determines the probability of selecting a
particular route (and hence its cluster). By using the runtime_fraction
configuration, traffic to a particular route in a virtual host can be
gradually shifted from one cluster to another. Consider the following
example configuration, where two versions helloworld_v1
and
helloworld_v2
of a service named helloworld
are declared in the
envoy configuration file.
virtual_hosts:
- name: www2
domains:
- '*'
routes:
- match:
prefix: /
runtime_fraction:
default_value:
numerator: 50
denominator: HUNDRED
runtime_key: routing.traffic_shift.helloworld
route:
cluster: helloworld_v1
- match:
prefix: /
route:
cluster: helloworld_v2
Envoy matches routes with a first match policy. If the route has a runtime_fraction object, the request will be additionally matched based on the runtime_fraction value (or the default, if no value is specified). Thus, by placing routes back-to-back in the above example and specifying a runtime_fraction object in the first route, traffic shifting can be accomplished by changing the runtime_fraction value. The following are the approximate sequence of actions required to accomplish the task.
In the beginning, set
routing.traffic_shift.helloworld
to100
, so that all requests to thehelloworld
virtual host would match with the v1 route and be served by thehelloworld_v1
cluster.To start shifting traffic to
helloworld_v2
cluster, setrouting.traffic_shift.helloworld
to values0 < x < 100
. For instance at90
, 1 out of every 10 requests to thehelloworld
virtual host will not match the v1 route and will fall through to the v2 route.Gradually decrease the value set in
routing.traffic_shift.helloworld
so that a larger percentage of requests match the v2 route.When
routing.traffic_shift.helloworld
is set to0
, no requests to thehelloworld
virtual host will match to the v1 route. All traffic would now fall through to the v2 route and be served by thehelloworld_v2
cluster.
Traffic splitting across multiple upstreams
Consider the helloworld
example again, now with three versions (v1, v2 and
v3) instead of two. To split traffic evenly across the three versions
(i.e., 33%, 33%, 34%
), the weighted_clusters
option can be used to
specify the weight for each upstream cluster.
Unlike the previous example, a single route entry is sufficient. The weighted_clusters configuration block in a route can be used to specify multiple upstream clusters along with weights that indicate the percentage of traffic to be sent to each upstream cluster.
virtual_hosts:
- name: www2
domains:
- '*'
routes:
- match: { prefix: / }
route:
weighted_clusters:
runtime_key_prefix: routing.traffic_split.helloworld
clusters:
- name: helloworld_v1
weight: 33
- name: helloworld_v2
weight: 33
- name: helloworld_v3
weight: 34
The sum of the weights needs to be greater than 0. In the V2 API, the total weight defaults to 100, but can be modified to allow finer granularity. The total weight is now deprecated, and the relative value of each cluster weight compared to the sum of all cluster weights will be used.
The weights assigned to each cluster can be dynamically adjusted using the
following runtime variables: routing.traffic_split.helloworld.helloworld_v1
,
routing.traffic_split.helloworld.helloworld_v2
and
routing.traffic_split.helloworld.helloworld_v3
.