Configuring Retries
In order for Linkerd to do automatic retries of failures, there are two questions that need to be answered:
- Which requests should be retried?
- How many times should the requests be retried?
Both of these questions can be answered by adding annotations to the Service, HTTPRoute, or GRPCRoute resource you’re sending requests to.
The reason why these pieces of configuration are required is because retries can potentially be dangerous. Automatically retrying a request that changes state (e.g. a request that submits a financial transaction) could potentially impact your user’s experience negatively. In addition, retries increase the load on your system. A set of services that have requests being constantly retried could potentially get taken down by the retries instead of being allowed time to recover.
Check out the retries section of the books demo for a tutorial of how to configure retries.
Retries
For HTTPRoutes that are idempotent, you can add the retry.linkerd.io/http: 5xx
annotation which instructs Linkerd to retry any requests which fail with an HTTP
response status in the 500s.
Note that requests will not be retried if the body exceeds 64KiB.
Retry Limits
You can also add the retry.linkerd.io/limit
annotation to specify the maximum
number of times a request may be retried. By default, this limit is 1
.
gRPC Retries
Retries can also be configured for gRPC traffic by adding the
retry.linkerd.io/grpc
annotation to a GRPCRoute or Service resource. The value
of this annotation is a comma seperated list of gRPC status codes that should
be retried.