Retry

This policy enables Kuma to know how to behave if there is a failed scenario (i.e. HTTP request) which could be retried.

Usage

As usual, we can apply sources and destinations selectors to determine how retries will be performed across our data plane proxies.

The policy let you configure retry behaviour for HTTP, GRPC and TCP protocols.

Example

  1. apiVersion: kuma.io/v1alpha1
  2. kind: Retry
  3. mesh: default
  4. metadata:
  5. name: web-to-backend-retry-policy
  6. spec:
  7. sources:
  8. - match:
  9. kuma.io/service: web
  10. destinations:
  11. - match:
  12. kuma.io/service: backend
  13. conf:
  14. http:
  15. numRetries: 5
  16. perTryTimeout: 200ms
  17. backOff:
  18. baseInterval: 20ms
  19. maxInterval: 1s
  20. retriableStatusCodes:
  21. - 500
  22. - 504
  23. grpc:
  24. numRetries: 5
  25. perTryTimeout: 200ms
  26. backOff:
  27. baseInterval: 20ms
  28. maxInterval: 1s
  29. retryOn:
  30. - cancelled
  31. - deadline_exceeded
  32. - internal
  33. - resource_exhausted
  34. - unavailable
  35. tcp:
  36. maxConnectAttempts: 3

We will apply the configuration with kubectl apply -f [..].

  1. type: Retry
  2. name: web-to-backend-retry-policy
  3. mesh: default
  4. sources:
  5. - match:
  6. kuma.io/service: web
  7. destinations:
  8. - match:
  9. kuma.io/service: backend
  10. conf:
  11. http:
  12. numRetries: 5
  13. perTryTimeout: 200ms
  14. backOff:
  15. baseInterval: 20ms
  16. maxInterval: 1s
  17. retriableStatusCodes:
  18. - 500
  19. - 504
  20. grpc:
  21. numRetries: 5
  22. perTryTimeout: 200ms
  23. backOff:
  24. baseInterval: 20ms
  25. maxInterval: 1s
  26. retryOn:
  27. - cancelled
  28. - deadline_exceeded
  29. - internal
  30. - resource_exhausted
  31. - unavailable
  32. tcp:
  33. maxConnectAttempts: 3

We will apply the configuration with kumactl apply -f [..] or via the HTTP API.

HTTP

  • numRetries (optional)

    Amount of attempts which will be made on failed (and retriable) requests

  • perTryTimeout (optional)

    Amount of time after which retry attempt should timeout (i.e. all the values: 30000000ns, 30ms, 0.03s, 0.0005m are equivalent and can be used to express the same timeout value, equal to 30ms)

  • backOff (optional)

    Configuration of durations which will be used in exponential backoff strategy between retries

    • baseDuration (required)

      Base amount of time which should be taken between retries (i.e. 30ms, 0.03s, 0.0005m)

    • maxDuration (optional)

      A maximal amount of time which will be taken between retries (i.e. 1s, 0.5m)

  • retriableStatusCodes (optional)

    A list of status codes which will cause the request to be retried. When this field will be provided it will overwrite the default behaviour of accepting as retriable codes: 502, 503 and 504 and if they also should be considered as retriable you hove to manually place them in the list

    For example to add a status code 418:

    1. retriableStatusCodes:
    2. - 418
    3. - 502
    4. - 503
    5. - 504

    Note that if you won’t provide retriableStatusCodes, the default behaviour of the policy is to retry:

    • when server responds with one of status codes: 502, 503 or 504,
    • when server won’t respond at all (disconnect/reset/read timeout),
    • when server resets the stream with a REFUSED_STREAM error code.

GRPC

You can configure your GRPC Retry policy in similar fashion as the HTTP one with the only difference of the retryOn property which replace the retriableStatusCodes from the HTTP policy

  • retryOn (optional)

    List of values which will cause retry.

    Acceptable values

    • cancelled
    • deadline_exceeded
    • internal
    • resource_exhausted
    • unavailable

    Note that if retryOn is not defined or if it’s empty, the policy will default to all values and is equivalent to:

    1. retryOn:
    2. - cancelled
    3. - deadline_exceeded
    4. - internal
    5. - resource_exhausted
    6. - unavailable

TCP

  • maxConnectAmount (required)

    A maximal amount of TCP connection attempts which will be made before giving up

    This policy will make attempt to retry the TCP connection which fail to be established and will be applied in the scenario when both, the dataplane, and the TCP service matched as a destination will be down.