Configuring your driver for ArangoDB access
In this chapter you’ll learn how to configure a driver for accessing an ArangoDB deployment in Kubernetes.
The exact methods to configure a driver are specific to that driver.
Database endpoint(s)
The endpoint(s) (or URLs) to communicate with is the most important parameter your need to configure in your driver.
Finding the right endpoints depend on wether your client application is running in the same Kubernetes cluster as the ArangoDB deployment or not.
Client application in same Kubernetes cluster
If your client application is running in the same Kubernetes cluster as the ArangoDB deployment, you should configure your driver to use the following endpoint:
https://<deployment-name>.<namespace>.svc:8529
Only if your deployment has set spec.tls.caSecretName
to None
, should you use http
instead of https
.
Client application outside Kubernetes cluster
If your client application is running outside the Kubernetes cluster in which the ArangoDB deployment is running, your driver endpoint depends on the external-access configuration of your ArangoDB deployment.
If the external-access of the ArangoDB deployment is of type LoadBalancer
, then use the IP address of that LoadBalancer
like this:
https://<load-balancer-ip>:8529
If the external-access of the ArangoDB deployment is of type NodePort
, then use the IP address(es) of the Nodes
of the Kubernetes cluster, combined with the NodePort
that is used by the external-access service.
For example:
https://<kubernetes-node-1-ip>:30123
You can find the type of external-access by inspecting the external-access Service
. To do so, run the following command:
kubectl get service -n <namespace-of-deployment> <deployment-name>-ea
The output looks like this:
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE SELECTOR
example-simple-cluster-ea LoadBalancer 10.106.175.38 192.168.10.208 8529:31890/TCP 1s app=arangodb,arango_deployment=example-simple-cluster,role=coordinator
In this case the external-access is of type LoadBalancer
with a load-balancer IP address of 192.168.10.208
. This results in an endpoint of https://192.168.10.208:8529
.
TLS settings
As mentioned before the ArangoDB deployment managed by the ArangoDB operator will use a secure (TLS) connection unless you set spec.tls.caSecretName
to None
in your ArangoDeployment
.
When using a secure connection, you can choose to verify the server certificates provides by the ArangoDB servers or not.
If you want to verify these certificates, configure your driver with the CA certificate found in a Kubernetes Secret
found in the same namespace as the ArangoDeployment
.
The name of this Secret
is stored in the spec.tls.caSecretName
setting of the ArangoDeployment
. If you don’t set this setting explicitly, it will be set automatically.
Then fetch the CA secret using the following command (or use a Kubernetes client library to fetch it):
kubectl get secret -n <namespace> <secret-name> --template='{{index .data "ca.crt"}}' | base64 -D > ca.crt
This results in a file called ca.crt
containing a PEM encoded, x509 CA certificate.
Query requests
For most client requests made by a driver, it does not matter if there is any kind of load-balancer between your client application and the ArangoDB deployment.
Note that even a simple Service
of type ClusterIP
already behaves as a load-balancer.
The exception to this is cursor-related requests made to an ArangoDB Cluster
deployment. The Coordinator that handles an initial query request (that results in a Cursor
) will save some in-memory state in that Coordinator, if the result of the query is too big to be transfer back in the response of the initial request.
Follow-up requests have to be made to fetch the remaining data. These follow-up requests must be handled by the same Coordinator to which the initial request was made. As soon as there is a load-balancer between your client application and the ArangoDB cluster, it is uncertain which Coordinator will receive the follow-up request.
ArangoDB will transparently forward any mismatched requests to the correct Coordinator, so the requests can be answered correctly without any additional configuration. However, this incurs a small latency penalty due to the extra request across the internal network.
To prevent this uncertainty client-side, make sure to run your client application in the same Kubernetes cluster and synchronize your endpoints before making the initial query request. This will result in the use (by the driver) of internal DNS names of all Coordinators. A follow-up request can then be sent to exactly the same Coordinator.
If your client application is running outside the Kubernetes cluster the easiest way to work around it is by making sure that the query results are small enough to be returned by a single request. When that is not feasible, it is also possible to resolve this when the internal DNS names of your Kubernetes cluster are exposed to your client application and the resulting IP addresses are routable from your client application. To expose internal DNS names of your Kubernetes cluster, your can use CoreDNS.