Migrating from Elasticsearch OSS to OpenSearch

If you want to migrate from an existing Elasticsearch OSS cluster to OpenSearch and find the snapshot approach unappealing, you can migrate your existing nodes from Elasticsearch OSS to OpenSearch.

If your existing cluster runs an older version of Elasticsearch OSS, the first step is to upgrade to version 6.x or 7.x.

Before deciding on the version of Elasticsearch OSS for your upgrade, refer to the Migrating to OpenSearch and limits on the number of nested JSON objects documentation in Breaking changes to see whether the issue will have an impact on your cluster and, therefore, your decisions about upgrades and migration.

Elasticsearch OSS supports two types of upgrades: rolling and cluster restart.

  • Rolling upgrades let you shut down one node at a time for minimal disruption of service.

    Rolling upgrades work between minor versions (for example, 6.5 to 6.8) and also support a single path to the next major version (for example, 6.8 to 7.10.2). Performing these upgrades might require intermediate upgrades to arrive at your desired version and can affect cluster performance as nodes leave and rejoin, but the cluster remains available throughout the process.

  • Cluster restart upgrades require you to shut down all nodes, perform the upgrade, and restart the cluster.

    Cluster restart upgrades work between minor versions (for example, 6.5 to 6.8) and the next major version (for example, 6.x to 7.10.2). Cluster restart upgrades are faster to perform and require fewer intermediate upgrades, but require downtime.

To migrate a post-fork version of Elasticsearch (7.11+) to OpenSearch, you can use Logstash. You’ll need to employ the Elasticsearch input plugin within Logstash to extract data from the Elasticsearch cluster, and the Logstash Output OpenSearch plugin to write the data to the OpenSearch 2.x cluster. We suggest using Logstash version 7.13.4 or earlier, as newer versions may encounter compatibility issues when establishing a connection with OpenSearch due to changes introduced by Elasticsearch subsequent to the fork. We strongly recommend that users test this solution with their own data to ensure effectiveness.

Migration paths

Elasticsearch OSS versionRolling upgrade pathCluster restart upgrade path
5.xUpgrade to 5.6, upgrade to 6.8, reindex all 5.x indexes, upgrade to 7.10.2, and migrate to OpenSearch.Upgrade to 6.8, reindex all 5.x indexes, and migrate to OpenSearch.
6.xUpgrade to 6.8, upgrade to 7.10.2, and migrate to OpenSearch.Migrate to OpenSearch.
7.xMigrate to OpenSearch.Migrate to OpenSearch.

If you are migrating an Open Distro for Elasticsearch cluster, we recommend first upgrading to ODFE 1.13 and then migrating to OpenSearch.

Upgrade Elasticsearch OSS

  1. Disable shard allocation to prevent Elasticsearch OSS from replicating shards as you shut down nodes:

    1. PUT _cluster/settings
    2. {
    3. "persistent": {
    4. "cluster.routing.allocation.enable": "primaries"
    5. }
    6. }
  2. Stop Elasticsearch OSS on one node (rolling upgrade) or all nodes (cluster restart upgrade).

    On Linux distributions that use systemd, use this command:

    1. sudo systemctl stop elasticsearch.service

    For tarball installations, find the process ID (ps aux) and kill it (kill <pid>).

  3. Upgrade the node (rolling) or all nodes (cluster restart).

    The exact command varies by package manager, but likely looks something like this:

    1. sudo yum install elasticsearch-oss-7.10.2 --enablerepo=elasticsearch

    For tarball installations, extract to a new directory to ensure you do not overwrite your config, data, and logs directories. Ideally, these directories should have their own, independent paths and not be colocated with the Elasticsearch application directory. Then set the ES_PATH_CONF environment variable to the directory that contains elasticsearch.yml (for example, /etc/elasticsearch/). In elasticsearch.yml, set path.data and path.logs to your data and logs directories (for example, /var/lib/elasticsearch and /var/log/opensearch).

  4. Restart Elasticsearch OSS on the node (rolling) or all nodes (cluster restart).

    On Linux distributions that use systemd, use this command:

    1. sudo systemctl start elasticsearch.service

    For tarball installations, run ./bin/elasticsearch -d.

  5. Wait for the node to rejoin the cluster (rolling) or for the cluster to start (cluster restart). Check the _nodes summary to verify that all nodes are available and running the expected version:

    1. # Elasticsearch OSS
    2. curl -XGET 'localhost:9200/_nodes/_all?pretty=true'
    3. # Open Distro for Elasticsearch with Security plugin enabled
    4. curl -XGET 'https://localhost:9200/_nodes/_all?pretty=true' -u 'admin:<custom-admin-password>' -k

    Specifically, check the nodes.<node-id>.version portion of the response. Also check _cat/indices?v for a green status on all indexes.

  6. (Rolling) Repeat steps 2–5 until all nodes are using the new version.

  7. After all nodes are using the new version, re-enable shard allocation:

    1. PUT _cluster/settings
    2. {
    3. "persistent": {
    4. "cluster.routing.allocation.enable": "all"
    5. }
    6. }
  8. If you upgraded from 5.x to 6.x, reindex all indexes.

  9. Repeat all steps as necessary until you arrive at your desired Elasticsearch OSS version.

Migrate to OpenSearch

  1. Disable shard allocation to prevent Elasticsearch OSS from replicating shards as you shut down nodes:

    1. PUT _cluster/settings
    2. {
    3. "persistent": {
    4. "cluster.routing.allocation.enable": "primaries"
    5. }
    6. }
  2. Stop Elasticsearch OSS on one node (rolling upgrade) or all nodes (cluster restart upgrade).

    On Linux distributions that use systemd, use this command:

    1. sudo systemctl stop elasticsearch.service

    For tarball installations, find the process ID (ps aux) and kill it (kill <pid>).

  3. Upgrade the node (rolling) or all nodes (cluster restart).

    1. Extract the OpenSearch tarball to a new directory to ensure you do not overwrite your Elasticsearch OSS config, data, and logs directories.

    2. (Optional) Copy or move your Elasticsearch OSS data and logs directories to new paths. For example, you might move /var/lib/elasticsearch to /var/lib/opensearch.

    3. Set the OPENSEARCH_PATH_CONF environment variable to the directory that contains opensearch.yml (for example, /etc/opensearch).

    4. In opensearch.yml, set path.data and path.logs. You might also want to disable the Security plugin for now. opensearch.yml might look something like this:

      1. path.data: /var/lib/opensearch
      2. path.logs: /var/log/opensearch
      3. plugins.security.disabled: true
    5. Port your settings from elasticsearch.yml to opensearch.yml. Most settings use the same names. At a minimum, specify cluster.name, node.name, discovery.seed_hosts, and cluster.initial_cluster_manager_nodes.

    6. (Optional) If you’re actively connecting to the cluster with legacy clients that check for a particular version number, such as Logstash OSS, add a compatibility setting to opensearch.yml:

      1. compatibility.override_main_response_version: true
    7. (Optional) Add your certificates to your config directory, add them to opensearch.yml, and initialize the Security plugin.

  4. Start OpenSearch on the node (rolling) or all nodes (cluster restart).

    For the tarball, run ./bin/opensearch -d.

  5. Wait for the OpenSearch node to rejoin the cluster (rolling) or for the cluster to start (cluster restart). Check the _nodes summary to verify that all nodes are available and running the expected version:

    1. # Security plugin disabled
    2. curl -XGET 'localhost:9200/_nodes/_all?pretty=true'
    3. # Security plugin enabled
    4. curl -XGET -k -u 'admin:<custom-admin-password>' 'https://localhost:9200/_nodes/_all?pretty=true'

    Specifically, check the nodes.<node-id>.version portion of the response. Also check _cat/indices?v for a green status on all indexes.

  6. (Rolling) Repeat steps 2–5 until all nodes are using OpenSearch.

  7. After all nodes are using the new version, re-enable shard allocation:

    1. PUT _cluster/settings
    2. {
    3. "persistent": {
    4. "cluster.routing.allocation.enable": "all"
    5. }
    6. }

Upgrade tool

The opensearch-upgrade tool lets you automate some of the steps in Migrate to OpenSearch, eliminating the need for error-prone manual operations.

The opensearch-upgrade tool performs the following functions:

  • Imports any existing configurations and applies it to the new installation of OpenSearch.
  • Installs any existing core plugins.

Limitations

The opensearch-upgrade tool doesn’t perform an end-to-end upgrade:

  • You need to run the tool on each node of the cluster individually as part of the upgrade process.
  • The tool doesn’t provide a rollback option after you’ve upgraded a node, so make sure you follow best practices and take backups.
  • You must install all community plugins (if available) manually.
  • The tool only validates any keystore settings at service start-up time, so you must manually remove any unsupported settings for the service to start.

Using the upgrade tool

To perform a rolling upgrade using the OpenSearch tarball distribution:

Check Migration paths to make sure that the version you’re upgrading to is supported and whether you need to upgrade to a supported Elasticsearch OSS version first.

  1. Disable shard allocation to prevent Elasticsearch OSS from replicating shards as you shut down nodes:

    1. PUT _cluster/settings
    2. {
    3. "persistent": {
    4. "cluster.routing.allocation.enable": "primaries"
    5. }
    6. }
  2. On any one of the nodes, download and extract the OpenSearch tarball to a new directory.

  3. Make sure the following environment variables are set:

    • ES_HOME - Path to the existing Elasticsearch installation home.

      1. export ES_HOME=/home/workspace/upgrade-demo/node1/elasticsearch-7.10.2
    • ES_PATH_CONF - Path to the existing Elasticsearch config directory.

      1. export ES_PATH_CONF=/home/workspace/upgrade-demo/node1/os-config
    • OPENSEARCH_HOME - Path to the OpenSearch installation home.

      1. export OPENSEARCH_HOME=/home/workspace/upgrade-demo/node1/opensearch-1.0.0
    • OPENSEARCH_PATH_CONF - Path to the OpenSearch config directory.

      1. export OPENSEARCH_PATH_CONF=/home/workspace/upgrade-demo/node1/opensearch-config
  4. The opensearch-upgrade tool is in the bin directory of the distribution. Run the following command from the distribution home:

    Make sure you run this tool as the same user running the current Elasticsearch service.

    1. ./bin/opensearch-upgrade
  5. Stop Elasticsearch OSS on the node.

    On Linux distributions that use systemd, use this command:

    1. sudo systemctl stop elasticsearch.service

    For tarball installations, find the process ID (ps aux) and kill it (kill <pid>).

  6. Start OpenSearch on the node:

    1. ./bin/opensearch -d.
  7. Repeat steps 2–6 until all nodes are using the new version.

  8. After all nodes are using the new version, re-enable shard allocation:

    1. PUT _cluster/settings
    2. {
    3. "persistent": {
    4. "cluster.routing.allocation.enable": "all"
    5. }
    6. }

How it works

Behind the scenes, the opensearch-upgrade tool performs the following tasks in sequence:

  1. Looks for a valid Elasticsearch installation on the current node. After it finds the installation, it reads the elasticsearch.yml file to get the endpoint details and connects to the locally running Elasticsearch service. If the tool can’t find an Elasticsearch installation, it tries to get the path from the ES_HOME location.
  2. Verifies if the existing version of Elasticsearch is compatible with the OpenSearch version. It prints a summary of the information gathered to the console and prompts you for a confirmation to proceed.
  3. Imports the settings from the elasticsearch.yml config file into the opensearch.yml config file.
  4. Copies across any custom JVM options from the $ES_PATH_CONF/jvm.options.d directory into the $OPENSEARCH_PATH_CONF/jvm.options.d directory. Similarly, it also imports the logging configurations from the $ES_PATH_CONF/log4j2.properties file into the $OPENSEARCH_PATH_CONF/log4j2.properties file.
  5. Installs the core plugins that you’ve currently installed in the $ES_HOME/plugins directory. You must install all other third-party community plugins manually.
  6. Imports the secure settings from the elasticsearch.keystore file (if any) into the opensearch.keystore file. If the keystore file is password protected, the opensearch-upgrade tool prompts you to enter the password.