Installing the Greenplum Load Tools

This section contains information for installing the Greenplum data loading programs on your client machine and for enabling Greenplum Database to accept remote client connections:

See the Greenplum Database Release Notes for the list of currently supported platforms for the Load Tools.

Parent topic: Greenplum Database Load Tools for UNIX

Running the Load Tools Installer

The Greenplum Database load tools installer installs the following data loading tools:

  • Greenplum parallel file distribution program (gpfdist)
  • Greenplum data loading utility (gpload)

To install the Greenplum Database load tools

  1. Download the appropriate Greenplum Loaders installer package for your platform from VMware Tanzu Network.

  2. Follow the instructions in Verifying the Greenplum Database Software Download to verify the integrity of the Greenplum Loaders software.

  3. Unzip the installer:

    1. unzip greenplum-loaders-<version>-<PLATFORM>.zip
  4. Run the installer:

    1. /bin/bash greenplum-loaders-<version>-<PLATFORM>.bin
  5. The installer will prompt you to accept the license agreement and to provide an installation path. For the installation path, be sure to enter an absolute path if you choose not to accept the default location (for example, /mydir/gp-loader-tools). The default install location for the load tools is /usr/local/greenplum-loaders-<version\>

About Your Installation

Your Greenplum Database load tools installation contains the following files and directories:

  • bin — data loading command-line tools
  • ext — python dependencies
  • lib — load tools library files
  • greenplum_loaders_path.sh — environment variables

Configuring the Command-Line Load Tools

As a convenience, a greenplum_loaders_path.sh file is provided in your load tools installation directory following installation. It has the following environment variable settings:

  • GPHOME_LOADERS — The installation directory of the Greenplum Database load tools.
  • PATH — Adds the path to the data loading command-line utilities.
  • LD_LIBRARY_PATH — The path to additional Python library files needed for gpload.

You can source this file in your user’s startup shell profile (such as .bashrc or .bash_profile).

For example, you could add a line similar to the following to your chosen profile files (making sure the right install path is used):

  1. source greenplum-loaders-version>/greenplum_loaders_path.sh

After editing the chosen profile file, source it as the correct user to make the changes active. For example:

  1. source ~/.bashrc

Additional Connection Environment Variables

The Greenplum load tools require several connection parameters in order to be able to connect to a Greenplum Database instance. In order to save some typing on the command-line, you can create the following environment variables in your preferred profile file (such as .bashrc).

  • PGDATABASE — The name of the default Greenplum database to connect to.
  • PGHOST — The Greenplum master host name or IP address.
  • PGPORT — The port number that the Greenplum master instance is running on.
  • PGUSER — The default database role name to use for login.

Enabling Greenplum Database for Remote Client Connections

In order for Greenplum Database to be able to accept remote client connections, you must configure your Greenplum Database master so that connections are allowed from the client hosts and database users that will be connecting to Greenplum Database.

To enable remote client connections

  1. Make sure that the pg_hba.conf file of the Greenplum Database master is correctly configured to allow connections from the users to the database(s) using the authentication method you want. For details, see “Editing the pg_hba.conf File” in the Greenplum Database Administration Guide, and also see the Greenplum Database Security Configuration Guide.

    Make sure the authentication method you choose is supported by the client tool you are using.

  2. If you edited pg_hba.conf file, the change requires a server reload (using the gpstop -u command) to take effect.

  3. Make sure that the databases and roles you are using to connect exist in the system and that the roles have the correct priviliges to the database objects.

Next Steps

Refer to the load tool reference documentation for further instructions: