Celonis Product Documentation

Connecting to Cloudera Impala

To connect your Cloudera Impala account to the Celonis Platform, you need two components: the Impala JDBC connector, available directly from Cloudera, and the Celonis JDBC extractor, available from the Celonis Download Portal.

To connect to your Cloudera Impala account, follow these steps:

Step 1: Download and install the Cloudera Impala connector

The first step is to download and install the latest JDBC connector for Cloudera Impala:

  1. Download the relevant driver from the Cloudera website: Cloudera.com - Impala JDBC Connector

  2. Extract the zipped file and follow the installation guidance provided in the 'docs' folder.

    For example:

    install_cloudera.png

Step 2: Download and extract the Celonis JDBC extractor

You can now download the latest Celonis JDBC extractor from the Celonis Download Portal (which requires admin access to your Celonis Platform team).

  1. Click Admin & Settings.

    admin_and_settings.png
  2. Click Download Portal.

    download_portal.png
  3. Download the latest JDBC (database) Extractor.

    download_jdbc.png
  4. Extract the zipped file locally.

    extracted_package.png

The next step is to adapt the application-local.yml file to include your Cloudera account and your Celonis Platform team details:

  1. From the Celonis JDBC extractor extracted above, open the application-local.yml file.

  2. Edit the client ID, client secret, and URL fields:

    • Client ID: This is from your Cloudera account.

    • Client secret: This is from your Cloudera account.

    • URL: Replace the [team] and [eu-1] placeholders with your Celonis team ID and your realm. Both appear in your Celonis Platform URL once you are logged in.

    application_yaml.png
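The team ID and realm for the URL field can be read off the address of your Celonis Platform team. The following is a minimal sketch, assuming the address has the form https://<team>.<realm>.celonis.cloud (an assumption based on the [team] and [eu-1] placeholders above). The helper name `split_team_and_realm` and the example team `acme` are hypothetical and not part of the extractor.

```python
from urllib.parse import urlparse


def split_team_and_realm(platform_url):
    """Return (team, realm) from a Celonis Platform URL.

    Assumes the host looks like <team>.<realm>.celonis.cloud; this layout
    is an assumption made for illustration.
    """
    host = urlparse(platform_url).hostname
    if host is None:
        raise ValueError(f"not a URL with a host: {platform_url!r}")
    parts = host.split(".")
    if len(parts) != 4 or parts[2:] != ["celonis", "cloud"]:
        raise ValueError(f"unexpected host layout: {host!r}")
    return parts[0], parts[1]


# Hypothetical team "acme" in realm "eu-1".
team, realm = split_team_and_realm("https://acme.eu-1.celonis.cloud/ui/login")
print(team, realm)  # acme eu-1
```

The two values returned are the ones to substitute for [team] and [eu-1] in the application-local.yml file.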

You now need to create a directory for the Cloudera connector .jar file:

  1. In the extracted Celonis JDBC extractor directory, create a directory named 'jar' and move the .jar file from the extracted Cloudera connector package into it.

  2. From the directory that contains the 'jar' folder, run the following command (adjust connector-jdbc.jar if your extractor .jar file has a different name):

    java -Dloader.path=jar -Dspring.config.location=application-local.yml -jar connector-jdbc.jar
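Because the command uses relative paths, it only works when the extractor .jar file, the application-local.yml file, and the 'jar' folder are all in the working directory. The following is a minimal sketch of a pre-launch check, assuming the file names used in the command above. The helpers `check_layout` and `build_command` are hypothetical and not part of the Celonis extractor.

```python
from pathlib import Path


def check_layout(workdir, jar_dir="jar", config="application-local.yml",
                 extractor_jar="connector-jdbc.jar"):
    """Return a list of problems found in workdir (empty if none)."""
    root = Path(workdir)
    problems = []
    if not (root / extractor_jar).is_file():
        problems.append(f"missing extractor: {extractor_jar}")
    if not (root / config).is_file():
        problems.append(f"missing config: {config}")
    # The driver .jar name varies by version, so accept any .jar file.
    if not list((root / jar_dir).glob("*.jar")):
        problems.append(f"no driver .jar found in {jar_dir}/")
    return problems


def build_command(jar_dir="jar", config="application-local.yml",
                  extractor_jar="connector-jdbc.jar"):
    """Build the launch command from the step above as an argument list."""
    return [
        "java",
        f"-Dloader.path={jar_dir}",
        f"-Dspring.config.location={config}",
        "-jar",
        extractor_jar,
    ]


if __name__ == "__main__":
    issues = check_layout(".")
    if issues:
        print("\n".join(issues))
    else:
        print(" ".join(build_command()))
```

Run from the extractor directory, the script prints either the missing pieces or the command to start the extractor.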

You can now create the connection between Cloudera Impala and the Celonis Platform from your data pool diagram:

  1. Click Data Connections.

    data_connections_within_data_pool_diagram.png
  2. Click Add Data Connection and select Connect to Data Source.

    add_data_connection.png
  3. Select Cloud - Database.

  4. Configure the following connection details:

    • Name: An internal reference for this data connection.

    • Database type: Select Cloudera Impala.

    • Configuration type: Select Custom JDBC String.

    • JDBC connection string: This depends on your connection configuration:

      • When not using SSL:

        jdbc:impala://yourImpalaHostname:yourImpalaPort;AuthMech=3;
      • When using SSL and a certificate file: Add the certificate to the JDK keystore, then use the following string:

        jdbc:impala://yourImpalaHostname:yourImpalaPort;AuthMech=3;SSL=1;
      • When using SSL and a .jks file: Set the SSLTrustStore parameter to the path of the .jks file containing the SSL key and set SSLTrustStorePwd to the trust store password. Then use the following string:

        jdbc:impala://yourImpalaHostname:yourImpalaPort;AuthMech=4;SSLTrustStore=<path>/<to>/<file>/cacerts.jks;SSLTrustStorePwd=changeit;tsasltransportbufsize=1000;usenativequery=0;useonlysspi=1;usesqlunicodetypes=0;
    • Schema name: The schema you want to use.

    • Additional properties: Not needed.

    • Username and password: Enter the details from your Cloudera Impala account.

  5. Click Test Connection and correct any highlighted issues.

  6. Click Save.

    The connection between your Cloudera Impala account and the Celonis Platform is established. You can manage this connection at any time by clicking Options:

    managing_data_connections.png
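The JDBC connection strings in step 4 all follow the same pattern: the host and port, followed by key=value properties that each end in a semicolon, with no spaces. The following is a minimal sketch of assembling such a string. The helper `impala_jdbc_url` is hypothetical, and port 21050 is used only as an example value; substitute the port of your own Impala instance.

```python
def impala_jdbc_url(host, port, properties):
    """Join host, port and driver properties into a JDBC connection string.

    Properties are emitted in order as key=value pairs, each terminated
    by ';' and without spaces, matching the strings shown in step 4.
    """
    for key, value in properties.items():
        if ";" in str(key) or ";" in str(value):
            raise ValueError(f"';' not allowed in property {key!r}")
    pairs = "".join(f"{key}={value};" for key, value in properties.items())
    return f"jdbc:impala://{host}:{port};{pairs}"


# Only AuthMech set, as in the first string in step 4.
basic = impala_jdbc_url("yourImpalaHostname", 21050, {"AuthMech": 3})

# AuthMech plus SSL=1, as in the certificate file variant.
with_ssl = impala_jdbc_url(
    "yourImpalaHostname", 21050, {"AuthMech": 3, "SSL": 1}
)

print(basic)     # jdbc:impala://yourImpalaHostname:21050;AuthMech=3;
print(with_ssl)  # jdbc:impala://yourImpalaHostname:21050;AuthMech=3;SSL=1;
```

The .jks variant can be built the same way by passing SSLTrustStore, SSLTrustStorePwd, and the remaining properties in the dictionary. Generating the string rather than typing it avoids stray spaces or missing semicolons.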