Connect to Cloudera Impala
Please follow the steps below to connect to the Cloudera Impala data source.
Click "New Data Connection" in the upper right corner of the Data Connection page.
Select the Cloudera Impala data source from the Data Source Types.
Fill in the parameters for the data source connection as required.
- Name: The name of the connection, must be unique.
- Machine Address: The address of the database. If the url field is filled, the url will be prioritized.
- Port: The port of the database. If the url field is filled, the url will be prioritized.
- Username: The username for the database.
- Password: The password for the database.
- Database: The name of the database. If the url field is filled, the url will be prioritized.
- Maximum Connections: The maximum number of connections in the connection pool.
- Encoding: The encoding setting for the database connection.
- Prefer Database Comment for Dataset Title: Whether to display the table name or the table comment as the title. When enabled, the title is displayed; when disabled, the table comment is displayed.
- Hadoop Authentication Method: The authentication method for Hadoop, where simple is basic authentication without additional information, and Kerberos requires additional information to be filled.
- realmA: This field needs to be filled when the Hadoop authentication method is Kerberos.
- kdcA: This field needs to be filled when the Hadoop authentication method is Kerberos.
- realmB: This field needs to be filled when the Hadoop authentication method is Kerberos.
- kdcB: This field needs to be filled when the Hadoop authentication method is Kerberos.
- Server Principal: This field needs to be filled when the Hadoop authentication method is Kerberos.
- Data Gateway: Fill in the Data Gateway ID when the connection is made through a data gateway.
- URL: The JDBC URL for the database.
- Additional JDBC Parameters: Additional JDBC parameters. It is recommended to use the url to write the complete url. This parameter is only appended to the automatically generated JDBC URL.
- Hierarchical Loading of Schemas and Tables: When closed, both schemas and tables are loaded simultaneously. When enabled, schemas and tables are loaded hierarchically, with only schemas loaded during the connection process, allowing the data source to be quickly integrated into the system.
- Support Uploading Files to Specified Path: The specific path indicates the database name where files are uploaded. This parameter can only be configured after verification.
- Support Operations on Connected Database: Indicates that this connection can be selected as an output connection in Data Integration, Batch Sync, and Data Filling. The user must ensure write permissions for the database. This parameter can only be configured after verification.
- Show Tables Only Under Specified Database/Schema: When this option is selected and the database field is not empty, only the tables under this database will be displayed.
After filling in the parameters, click the "Verify" button to obtain the verification result (verifying the connectivity between HENGSHI SENSE and the configured data connection; adding is not allowed if the verification is not passed).
Click to execute the preset code, and the preset code corresponding to the data source will pop up. Click the execute button.
Click the "Add" button to add the configured SQL Server connection.
Please note
- Parameters marked with * are required, while others are optional.
- When connecting to a data source, the preset code must be executed. Failure to do so will result in certain functions being unavailable during data analysis. Additionally, when upgrading from versions prior to 4.4 to 4.4, the preset code needs to be executed for existing data connections in the system.