Connect to Hive

Please follow the steps below to connect to the Hive data source.

  1. Click "New Data Connection" in the upper right corner of the Data Connection page.

  2. Select Hive Data Source from the Data Source Types.

  3. Fill in the parameters for the data source as required.

  • Name: The name of the connection, must be unique.
  • Machine Address: The host address of the database. If the URL field is filled in, the URL takes precedence.
  • Port: The port of the database. If the URL field is filled in, the URL takes precedence.
  • Username: The username for the database.
  • Password: The password for the database.
  • Database: The name of the database. If the URL field is filled in, the URL takes precedence.
  • Maximum Connections: The maximum number of connections in the connection pool.
  • Prefer Database Comment for Dataset Title: Whether the dataset title shows the table name or the table comment. When enabled, the table comment is displayed as the title; when disabled, the table name is displayed.
  • Hive Execution Engine: The Hive execution engine. The options are mr, tez, and spark.
  • Hadoop Authentication Method: One of three options: "simple", "kerberos", or "tbds". When "kerberos" or "tbds" is selected, fill in the "Username" and "Password" fields above with the credentials for the corresponding "kerberos" or "tbds" system.
  • realmA: This field needs to be filled when the Hadoop authentication method is kerberos.
  • kdcA: This field needs to be filled when the Hadoop authentication method is kerberos.
  • realmB: This field needs to be filled when the Hadoop authentication method is kerberos.
  • kdcB: This field needs to be filled when the Hadoop authentication method is kerberos.
  • Server Principal: This field needs to be filled when the Hadoop authentication method is kerberos.
  • Data Gateway: Fill in the Data Gateway ID when the connection is made through a data gateway.
  • URL: The JDBC URL for the database.
  • Hierarchical Loading of Schema and Tables: When disabled, schemas and tables are loaded at the same time. When enabled, they are loaded hierarchically: only the schemas are loaded while the connection is being created, so the data source can be integrated into the system quickly.
  • Show Tables Only from Specified Database/Schema: When this option is selected and the database field is not empty, only the tables under that database will be displayed.
  4. After filling in the parameters, click the "Verify" button to check the connectivity between HENGSHI SENSE and the configured data connection. The connection cannot be added unless verification passes.

  5. Click to execute the preset code. The preset code corresponding to the data source is displayed in a pop-up; click the execute button to run it.

  6. Click the "Add" button to add the configured Hive connection.
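
As a rough illustration of how the connection parameters above relate to the URL field, the following Python sketch assembles a Hive JDBC URL in the standard HiveServer2 form `jdbc:hive2://<host>:<port>/<database>`. The function `build_hive_jdbc_url` and its argument names are hypothetical and are not part of HENGSHI SENSE. The sketch only mirrors the documented rule that a filled-in URL takes precedence over Machine Address, Port, and Database. Passing the server principal and the execution engine through the URL is how a generic Hive JDBC client can do it; it is an assumption here, not a description of what the product does internally.

```python
from typing import Optional


def build_hive_jdbc_url(
    host: str,
    port: int = 10000,  # default HiveServer2 port
    database: str = "default",
    url: Optional[str] = None,
    server_principal: Optional[str] = None,
    execution_engine: Optional[str] = None,
) -> str:
    """Return a Hive JDBC URL built from the form fields (illustrative only)."""
    # A filled-in URL takes precedence over host, port, and database.
    if url:
        return url

    jdbc_url = f"jdbc:hive2://{host}:{port}/{database}"

    # On a Kerberos-secured HiveServer2, the server principal is passed
    # as a session variable after a ';'.
    if server_principal:
        jdbc_url += f";principal={server_principal}"

    # Hive configuration overrides follow a '?' in the hive2 URL syntax.
    if execution_engine:
        if execution_engine not in {"mr", "tez", "spark"}:
            raise ValueError(f"unsupported execution engine: {execution_engine}")
        jdbc_url += f"?hive.execution.engine={execution_engine}"

    return jdbc_url


# Host, port, and database only.
print(build_hive_jdbc_url("hive.example.com", 10000, "sales"))
# jdbc:hive2://hive.example.com:10000/sales

# An explicit URL overrides the other fields.
print(build_hive_jdbc_url("ignored-host", url="jdbc:hive2://other.example.com:10001/hr"))
# jdbc:hive2://other.example.com:10001/hr
```

The host names and the database names in the usage lines are placeholders. In practice, the simplest check is still the "Verify" button in step 4, which tests the real connection.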

Please note

  1. Parameters marked with * are required, while others are optional.
  2. When connecting to a data source, the preset code must be executed; otherwise, certain functions will be unavailable during data analysis. In addition, when upgrading to 4.4 from an earlier version, the preset code must be executed for the data connections that already exist in the system.

HENGSHI SENSE Platform User Manual