Arcadia Enterprise supports Hive data connections.
The following steps demonstrate how to create new Hive data connections.
Note that the following known issue affects Hive connections: CONNECTION: User-Defined Aggregate Expressions Do Not Validate.
On the main navigation bar, click Data.
The Data view appears, open on the Datasets tab.
In the side bar, click New Connection.
The Create New Data Connection modal window appears.
HiveConnection
.Under Hostname or IP address, specify the name of your database host, or
its IP address; use localhost
when the data source is local.
10000
.Under Credentials, complete the following entries.
Under Username, enter the username for establishing the connection.
Under Password, enter the password for establishing the connection.
At the bottom of the modal, click Test.
If the connection is valid, the system returns a 'Connection Verified' message.
Click Save.
After this operation succeeds, the new connection name appears on the side navigation bar.
To select advanced connection options, click the Advanced tab, and make changes to any of the following options:
Choose Connection mode: either Binary (default), or HTTP.
Choose one of the following Socket types:
Default setting
If you select this option, you can use the Allow Common Name - Host Name Mismatch option, which means that the issued SSL certificate name does not have to match the host name of the server. By default, the names must match.
Choose one of the Authentication modes:
Kerberos authentication is only available on Linux platforms.
Specify Query Timeout. The default value is 60
seconds.
Specify Session Timeout. The default value is 0
,
which means that sessions do not time out.
Specify Socket Timeout. The default value is
60
.
Queue depth controls the maximum number of simultaneous queries on the connection.
By default, the value is 2
and does not have to be specified.
Valid values are integers 1
through 100
.
Check Impersonation if using this feature to control individual table-level access for a user or a user group.
Click the Parameters tab.
Click the first row to add a Parameter name/value pair. Type in the parameters and their values.
If you want to remove a parameter, click (trash can) icon to remove existing parameters.
Click the Cache tab.
Select the Result Cache option to enable periodic cache updates.
In the Retention Time field, specify the frequency of cache updates, in seconds.
For example, the default value of 86400 indicates an update every 24 hours, and a value of 300 initiates a refresh every 5 minutes.