Creating New Spark SQL Connections

The following steps demonstrate how to create new Spark SQL data connections.

Availability Note. This connection type requires the presence of a Spark Thrift Server.
Availability Note. This connection type provides basic connectivity and functionality for Spark SQL connections, and runs queries faster than Hive connections. However, it does not support the save table and data sampling features, analytical views, or flow and funnel visuals.
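Because the connection depends on a running Spark Thrift Server, it can help to confirm that the server is listening before you fill in the modal window. The following is a minimal sketch using only the Python standard library; the function name thrift_server_reachable is hypothetical, and the host and port shown in the usage comment are assumptions to replace with your own values. It only checks that a TCP connection can be opened; it does not verify credentials or that the listener is actually a Spark Thrift Server.

```python
import socket


def thrift_server_reachable(host: str, port: int, timeout_s: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout_s seconds."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout_s):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


# Example usage (assumed values, adjust for your environment):
#   thrift_server_reachable("localhost", 10000)
```

If the check returns False, the hostname, port, or firewall rules are worth reviewing before testing the connection in the application.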
Create a New Spark SQL Connection
  1. On the main navigation bar, click Data.

    The Data view appears, open on the Datasets tab.

  2. In the side bar, click New Connection.

    The Create New Data Connection modal window appears.

  3. In the Create New Data Connection modal window, under Connection type, select Spark SQL.
  4. Under Connection name, specify the name of the new connection. Here, we use SparkSQLConnection.
  5. Under Hostname or IP address, specify the name of your database host, or its IP address; use localhost when the data source is local.

  6. Under Port #, enter the port number. The default port for Spark SQL connections is 10000.
  7. Under Credentials, complete the following entries.

    • Under Username, enter the username for establishing the connection.

    • Under Password, enter the password for establishing the connection.

  8. Click the Advanced tab.

    Advanced Connection Options
  9. Choose Socket type.

    Note: Windows platforms support only normal sockets; therefore, socket options do not appear on the modal window.

    • Normal [default]

    • SSL

    • SSL with certificate

  10. Choose Authentication mode.

    Note: Windows platforms support only NoSasl authentication; therefore, authentication options do not appear on the modal window.

    • NoSasl

    • Plain

    • LDAP

    • Kerberos

      Note: Kerberos authentication is only available on Linux platforms.

  11. Specify Timeout interval. The default value is 60 seconds.

  12. Choose Queue depth.

    This controls the maximum number of simultaneous queries on the connection.

    • By default, the value is 2 and does not have to be specified.

    • Valid values are integers 1 through 100.

  13. Click the Parameters tab.

    Specifying Connection Parameters
  14. Click the first row to add a parameter name/value pair, then type the parameter name and its value.

  15. To remove an existing parameter, click the (trash can) icon.

  16. At the bottom of the modal, click Test.

    Testing the New Connection
  17. If the connection is valid, the system returns a 'Connection Verified' message.

    New Connection Verified
  18. Click Connect.

After this operation succeeds, the new connection name appears on the side navigation bar.
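
The values collected in the steps above can be summarized and sanity-checked before you enter them. The following is a minimal sketch, not part of the product: the class SparkSqlConnectionSettings and its validate method are hypothetical names, the NoSasl default and the positive-timeout rule are assumptions, and the socket types, authentication modes, timeout default, and queue depth range are taken from the procedure.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Options listed on the Advanced tab of the procedure.
SOCKET_TYPES = ("Normal", "SSL", "SSL with certificate")
AUTH_MODES = ("NoSasl", "Plain", "LDAP", "Kerberos")


@dataclass
class SparkSqlConnectionSettings:
    """Hypothetical container mirroring the fields of the modal window."""
    name: str
    host: str
    port: int
    username: str = ""
    password: str = ""
    socket_type: str = "Normal"   # documented default
    auth_mode: str = "NoSasl"     # assumed default; the procedure does not state one
    timeout_s: int = 60           # documented default, in seconds
    queue_depth: int = 2          # documented default
    parameters: Dict[str, str] = field(default_factory=dict)

    def validate(self) -> List[str]:
        """Return a list of human-readable problems; an empty list means no issues found."""
        problems = []
        if not self.name:
            problems.append("Connection name is empty.")
        if not self.host:
            problems.append("Hostname or IP address is empty.")
        if not 1 <= self.port <= 65535:
            problems.append("Port must be between 1 and 65535.")
        if self.socket_type not in SOCKET_TYPES:
            problems.append("Unknown socket type.")
        if self.auth_mode not in AUTH_MODES:
            problems.append("Unknown authentication mode.")
        if self.timeout_s <= 0:
            # Assumption: a timeout must be a positive number of seconds.
            problems.append("Timeout must be positive.")
        if not 1 <= self.queue_depth <= 100:
            problems.append("Queue depth must be an integer from 1 through 100.")
        return problems


settings = SparkSqlConnectionSettings(
    name="SparkSQLConnection", host="localhost", port=10000
)
print(settings.validate())
```

Returning a list of problems rather than raising on the first one lets you correct every field in a single pass before clicking Test.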