Set up your Impala database connection

This article outlines the steps to create an Impala database connection.

Supported authentication types

The Impala connector supports the following authentication types for copy and Dataflow Gen2 respectively.

Authentication type    Copy    Dataflow Gen2
Anonymous              n/a     Supported
Windows                n/a     Supported
Database               n/a     Supported
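
For anyone scripting checks against this table, the support matrix can be written as a small lookup. The following is a minimal sketch in Python, not part of any Fabric API: the `SUPPORT` dictionary and the `is_supported` helper are hypothetical names, and the Dataflow Gen2 entries assume that all three authentication types are supported there, as the limitations later in this article indicate.

```python
# Hypothetical lookup mirroring the support table in this article.
# Copy is listed as n/a (not supported); Dataflow Gen2 is assumed to
# support all three authentication types.
SUPPORT = {
    "Anonymous": {"Copy": False, "Dataflow Gen2": True},
    "Windows": {"Copy": False, "Dataflow Gen2": True},
    "Database": {"Copy": False, "Dataflow Gen2": True},
}


def is_supported(auth_type: str, workload: str) -> bool:
    """Return True if the table lists auth_type as supported for workload."""
    # Unknown authentication types or workloads are treated as unsupported.
    return SUPPORT.get(auth_type, {}).get(workload, False)


if __name__ == "__main__":
    for auth_type in SUPPORT:
        print(auth_type, is_supported(auth_type, "Dataflow Gen2"))
```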

Set up your connection for Dataflow Gen2

You can connect Dataflow Gen2 in Microsoft Fabric to Impala using Power Query connectors. Follow these steps to create your connection:

  1. Check capabilities, limitations, and considerations to make sure your scenario is supported.
  2. Get data in Fabric.
  3. Connect to an Impala database.

Capabilities

  • Import
  • DirectQuery (Power BI semantic models)
  • Advanced options
    • Connection timeout duration
    • Command timeout duration

Get data

To get data in Data Factory:

  1. On the left side of Data Factory, select Workspaces.

  2. From your Data Factory workspace, select New > Dataflow Gen2 to create a new dataflow.

    Screenshot showing the workspace where you choose to create a new dataflow.

  3. In Power Query, either select Get data in the ribbon or select Get data from another source in the current view.

    Screenshot showing the Power Query workspace with the Get data option emphasized.

  4. On the Choose data source page, use Search to find the connector by name, or select View more on the right-hand side of the connector list to see all the connectors available in the Power BI service.

    Screenshot of the Data Factory Choose data source page with the search box and the view more selection emphasized.

  5. If you choose to view more connectors, you can still use Search to search for the name of the connector, or choose a category to see a list of connectors associated with that category.

    Screenshot of the Data Factory Choose data source page displayed after selecting view more, with the list of connectors.

Connect to an Impala database

To connect to an Impala database, take the following steps:

  1. Select the Impala option in the connector selection.

  2. In Connect to data source, provide the name of the server and a port number if necessary.

    Screenshot of the Connect to data source dialog where you enter the Impala database online connection.

  3. If necessary, select the name of your on-premises data gateway.

  4. If you're connecting to this Impala database for the first time, select the type of credentials for the connection in Authentication kind.

  5. Enter your credentials.

  6. Select Use Encrypted Connection if you want to use an encrypted connection, or clear the option if you want to use an unencrypted connection.

  7. Select Next to continue.

  8. In Navigator, select the data you require, then select Transform data to transform the data in the Power Query editor.
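
Before you open the dialog, it can help to gather and sanity-check the values these steps ask for. The sketch below is one way to do that in Python, under stated assumptions: the `ImpalaConnectionInputs` class and its field names are hypothetical and don't correspond to any Fabric or Power Query API, and the `server:port` form used for the server field is an assumption about how the port is appended. The host name and port in the usage example are placeholder values.

```python
from dataclasses import dataclass
from typing import List, Optional

# Authentication kinds listed in this article for Dataflow Gen2.
AUTH_KINDS = {"Anonymous", "Windows", "Database"}


@dataclass(frozen=True)
class ImpalaConnectionInputs:
    """Hypothetical container for the values entered in Connect to data source."""

    server: str
    port: Optional[int] = None       # only needed when a port must be specified
    gateway: Optional[str] = None    # on-premises data gateway name, if used
    auth_kind: str = "Anonymous"
    use_encrypted_connection: bool = True

    def server_field(self) -> str:
        """Combine server and port (assumed 'server:port' form)."""
        if self.port is None:
            return self.server
        return f"{self.server}:{self.port}"

    def problems(self) -> List[str]:
        """Return a list of human-readable issues; empty means the inputs look usable."""
        issues = []
        if not self.server.strip():
            issues.append("Server name is empty.")
        if self.port is not None and not (1 <= self.port <= 65535):
            issues.append("Port must be between 1 and 65535.")
        if self.auth_kind not in AUTH_KINDS:
            issues.append(f"Unsupported authentication kind: {self.auth_kind}")
        return issues


if __name__ == "__main__":
    # Placeholder values for illustration only.
    inputs = ImpalaConnectionInputs(
        server="impala.example.com", port=21050, auth_kind="Database"
    )
    print(inputs.server_field())
    print(inputs.problems())
```

A check like this only catches obvious mistakes, such as an empty server name or an authentication kind the connector doesn't list. It doesn't contact the server or the gateway.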

Limitations and considerations

Here are a few considerations and limitations to keep in mind with the Impala connector:

  • The Impala connector is supported on the on-premises data gateway, using any of the three supported authentication mechanisms (Anonymous, Windows, or Database).
  • The Impala connector uses the Impala driver, which limits the size of string types to 32 K by default.
  • The Impala connector doesn't support overriding the Realm option for Kerberos authentication.

Set up your connection in a pipeline

Data Factory in Microsoft Fabric doesn't currently support an Impala database in pipelines.