Skip to Content

Google BigQuery Query Activity

Introduction

A Google BigQuery Query activity, using its Google BigQuery connection, queries a table in a dataset in Google BigQuery and is intended to be used as a source in an operation.

Create a Google BigQuery Query Activity

An instance of a Google BigQuery Query activity is created from a Google BigQuery connection using its Query activity type.

To create an instance of an activity, drag the activity type to the design canvas or copy the activity type and paste it on the design canvas. For details, see Create an Activity Instance in Component Reuse.

An existing Google BigQuery Query activity can be edited from these locations:

Configure a Google BigQuery Query Activity

Follow these steps to configure a Google BigQuery Query activity:

Step 1: Enter a Name and Select the Dataset

In this step, provide a name for the activity and select the dataset. Each user interface element of this step is described below.

Google BigQuery Query Activity Configuration Step 1

  • Name: Enter a name to identify the activity. The name must be unique for each Google BigQuery Query activity and must not contain forward slashes / or colons :.

  • Select the Dataset: This section displays datasets available in the Google BigQuery endpoint.

    • Selected Dataset: After a dataset is selected, it is listed here.

    • Search: Enter any part of the dataset name into the search box to filter the list of datasets. The search is not case-sensitive. If datasets are already displayed within the table, the table results are filtered in real time with each keystroke. To reload datasets from the endpoint when searching, enter search criteria and then refresh, as described below.

    • Refresh: Click the refresh icon Refresh icon or the word Refresh to reload datasets from the Google BigQuery endpoint. This may be useful if datasets have been added to Google BigQuery. This action refreshes all metadata used to build the table of datasets displayed in the configuration.

    • Selecting a Dataset: Within the table, click anywhere on a row to select a dataset. Only one dataset can be selected. The information available for each dataset is fetched from the Google BigQuery endpoint:

      • Name: The name of the dataset.

      • Description: The description of the dataset.

    Tip

    If the table does not populate with available datasets, the Google BigQuery connection may not be successful. Ensure you are connected by reopening the connection and retesting the credentials.

  • Save & Exit: If enabled, click to save the configuration for this step and close the activity configuration.

  • Next: Click to temporarily store the configuration for this step and continue to the next step. The configuration will not be saved until you click the Finished button on the last step.

  • Discard Changes: After making changes, click to close the configuration without saving changes made to any step. A message asks you to confirm that you want to discard changes.

Step 2: Select the Table

In this step, select the table. Each user interface element of this step is described below.

Google BigQuery Query Activity Configuration Step 2

  • Select the Table: This section displays tables available in the Google BigQuery endpoint.

    • Selected Dataset: The dataset selected in the previous step is listed here.

    • Selected Table: After a table is selected, it is listed here.

    • Search: Enter any part of the table name into the search box to filter the list of tables. The search is not case-sensitive. If tables are already displayed within the table, the table results are filtered in real time with each keystroke. To reload tables from the endpoint when searching, enter search criteria and then refresh, as described below.

    • Refresh: Click the refresh icon Refresh icon or the word Refresh to reload tables from the Google BigQuery endpoint. This may be useful if tables have been added to Google BigQuery. This action refreshes all metadata used to build the table of tables displayed in the configuration.

    • Selecting a Table: Within the table, click anywhere on a row to select a table. Only one table can be selected. The information available for each table is fetched from the Google BigQuery endpoint:

      • Name: The name of the table.

      • Description: The description of the table.

    Tip

    If the table does not populate with available tables, the Google BigQuery connection may not be successful. Ensure you are connected by reopening the connection and retesting the credentials.

  • Back: Click to temporarily store the configuration for this step and return to the previous step.

  • Next: Click to temporarily store the configuration for this step and continue to the next step. The configuration will not be saved until you click the Finished button on the last step.

  • Discard Changes: After making changes, click to close the configuration without saving changes made to any step. A message asks you to confirm that you want to discard changes.

Step 3: Build Your Query

In this step, set conditions on a query using the object fields and apply paging to a query. Each user interface element of this step is described below.

Google BigQuery Query Activity Configuration Step 3

Tip

Fields with a variable icon Variable icon support using global variables, project variables, and Jitterbit variables. Begin either by typing an open square bracket [ into the field or by clicking the variable icon to display a list of the existing variables to choose from.

  • Search: Enter any part of a field name into the search box to filter the list of fields for the selected object. The search is not case-sensitive. The listed results are filtered in real time with each keystroke.

  • Refresh: Click the refresh icon Refresh icon or the word Refresh to reload fields of the object from the Google BigQuery endpoint.

  • Select All: When using the search box to filter, you can use this checkbox to select all visible fields at once.

  • Select Fields: Select the checkboxes of the fields you want included in the query to have them automatically added to the SELECT statement in the Query String. You can also Select All of the fields at once using the checkbox.

  • Paging: To add a paging clause (a limit on the number of records with an optional record offset), you can use the dropdown to set the paging limit and the field to enter an offset. If an offset is not specified, it defaults to 0. A single paging clause is supported. If paging clause is not included, all records are returned.

    • Apply: Click to automatically construct the clause based on the dropdown selections and entered value. The automatically constructed paging clause appears in the Query String text box.

    • Remove: Click to remove a paging clause that has been applied.

  • Conditions: To add conditional clauses, use the fields below as input to help construct the clauses, which then appear in the Query String text box.

    • Object: Field: Use the dropdown to select a field from the selected object.

    • Operator: Use the dropdown to select an operator that is appropriate for the field data type:

      Operator Label Description
      = Equals
      != Not equals
      IN (value1, value2) In In list of values.
      IS NULL Is Null Has no value.
      IS NOT NULL Is Not Null Has a value.
      LIKE 'string' Like Like string.
      LIKE 'string%' Starts with Starts with string.
      LIKE '%string' Ends with Ends with string.
      LIKE '%string%' Contains Contains string.
      < Less than
      <= Less or equal
      > Greater than
      >= Greater or equal
    • Value: Enter the desired value to use with the dropdown selections.

      Tip

      When using global variables in the conditions of a WHERE clause, it is recommended to specify a default value so that script testing is possible. Otherwise, since global variables obtain their value at runtime, the syntax may be invalidated during testing or no data may be returned if no default value is specified.

    • Add: Click to automatically construct the clause based on the dropdown selections and entered value. The conditional clause is added to the Query String text box.

    • Remove All: Click to remove all entered conditional clauses.

  • Query String: As you select fields, specify conditions, and set paging, the query statement in this text box is autopopulated with the selected fields, conditions, and paging limits. This statement is editable.

  • Test Query: Click to validate the query. If the query is valid, a sample of up to 10 records retrieved from the query is displayed in a table. If the query is not valid, relevant error messages are displayed.

  • Back: Click to temporarily store the configuration for this step and return to the previous step.

  • Next: Click to temporarily store the configuration for this step and continue to the next step. The configuration will not be saved until you click the Finished button on the last step.

  • Discard Changes: After making changes, click to close the configuration without saving changes made to any step. A message asks you to confirm that you want to discard changes.

Step 4: Review the Data Schemas

Any request or response schemas are displayed. Each user interface element of this step is described below.

Google BigQuery Query Activity Configuration Step 4

  • Data Schemas: These data schemas are inherited by adjacent transformations and are displayed again during transformation mapping.

    The Google BigQuery connector uses the Google SDK version 25.4.0. Refer to the SDK documentation for information on the schema nodes and fields.

    The Query activity uses JSON in its response schema.

  • Refresh: Click the refresh icon Refresh icon or the word Refresh to regenerate schemas from the Google BigQuery endpoint. This action also regenerates a schema in other locations throughout the project where the same schema is referenced, such as in an adjacent transformation.

  • Back: Click to temporarily store the configuration for this step and return to the previous step.

  • Finished: Click to save the configuration for all steps and close the activity configuration.

  • Discard Changes: After making changes, click to close the configuration without saving changes made to any step. A message asks you to confirm that you want to discard changes.

Next Steps

After configuring a Google BigQuery Query activity, complete the configuration of the operation by adding and configuring other activities, transformations, or scripts as operation steps. You can also configure the operation settings, which include the ability to chain operations together that are in the same or different workflows.

Menu actions for an activity are accessible from the project pane and the design canvas. For details, see Activity Actions Menu in Connector Basics.

Google BigQuery Query activities can be used as a source with these operation patterns:

To use the activity with scripting functions, write the data to a temporary location and then use that temporary location in the scripting function.

When ready, deploy and run the operation and validate behavior by checking the operation logs.