Infoworks 6.2.0
Onboard Data

Onboarding Data

Perform the following steps to ingest data:

  • Click the Ingest Data tab.
  • Select the tables to be ingested and click the Ingest button.
  • In the Ingest Data wizard, enter the ingestion details and click Ingest.
  • Click OK in the Job Submitted pop-up window.

Ingestion Details

FieldDescription
Job NameThe name of the ingestion job.
Max. Connections to SourceThe maximum number of source database connections allocated to this ingestion table group.
Max. Parallel TablesThe maximum number of tables that can be crawled at a given instance.
Compute TemplateThe template based on which the cluster will spin up for each table.
Overwrite Worker CountThe option to overwrite the minimum and maximum worker values configured in the compute template.
Min WorkersThe minimum number of nodes that will spin up in the cluster.
Max WorkersThe maximum number of nodes that can be spun up in the cluster.
Number of Worker NodesThe number of nodes that can be spun up in the cluster.
Snowflake WarehouseThis field appears only if the Data Environment selected in a CDW env. For e.g. Snowflake. Select from the list of associated warehouses.
Query Tags

A string that is added to the Snowflake query and can be accessed via Query history in Snowflake.

NOTE This field appears only if the source is using Snowflake CDW environment

This allows specifying lower and upper watermark (not as offset) for a particular run. The default upper bound is the maximum of the watermark column(s) which can be changed by the user. For lower limit, the last ingested cdc value is the default.

NOTE The watermark values from the restatement job will not be updated in the table.

For running new restatement jobs on 6.0.0 using databricks environment, after upgrading from lower release (5.5.1.x), user need to remove existing jars from old cluster and then restart the cluster for using the same or user can create a new cluster for running the jobs.

  Last updated by Monika Momaya