This functionality allows you to get the metadata of already existing delta tables, so that they can be used in pipelines downstream and can be used in conjunction with tables ingested from other sources. |
The following are the steps to create a Delta Table source:
The following are the steps to configure a Delta Table source:
Field | Description |
---|---|
Source Name | Provide a source name for the target table. |
Source Catalog Name | Provide a source catalog name. The source will have limited assets belonging to just this catalog. |
Fetch Data Using | The mechanism through which Infoworks fetches data from the database. Default option is Spark. |
Data Environment | Select the environment where the tables are registered. Infoworks will spawn a spark session in the persistent cluster running in the environment and fetch all the tables registered. |
Storage | Select from one of the storage options defined in the environment. |
Base Location | The path to the base/target directory where all the data should be stored. |
Schema Name | The schema name of the target table. |
You can select the tables for which the metadata crawl is required. You can add more tables later.