Infoworks 5.4.2
Prepare Data

Performing Lookup Operation

Lookup node can be used to fetch column information (derived column) from the lookup source based on the rule that applies to each input record.

NOTE The Lookup node is not supported for pipelines in snowflake environment.

Following are the steps to apply Lookup node in pipeline:

  1. Double-click the Lookup node. The properties page is displayed.
  2. Enter the following details:
FieldDescription
Lookup Source Type

The source types supported are File and Table. If you select file, the File Path field is displayed. Provide the CSV file path.

NOTE Ensure you provide absolute path of the CSV file.

If you select table, the Schema Name and Table Name fields are displayed. Provide the Hive table details.

NOTE A lookup record is matched only if all the rules (Range/Lookup) are satisfied (AND of all the rules).

Priority Column NameColumn that indicates the lookup record priority. In case of multiple matching lookup records, the record with the highest priority will be used.
Derived Column NameColumn which must be derived from the lookup table.
Default Value for Derived ColumnDefault value if no rule matches from the lookup table.
Rule TypeLookup - to perform the exact value comparison with the lookup column, Range - to perform the range comparison with the lookup columns.
Input ColumnThe input column which must be compared against lookup column(s).
Lookup Table Column (Lookup rule)The lookup table column to be compared with given input column.
Start Column (Range rule)The lookup table column that specifies the start range for the specified input column.
End Column (Range rule)The lookup table column that specifies the end range for the specified input column.

NOTE For details on derivations, see Derivations.

Limitation

Currently, only non-overlapping ranges are supported.

  Last updated by Prerana Dutta