The Aggregate node allows you to derive new columns by applying aggregate functions such as count, distinct count, sum, distinct sum, min, max, average, distinct average, and variance over a group of values.
Follow these steps to apply the Aggregate node in a pipeline:
1. Drag and drop the Aggregate node from the Transformations section to the pipeline editor page.
2. Connect the source node to the Aggregate node.
3. Double-click the Aggregate node. The properties page is displayed.
4. Click Add Group By, select the Group By Column by which the derived data is to be grouped, and click Save.
5. To set the expression, select the Advanced mode, enter the expression, and click Save.
Each aggregate column is defined by the following fields:
Field | Description |
---|---|
Column Name | The name for the output column with aggregated data. |
From Column | The column on which the aggregate function is to be performed. |
Function | The aggregate function to be applied on the column. The options include Count, Distinct Count, Sum, Distinct Sum, Avg, Distinct Avg, Min, Max, and Variance. |
For details, see Configurations-Post Processing.
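To make the effect of the Group By column and the listed functions concrete, the following is a minimal Python sketch of the same kind of computation. It is not the product's implementation: the sample rows, the `aggregate` helper, and the output key names are hypothetical, and the sketch assumes population variance, which may differ from what the Aggregate node computes.

```python
from collections import defaultdict
from statistics import mean, pvariance

# Hypothetical input rows: (group-by value, value to aggregate).
rows = [
    ("east", 10),
    ("east", 10),
    ("east", 30),
    ("west", 5),
    ("west", 15),
]


def aggregate(rows):
    """Group rows by the first field and aggregate the second field."""
    groups = defaultdict(list)
    for key, value in rows:
        groups[key].append(value)

    result = {}
    for key, values in groups.items():
        # Distinct variants drop duplicate values before aggregating.
        distinct = sorted(set(values))
        result[key] = {
            "count": len(values),
            "distinct_count": len(distinct),
            "sum": sum(values),
            "distinct_sum": sum(distinct),
            "avg": mean(values),
            "distinct_avg": mean(distinct),
            "min": min(values),
            "max": max(values),
            # Assumption: population variance; the product may use sample variance.
            "variance": pvariance(values),
        }
    return result


summary = aggregate(rows)
for group, stats in summary.items():
    print(group, stats)
```

In this sketch, the `east` group contains the value 10 twice, so its count is 3 and its sum is 50, while its distinct count is 2 and its distinct sum is 40. Each key in the inner dictionary plays the role of an output Column Name, and the second field of each row plays the role of the From Column.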