The Aggregate node allows you to derive new columns by applying aggregate functions such as count, distinct count, sum, distinct sum, min, max, average, distinct average, and variance over a group of values. |
Following are the steps to apply the Aggregate node in a pipeline:
5. To set the expression, select the Advanced mode and enter the expression and click Save.
Field | Description |
---|---|
Column Name | The name for the output column with aggregated data. |
From Column | The column on which the aggregate function is to be performed. |
Function | The aggregate function to be applied on the column. The options include Count, Distinct Count, Sum, Distinct Sum, Avg, Distinct Avg, Min, Max, and Variance. |
DISTINCT(col1, col2)
is not supported by Snowflake and BigQuery: The pipeline build fails if this SQL is imported in Snowflake or BigQuery execution engine environments.For details, see Configurations-Post Processing.