Celonis Product Documentation

PU_COUNT_DISTINCT
Description

Calculates the number of distinct elements in the specified source column for each element in the given target table.

PU_COUNT_DISTINCT can be applied to a source column of any data type. The data type of the result is always INT.

Syntax
 PU_COUNT_DISTINCT ( target_table, source_table.column [, filter_expression] )
  • target_table: The table to which the aggregation result should be pulled. This can be:

  • source_table.column: The column which should be aggregated for every row of the target_table.

  • filter_expression (optional): An optional filter expression to specify which values of the source_table.column should be taken into account for the aggregation.

NULL handling

If no value in the source table column exists for the element in the target table (either because all values of the source table are filtered out, or because no corresponding value exists in the first place), 0 will be returned. NULL values in the source table column are treated as if the row does not exist.
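The two rules above can be illustrated outside of PQL. The following is a minimal Python sketch of the described semantics, not Celonis code: the helper name `pu_count_distinct`, the use of `None` to stand in for NULL, and the sample rows are all assumptions made for illustration.

```python
def pu_count_distinct(target_keys, source_rows):
    """Model of the documented behavior: distinct non-NULL values per target key.

    source_rows is a list of (key, value) pairs; None stands in for NULL.
    """
    result = {}
    for key in target_keys:
        # NULL values are treated as if the row did not exist.
        values = {v for k, v in source_rows if k == key and v is not None}
        # An empty set yields 0, matching the "no value exists" rule.
        result[key] = len(values)
    return result


# Hypothetical sample data: '002' has only a NULL value, '003' has no rows at all.
source = [("001", 600), ("001", None), ("001", 600), ("002", None)]
counts = pu_count_distinct(["001", "002", "003"], source)
print(counts)  # {'001': 1, '002': 0, '003': 0}
```

In this model, a target element with only NULL source values and a target element with no source rows at all both end up with 0.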

Examples

[1]

Count the number of distinct values for each company code:

Query

Column1

         "companyDetail"."companyCode"
        

Column2

         PU_COUNT_DISTINCT ( "companyDetail" , "caseTable"."value" )
        

Input

caseTable

caseId : int | companyCode : string | value : int
------------ | -------------------- | -----------
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail

companyCode : string | country : string
-------------------- | ----------------
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys

caseTable.companyCode | companyDetail.companyCode

Output

Result

Column1 : string | Column2 : int
---------------- | -------------
'001'            | 3
'002'            | 1
'003'            | 1
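The result of example [1] can be reproduced with a small Python model of the pull-up aggregation. This is a sketch under assumptions, not a description of how Celonis evaluates the query: the tables are plain lists of dicts, the join follows the foreign key on companyCode, and the helper name `pu_count_distinct` is hypothetical.

```python
case_table = [
    {"caseId": 1, "companyCode": "001", "value": 600},
    {"caseId": 2, "companyCode": "001", "value": 400},
    {"caseId": 3, "companyCode": "001", "value": 200},
    {"caseId": 4, "companyCode": "002", "value": 300},
    {"caseId": 5, "companyCode": "002", "value": 300},
    {"caseId": 6, "companyCode": "003", "value": 200},
]
company_detail = [
    {"companyCode": "001", "country": "DE"},
    {"companyCode": "002", "country": "DE"},
    {"companyCode": "003", "country": "US"},
]


def pu_count_distinct(target, source, key, column):
    """For each target row, count distinct non-NULL source values joined via key."""
    rows = []
    for t in target:
        distinct = {
            s[column]
            for s in source
            if s[key] == t[key] and s[column] is not None
        }
        rows.append((t[key], len(distinct)))
    return rows


result = pu_count_distinct(company_detail, case_table, "companyCode", "value")
for company_code, count in result:
    print(company_code, count)
# 001 3
# 002 1
# 003 1
```

Company code '002' has two cases, but both carry the value 300, so its distinct count is 1.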

[2]

PU-functions can be used in a FILTER. In this example, only company codes with fewer than 2 distinct case table values are kept:

Query

Filter

         FILTER PU_COUNT_DISTINCT ( "companyDetail" , "caseTable"."value" ) < 2;
        

Column1

         "companyDetail"."companyCode"
        

Input

caseTable

caseId : int | companyCode : string | value : int
------------ | -------------------- | -----------
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail

companyCode : string | country : string
-------------------- | ----------------
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys

caseTable.companyCode | companyDetail.companyCode

Output

Result

Column1 : string
----------------
'002'
'003'
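The effect of the FILTER in example [2] can be sketched in Python as "compute one count per target row, then keep the rows that satisfy the condition". The tuple layout and variable names below are assumptions for illustration only.

```python
from collections import defaultdict

# (caseId, companyCode, value) rows from the example's caseTable.
case_table = [
    (1, "001", 600),
    (2, "001", 400),
    (3, "001", 200),
    (4, "002", 300),
    (5, "002", 300),
    (6, "003", 200),
]
company_codes = ["001", "002", "003"]  # companyDetail.companyCode

# Collect the distinct values per company code.
distinct_values = defaultdict(set)
for _case_id, code, value in case_table:
    distinct_values[code].add(value)

# Keep only target rows whose distinct count is below 2 (the FILTER condition).
kept = [code for code in company_codes if len(distinct_values[code]) < 2]
print(kept)  # ['002', '003']
```

Company code '001' has three distinct values and is therefore dropped.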

[3]

PU-functions can be used inside another aggregation function. In this example, the number of distinct case table values is calculated for each company code, and the maximum of these counts is returned:

Query

Column1

         MAX ( PU_COUNT_DISTINCT ( "companyDetail" , "caseTable"."value" ) )
        

Input

caseTable

caseId : int | companyCode : string | value : int
------------ | -------------------- | -----------
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail

companyCode : string | country : string
-------------------- | ----------------
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys

caseTable.companyCode | companyDetail.companyCode

Output

Result

Column1 : int
-------------
3
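The nesting in example [3] amounts to two stages: the PU function produces one value per companyDetail row, and the outer aggregation then runs over those values. A minimal Python sketch of that reading, with illustrative variable names that are not part of PQL:

```python
# (caseId, companyCode, value) rows from the example's caseTable.
case_table = [
    (1, "001", 600),
    (2, "001", 400),
    (3, "001", 200),
    (4, "002", 300),
    (5, "002", 300),
    (6, "003", 200),
]
company_codes = ["001", "002", "003"]  # companyDetail.companyCode

# Stage 1: one distinct count per companyDetail row (the PU function).
per_company = {
    code: len({value for _case_id, c, value in case_table if c == code})
    for code in company_codes
}

# Stage 2: the outer aggregation runs over those per-row results.
maximum = max(per_company.values())

print(per_company)  # {'001': 3, '002': 1, '003': 1}
print(maximum)  # 3
```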

[4]

Count the number of distinct values for each company code. Only consider cases with an ID larger than 2:

Query

Column1

         "companyDetail"."companyCode"
        

Column2

         PU_COUNT_DISTINCT ( "companyDetail" , "caseTable"."value" , "caseTable"."caseID" > 2 )
        

Input

caseTable

caseId : int | companyCode : string | value : int
------------ | -------------------- | -----------
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail

companyCode : string | country : string
-------------------- | ----------------
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys

caseTable.companyCode | companyDetail.companyCode

Output

Result

Column1 : string | Column2 : int
---------------- | -------------
'001'            | 1
'002'            | 1
'003'            | 1

[5]

Count the number of distinct values for each company code. Only consider cases with an ID larger than 3. All case table values for companyCode '001' are filtered out, which means that in this case, 0 is returned:

Query

Column1

         "companyDetail"."companyCode"
        

Column2

         PU_COUNT_DISTINCT ( "companyDetail" , "caseTable"."value" , "caseTable"."caseID" > 3 )
        

Input

caseTable

caseId : int | companyCode : string | value : int
------------ | -------------------- | -----------
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail

companyCode : string | country : string
-------------------- | ----------------
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys

caseTable.companyCode | companyDetail.companyCode

Output

Result

Column1 : string | Column2 : int
---------------- | -------------
'001'            | 0
'002'            | 1
'003'            | 1
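The optional filter expression used in examples [4] and [5] can be modeled as a predicate applied to the source rows before the distinct count is taken. The sketch below is plain Python under assumptions: the helper name, the `predicate` parameter, and the tuple layout are hypothetical and only mirror the documented behavior.

```python
# (caseId, companyCode, value) rows from the example's caseTable.
case_table = [
    (1, "001", 600),
    (2, "001", 400),
    (3, "001", 200),
    (4, "002", 300),
    (5, "002", 300),
    (6, "003", 200),
]
company_codes = ["001", "002", "003"]  # companyDetail.companyCode


def pu_count_distinct(codes, rows, predicate=lambda row: True):
    """Count distinct values per code, considering only rows that pass predicate."""
    counts = {}
    for code in codes:
        counts[code] = len({
            value
            for case_id, c, value in rows
            if c == code and predicate((case_id, c, value))
        })
    return counts


# Example [4]: only cases with an ID larger than 2.
over_2 = pu_count_distinct(company_codes, case_table, lambda row: row[0] > 2)
# Example [5]: only cases with an ID larger than 3.
over_3 = pu_count_distinct(company_codes, case_table, lambda row: row[0] > 3)

print(over_2)  # {'001': 1, '002': 1, '003': 1}
print(over_3)  # {'001': 0, '002': 1, '003': 1}
```

With the threshold at 3, every row of company code '001' is filtered out, so its count falls to 0 rather than the row disappearing from the result.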
