Skip to main content

Celonis Product Documentation

PU_STRING_AGG
Description

The PU_STRING_AGG operator returns the concatenation of strings from the specified source column for each element in the given target table. The delimiter will be always inserted between the concatenation of the strings. Multiple order by expressions can be used in order to determine the order of the concatenation.

The PU_STRING_AGG function can only be applied to STRINGs.

Syntax
 PU_STRING_AGG ( target_table, source_table.column, delimiter [, filter_expression ] [, ORDER BY source_table.column [ASC|DESC] ] )
  • target_table: The table to which the aggregation result should be pulled. This can be:

  • source_table.column: The column which should be aggregated for every row of the target_table.

  • delimiter: A delimiter STRING what should be used to separate the strings in the result.

  • filter_expression (optional): An optional filter expression to specify which values of the source_table.column should be taken into account for the aggregation.

  • ORDER BY (optional): Elements of the specified column are used to determine the order in which the strings should be concatenated in the result. After the column, ASC (for ascending direction) or DESC (descending direction) can be specified. If the order direction is not specified, the ascending (ASC) order is used.

NULL handling

If no value in the source table column exists for the element in the target table (either because all values of the source table are filtered out, or because no corresponding value exists in the first place), NULL will be returned. NULL values in the source table column are treated as if the row does not exist.

Examples

[1]

Aggregate the activity column per case.

Query

Column1

         PU_STRING_AGG ( "caseTable" , "activityTable"."activity" , ' - ' )
        

Input

Output

activityTable

caseId : string

activity : string

timestamp : date

'1'

'A'

Tue Jan 01 2019 13:00:00.000

'1'

'B'

Tue Jan 01 2019 15:00:00.000

'2'

'C'

Fri Jan 04 2019 03:00:00.000

'2'

'D'

Sat Jan 05 2019 05:00:00.000

caseTable

caseId : string

'1'

'2'

Foreign Keys

activityTable.caseId

caseTable.caseId

Result

Column1 : string

'A - B'

'C - D'

[2]

Aggregate the activity column per casewith respect to the ascending sorted order of value 2 and the descending sorted order of value 1.

Query

Column1

         PU_STRING_AGG ( "caseTable" , "activityTable"."activity" , ' | ' , ORDER BY "activityTable"."value2" ASC , "activityTable"."value1" DESC )
        

Input

Output

activityTable

caseId : string

activity : string

timestamp : date

value1 : int

value2 : int

'3'

'Y'

Thu Jan 02 2020 12:00:00.000

7

10

'1'

'A'

Sat Jan 04 2020 22:00:00.000

31

15

'1'

'B'

Sun Jan 05 2020 23:00:00.000

62

12

'2'

'C'

Mon Jan 06 2020 12:00:00.000

12

19

'2'

'D'

Mon Jan 06 2020 14:00:00.000

14

9

'2'

'E'

Mon Jan 06 2020 20:00:00.000

3

9

'3'

'X'

Tue Jan 07 2020 01:00:00.000

31

9

'2'

'F'

Wed Jan 08 2020 08:00:00.000

15

9

'2'

'G'

Wed Jan 08 2020 10:00:00.000

4

21

'3'

'Z'

Thu Jan 09 2020 12:00:00.000

8

32

caseTable

caseId : string

'1'

'2'

'3'

Foreign Keys

activityTable.caseId

caseTable.caseId

Result

Column1 : string

'B | A'

'F | D | E | C | G'

'X | Y | Z'

[3]

Aggregate the activity column per case with respect to the ascending sorted order of value while only the activities which have a value between 10 and 30 are concatenated.

Query

Column1

         PU_STRING_AGG ( "caseTable" , "activityTable"."activity" , ' | ' , "activityTable"."value" BETWEEN 10 AND 30 , ORDER BY "activityTable"."value" )
        

Input

Output

activityTable

caseId : string

activity : string

timestamp : date

value : int

'1'

'A'

Sat Jan 04 2020 22:00:00.000

15

'1'

'B'

Sun Jan 05 2020 23:00:00.000

12

'2'

'C'

Mon Jan 06 2020 12:00:00.000

19

'2'

'D'

Mon Jan 06 2020 14:00:00.000

34

'2'

'E'

Mon Jan 06 2020 20:00:00.000

9

'2'

'F'

Wed Jan 08 2020 08:00:00.000

17

'2'

'G'

Wed Jan 08 2020 10:00:00.000

21

'3'

'Y'

Thu Jan 02 2020 12:00:00.000

10

'3'

'X'

Tue Jan 07 2020 01:00:00.000

9

'3'

'Z'

Thu Jan 09 2020 12:00:00.000

32

caseTable

caseId : string

'1'

'2'

'3'

Foreign Keys

activityTable.caseId

caseTable.caseId

Result

Column1 : string

'B | A'

'F | C | G'

'Y'

[4]

Aggregate the activities per case where country is not 'FR' while the aggregation of the activities happens with respect to the ascending order of the eventtimes from the activityTable

Query

Column1

         PU_STRING_AGG ( "companyDetail" , "activityTable"."activity" , ',' , "companyDetail"."country" <> 'FR' , ORDER BY "activityTable"."eventtime" )
        

Input

Output

activityTable

caseId : int

activity : string

eventtime : date

1

'A'

Thu Dec 31 2015 22:13:20.000

1

'B'

Fri Jan 01 2016 01:00:00.000

1

'C'

Fri Jan 01 2016 03:46:40.000

2

'A'

Thu Dec 31 2015 22:30:00.000

2

'E'

Thu Dec 31 2015 22:46:40.000

2

'F'

Thu Dec 31 2015 22:46:40.000

2

'G'

Thu Dec 31 2015 23:03:20.000

3

'A'

Fri Jan 01 2016 03:46:40.000

3

'H'

Fri Jan 01 2016 06:33:20.000

4

'A'

Thu Dec 31 2015 22:15:00.000

5

'A'

Thu Dec 31 2015 22:16:40.000

5

'B'

Thu Dec 31 2015 22:18:20.000

5

'C'

Thu Dec 31 2015 22:20:00.000

6

'A'

Thu Dec 31 2015 22:14:10.000

7

'A'

Fri Jan 01 2016 17:40:00.000

8

'A'

Fri Jan 01 2016 20:26:40.000

companyDetail

caseId : int

country : string

1

'DE'

2

'DE'

3

'UK'

4

'DE'

5

'US'

6

'FR'

7

'DE'

8

'US'

companyResource

caseId : int

resource : int

1

500

2

400

3

100

4

-400

5

50

6

0

7

10

8

5000

Foreign Keys

companyDetail.caseId

activityTable.caseId

companyResource.caseId

activityTable.caseId

Result

Column1 : string

'A,B,C'

'A,E,F,G'

'A,H'

'A'

'A,B,C'

null

'A'

'A'

[5]

One can use the INDEX_ORDER to aggregate only distinct values. The example below calculates the distinct currencies involved in each case.

Query

Column1

         PU_STRING_AGG ( "caseTable" , "activityTable"."currency" , '-' , INDEX_ORDER ( "activityTable"."currency" , GROUP ( "caseTable"."caseId" , "activityTable"."currency" ) ) = 1 )
        

Input

Output

activityTable

caseId : int

activity : string

timestamp : date

currency : string

1

'A'

Sat Jan 04 2020 22:00:00.000

'EUR'

1

'B'

Sun Jan 05 2020 22:00:00.000

'USD'

1

'C'

Mon Jan 06 2020 22:00:00.000

'EUR'

1

'B'

Tue Jan 07 2020 22:00:00.000

'EUR'

2

'A'

Wed Jan 08 2020 22:00:00.000

'USD'

2

'B'

Thu Jan 09 2020 22:00:00.000

'USD'

caseTable

caseId : int

values : int

1

100

2

200

Foreign Keys

activityTable.caseId

caseTable.caseId

Result

Column1 : string

'EUR-USD'

'USD'

See also: