Sequence and Rowkind
When creating a table, you can set 'sequence.field' to name the fields that determine the order of updates, or set 'rowkind.field' to name the field that determines the changelog kind of each record.
Sequence Field
By default, a primary key table merges records in input order (the last input record is the last to merge). In distributed computing, however, records can arrive out of order. In that case, you can use a time field as sequence.field, for example:
Flink
CREATE TABLE my_table (
    pk BIGINT PRIMARY KEY NOT ENFORCED,
    v1 DOUBLE,
    v2 BIGINT,
    update_time TIMESTAMP
) WITH (
    'sequence.field' = 'update_time'
);
The record with the largest sequence.field value is the last to merge. If the values are equal, the input order determines which record is last. sequence.field supports fields of all data types.
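As an illustration of this ordering rule, the sketch below writes two records for the same key to my_table, with the stale update arriving last. The values and timestamps are made up for the example, and it assumes the table keeps a single merged row per key.

```sql
-- Illustrative values only; my_table is the table defined above.
-- The record with the later update_time is written first.
INSERT INTO my_table VALUES
    (CAST(1 AS BIGINT), CAST(1.0 AS DOUBLE), CAST(1 AS BIGINT), TIMESTAMP '2024-01-01 00:00:02');

-- A stale record for the same key arrives afterwards.
INSERT INTO my_table VALUES
    (CAST(1 AS BIGINT), CAST(2.0 AS DOUBLE), CAST(2 AS BIGINT), TIMESTAMP '2024-01-01 00:00:01');

-- With 'sequence.field' = 'update_time', the record with the larger update_time
-- (v1 = 1.0, v2 = 1) is the last to merge, even though it was written first.
SELECT * FROM my_table WHERE pk = 1;
```

Without 'sequence.field', the second INSERT would be the last to merge, because input order alone would decide.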
You can define multiple fields for sequence.field, for example 'update_time,flag'. Multiple fields are compared in the order they are listed.
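A table definition with two sequence fields might look like the following sketch. The table name my_table_multi_seq and the INT type of flag are assumptions made for illustration.

```sql
-- Hypothetical table: the name and the type of flag are assumptions.
CREATE TABLE my_table_multi_seq (
    pk BIGINT PRIMARY KEY NOT ENFORCED,
    v1 DOUBLE,
    update_time TIMESTAMP,
    flag INT
) WITH (
    -- Fields are compared in the listed order: update_time first, then flag.
    'sequence.field' = 'update_time,flag'
);
```

Here flag would only matter for records whose update_time values are equal, assuming the comparison proceeds field by field.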
User-defined sequence fields conflict with features such as first_row and first_value, which may lead to unexpected results.
Row Kind Field
By default, a primary key table takes the row kind from the input row. You can also set 'rowkind.field' to extract the row kind from a field instead.
The valid row kind strings are '+I', '-U', '+U', and '-D'.
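As a sketch of how this could be used, the table below stores the row kind in a string column. The table name my_table_rowkind and the column name row_kind are hypothetical; only the 'rowkind.field' option and the four row kind strings come from the text above.

```sql
-- Hypothetical table: the table name and the column name row_kind are assumptions.
CREATE TABLE my_table_rowkind (
    pk BIGINT PRIMARY KEY NOT ENFORCED,
    v1 DOUBLE,
    row_kind STRING
) WITH (
    'rowkind.field' = 'row_kind'
);

-- The string in row_kind is used as the changelog kind of each record.
INSERT INTO my_table_rowkind VALUES (CAST(1 AS BIGINT), CAST(1.0 AS DOUBLE), '+I');

-- This record carries '-D', so it is interpreted as a delete for pk = 1.
INSERT INTO my_table_rowkind VALUES (CAST(1 AS BIGINT), CAST(1.0 AS DOUBLE), '-D');
```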