Asynchronous materialized view
Construction and maintenance of materialized views
Create materialized views
Prepare two tables and data
use tpch;
CREATE TABLE IF NOT EXISTS orders (
o_orderkey integer not null,
o_custkey integer not null,
o_orderstatus char(1) not null,
o_totalprice decimalv3(15,2) not null,
o_orderdate date not null,
o_orderpriority char(15) not null,
o_clerk char(15) not null,
o_shippriority integer not null,
o_comment varchar(79) not null
)
DUPLICATE KEY(o_orderkey, o_custkey)
PARTITION BY RANGE(o_orderdate)(
FROM ('2023-10-17') TO ('2023-10-20') INTERVAL 1 DAY)
DISTRIBUTED BY HASH(o_orderkey) BUCKETS 3
PROPERTIES ("replication_num" = "1");
insert into orders values
(1, 1, 'ok', 99.5, '2023-10-17', 'a', 'b', 1, 'yy'),
(2, 2, 'ok', 109.2, '2023-10-18', 'c','d',2, 'mm'),
(3, 3, 'ok', 99.5, '2023-10-19', 'a', 'b', 1, 'yy');
CREATE TABLE IF NOT EXISTS lineitem (
l_orderkey integer not null,
l_partkey integer not null,
l_suppkey integer not null,
l_linenumber integer not null,
l_quantity decimalv3(15,2) not null,
l_extendedprice decimalv3(15,2) not null,
l_discount decimalv3(15,2) not null,
l_tax decimalv3(15,2) not null,
l_returnflag char(1) not null,
l_linestatus char(1) not null,
l_shipdate date not null,
l_commitdate date not null,
l_receiptdate date not null,
l_shipinstruct char(25) not null,
l_shipmode char(10) not null,
l_comment varchar(44) not null
)
DUPLICATE KEY(l_orderkey, l_partkey, l_suppkey, l_linenumber)
PARTITION BY RANGE(l_shipdate)
(FROM ('2023-10-17') TO ('2023-10-20') INTERVAL 1 DAY)
DISTRIBUTED BY HASH(l_orderkey) BUCKETS 3
PROPERTIES ("replication_num" = "1");
insert into lineitem values
(1, 2, 3, 4, 5.5, 6.5, 7.5, 8.5, 'o', 'k', '2023-10-17', '2023-10-17', '2023-10-17', 'a', 'b', 'yyyyyyyyy'),
(2, 2, 3, 4, 5.5, 6.5, 7.5, 8.5, 'o', 'k', '2023-10-18', '2023-10-18', '2023-10-18', 'a', 'b', 'yyyyyyyyy'),
(3, 2, 3, 6, 7.5, 8.5, 9.5, 10.5, 'k', 'o', '2023-10-19', '2023-10-19', '2023-10-19', 'c', 'd', 'xxxxxxxxx');
Create materialized views
CREATE MATERIALIZED VIEW mv1
BUILD DEFERRED REFRESH AUTO ON MANUAL
partition by(l_shipdate)
DISTRIBUTED BY RANDOM BUCKETS 2
PROPERTIES ('replication_num' = '1')
AS
select l_shipdate, o_orderdate, l_partkey, l_suppkey, sum(o_totalprice) as sum_total
from lineitem
left join orders on lineitem.l_orderkey = orders.o_orderkey and l_shipdate = o_orderdate
group by
l_shipdate,
o_orderdate,
l_partkey,
l_suppkey;
Specific syntax can be viewed CREATE MATERIALIZED VIEW
View materialized view meta information
select * from mv_infos("database"="tpch") where Name="mv1";
The unique features of materialized views can be viewed through mv_infos()
Properties related to table, still viewed through SHOW TABLES
Refresh materialized view
The materialized view supports different refresh strategies, such as scheduled refresh and manual refresh. It also supports different refresh granularity, such as full refresh, incremental refresh of partition granularity, etc. Here we take manually refreshing partial partitions of the materialized view as an example.
First, check the list of materialized view partitions
SHOW PARTITIONS FROM mv1;
Refresh partition named p_20231017_20231018
REFRESH MATERIALIZED VIEW mv1 partitions(p_20231017_20231018);
Specific syntax can be viewed REFRESH MATERIALIZED VIEW
task management
Each materialized view defaults to a job responsible for refreshing data, which is used to describe the refresh strategy and other information of the materialized view. Each time a refresh is triggered, a task is generated, Task is used to describe specific refresh information, such as the time used for refreshing, which partitions were refreshed, etc
View jobs in materialized views
select * from jobs("type"="mv") order by CreateTime;
Specific syntax can be viewed jobs(“type”=”mv”)
Pause materialized view job scheduled scheduling
PAUSE MATERIALIZED VIEW JOB ON mv1;
Can pause the scheduled scheduling of materialized views
Specific syntax can be viewed PAUSE MATERIALIZED VIEW JOB
RESUME materialized view job scheduling
RESUME MATERIALIZED VIEW JOB ON mv1;
Can RESUME scheduled scheduling of materialized views
Specific syntax can be viewed RESUME MATERIALIZED VIEW JOB
Viewing tasks in materialized views
select * from tasks("type"="mv");
Specific syntax can be viewed tasks(“type”=”mv”)
Cancel the task of objectifying the view
CANCEL MATERIALIZED VIEW TASK realTaskId on mv1;
Can cancel the operation of this task
Specific syntax can be viewed CANCEL MATERIALIZED VIEW TASK
Modifying materialized views
Modify the properties of materialized views
ALTER MATERIALIZED VIEW mv1 set("grace_period"="3333");
Modify the name of the materialized view, the refresh method of the materialized view, and the unique properties of the materialized view can be viewed through ALTER MATERIALIZED VIEW
The materialized view itself is also a Table, so Table related properties, such as the number of copies, are still modified through the syntax related to ALTER TABLE
.
Delete materialized view
DROP MATERIALIZED VIEW mv1;
The materialized view has a dedicated deletion syntax and cannot be deleted through the drop table,
Specific syntax can be viewed DROP MATERIALIZED VIEW
The use of materialized views
can be viewed Query async materialized view
Notice
- Asynchronous materialized views are only supported for use in the Nereids optimizer, Nereids optimizer
- Currently, determining the synchronization between materialized views and base tables is only supported for
OlapTable
. For other types of external tables, they are directly considered to be synchronized. For instance, if the base tables of a materialized view are all external tables, they are assumed to be synchronized. When queryingmv_infos()
, the SyncWithBaseTables flag will always return 1 (true) for these external tables. When refreshing a materialized view, it is necessary to manually refresh specific partitions or specifycomplete
to refresh all partitions.