Functions
Aggregate Functions
Aggregate queries are supported in TDengine by the following aggregate functions and selection functions.
COUNT
SELECT COUNT([*|field_name]) FROM tb_name [WHERE clause];
Description: Get the number of rows or the number of non-null values in a table or a super table.
Return value type: Long integer INT64
Applicable column types: All
Applicable table types: table, super table, sub table
More explanation:
- Wildcard (*) is used to represent all columns. The
COUNT
function is used to get the total number of rows. - The number of non-NULL values will be returned if this function is used on a specific column.
Examples:
taos> SELECT COUNT(*), COUNT(voltage) FROM meters;
count(*) | count(voltage) |
================================================
9 | 9 |
Query OK, 1 row(s) in set (0.004475s)
taos> SELECT COUNT(*), COUNT(voltage) FROM d1001;
count(*) | count(voltage) |
================================================
3 | 3 |
Query OK, 1 row(s) in set (0.001075s)
AVG
SELECT AVG(field_name) FROM tb_name [WHERE clause];
Description: Get the average value of a column in a table or STable
Return value type: Double precision floating number
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
Examples:
taos> SELECT AVG(current), AVG(voltage), AVG(phase) FROM meters;
avg(current) | avg(voltage) | avg(phase) |
====================================================================================
11.466666751 | 220.444444444 | 0.293333333 |
Query OK, 1 row(s) in set (0.004135s)
taos> SELECT AVG(current), AVG(voltage), AVG(phase) FROM d1001;
avg(current) | avg(voltage) | avg(phase) |
====================================================================================
11.733333588 | 219.333333333 | 0.316666673 |
Query OK, 1 row(s) in set (0.000943s)
TWA
SELECT TWA(field_name) FROM tb_name WHERE clause;
Description: Time weighted average on a specific column within a time range
Return value type: Double precision floating number
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations:
- Since version 2.1.3.0, function TWA can be used on stable with
GROUP BY
, i.e. timelines generated byGROUP BY tbname
on a STable.
IRATE
SELECT IRATE(field_name) FROM tb_name WHERE clause;
Description: instantaneous rate on a specific column. The last two samples in the specified time range are used to calculate instantaneous rate. If the last sample value is smaller, then only the last sample value is used instead of the difference between the last two sample values.
Return value type: Double precision floating number
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations:
- Since version 2.1.3.0, function IRATE can be used on stble with
GROUP BY
, i.e. timelines generated byGROUP BY tbname
on a STable.
SUM
SELECT SUM(field_name) FROM tb_name [WHERE clause];
Description: The sum of a specific column in a table or STable
Return value type: Double precision floating number or long integer
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
Examples:
taos> SELECT SUM(current), SUM(voltage), SUM(phase) FROM meters;
sum(current) | sum(voltage) | sum(phase) |
================================================================================
103.200000763 | 1984 | 2.640000001 |
Query OK, 1 row(s) in set (0.001702s)
taos> SELECT SUM(current), SUM(voltage), SUM(phase) FROM d1001;
sum(current) | sum(voltage) | sum(phase) |
================================================================================
35.200000763 | 658 | 0.950000018 |
Query OK, 1 row(s) in set (0.000980s)
STDDEV
SELECT STDDEV(field_name) FROM tb_name [WHERE clause];
Description: Standard deviation of a specific column in a table or STable
Return value type: Double precision floating number
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable (since version 2.0.15.1)
Examples:
taos> SELECT STDDEV(current) FROM d1001;
stddev(current) |
============================
1.020892909 |
Query OK, 1 row(s) in set (0.000915s)
LEASTSQUARES
SELECT LEASTSQUARES(field_name, start_val, step_val) FROM tb_name [WHERE clause];
Description: The linear regression function of the specified column and the timestamp column (primary key), start_val
is the initial value and step_val
is the step value.
Return value type: A string in the format of “(slope, intercept)”
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table only
Examples:
taos> SELECT LEASTSQUARES(current, 1, 1) FROM d1001;
leastsquares(current, 1, 1) |
=====================================================
{slop:1.000000, intercept:9.733334} |
Query OK, 1 row(s) in set (0.000921s)
Selection Functions
When any select function is used, timestamp column or tag columns including tbname
can be specified to show that the selected value are from which rows.
MIN
SELECT MIN(field_name) FROM {tb_name | stb_name} [WHERE clause];
Description: The minimum value of a specific column in a table or STable
Return value type: Same as the data type of the column being operated upon
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
Examples:
taos> SELECT MIN(current), MIN(voltage) FROM meters;
min(current) | min(voltage) |
======================================
10.20000 | 218 |
Query OK, 1 row(s) in set (0.001765s)
taos> SELECT MIN(current), MIN(voltage) FROM d1001;
min(current) | min(voltage) |
======================================
10.30000 | 218 |
Query OK, 1 row(s) in set (0.000950s)
MAX
SELECT MAX(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The maximum value of a specific column of a table or STable
Return value type: Same as the data type of the column being operated upon
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
Examples:
taos> SELECT MAX(current), MAX(voltage) FROM meters;
max(current) | max(voltage) |
======================================
13.40000 | 223 |
Query OK, 1 row(s) in set (0.001123s)
taos> SELECT MAX(current), MAX(voltage) FROM d1001;
max(current) | max(voltage) |
======================================
12.60000 | 221 |
Query OK, 1 row(s) in set (0.000987s)
FIRST
SELECT FIRST(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The first non-null value of a specific column in a table or STable
Return value type: Same as the column being operated upon
Applicable column types: Any data type
Applicable table types: table, STable
More explanations:
- FIRST(*) can be used to get the first non-null value of all columns
- NULL will be returned if all the values of the specified column are all NULL
- A result will NOT be returned if all the columns in the result set are all NULL
Examples:
taos> SELECT FIRST(*) FROM meters;
first(ts) | first(current) | first(voltage) | first(phase) |
=========================================================================================
2018-10-03 14:38:04.000 | 10.20000 | 220 | 0.23000 |
Query OK, 1 row(s) in set (0.004767s)
taos> SELECT FIRST(current) FROM d1002;
first(current) |
=======================
10.20000 |
Query OK, 1 row(s) in set (0.001023s)
LAST
SELECT LAST(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The last non-NULL value of a specific column in a table or STable
Return value type: Same as the column being operated upon
Applicable column types: Any data type
Applicable table types: table, STable
More explanations:
- LAST(*) can be used to get the last non-NULL value of all columns
- If the values of a column in the result set are all NULL, NULL is returned for that column; if all columns in the result are all NULL, no result will be returned.
- When it’s used on a STable, if there are multiple values with the timestamp in the result set, one of them will be returned randomly and it’s not guaranteed that the same value is returned if the same query is run multiple times.
Examples:
taos> SELECT LAST(*) FROM meters;
last(ts) | last(current) | last(voltage) | last(phase) |
========================================================================================
2018-10-03 14:38:16.800 | 12.30000 | 221 | 0.31000 |
Query OK, 1 row(s) in set (0.001452s)
taos> SELECT LAST(current) FROM d1002;
last(current) |
=======================
10.30000 |
Query OK, 1 row(s) in set (0.000843s)
TOP
SELECT TOP(field_name, K) FROM { tb_name | stb_name } [WHERE clause];
Description: The greatest k values of a specific column in a table or STable. If a value has multiple occurrences in the column but counting all of them in will exceed the upper limit k, then a part of them will be returned randomly.
Return value type: Same as the column being operated upon
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations:
- k must be in range [1,100]
- The timestamp associated with the selected values are returned too
- Can’t be used with
FILL
Examples:
taos> SELECT TOP(current, 3) FROM meters;
ts | top(current, 3) |
=================================================
2018-10-03 14:38:15.000 | 12.60000 |
2018-10-03 14:38:16.600 | 13.40000 |
2018-10-03 14:38:16.800 | 12.30000 |
Query OK, 3 row(s) in set (0.001548s)
taos> SELECT TOP(current, 2) FROM d1001;
ts | top(current, 2) |
=================================================
2018-10-03 14:38:15.000 | 12.60000 |
2018-10-03 14:38:16.800 | 12.30000 |
Query OK, 2 row(s) in set (0.000810s)
BOTTOM
SELECT BOTTOM(field_name, K) FROM { tb_name | stb_name } [WHERE clause];
Description: The least k values of a specific column in a table or STable. If a value has multiple occurrences in the column but counting all of them in will exceed the upper limit k, then a part of them will be returned randomly.
Return value type: Same as the column being operated upon
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations:
- k must be in range [1,100]
- The timestamp associated with the selected values are returned too
- Can’t be used with
FILL
Examples:
taos> SELECT BOTTOM(voltage, 2) FROM meters;
ts | bottom(voltage, 2) |
===============================================
2018-10-03 14:38:15.000 | 218 |
2018-10-03 14:38:16.650 | 218 |
Query OK, 2 row(s) in set (0.001332s)
taos> SELECT BOTTOM(current, 2) FROM d1001;
ts | bottom(current, 2) |
=================================================
2018-10-03 14:38:05.000 | 10.30000 |
2018-10-03 14:38:16.800 | 12.30000 |
Query OK, 2 row(s) in set (0.000793s)
PERCENTILE
SELECT PERCENTILE(field_name, P) FROM { tb_name } [WHERE clause];
Description: The value whose rank in a specific column matches the specified percentage. If such a value matching the specified percentage doesn’t exist in the column, an interpolation value will be returned.
Return value type: Double precision floating point
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table
More explanations: P is in range [0,100], when P is 0, the result is same as using function MIN; when P is 100, the result is same as function MAX.
Examples:
taos> SELECT PERCENTILE(current, 20) FROM d1001;
percentile(current, 20) |
============================
11.100000191 |
Query OK, 1 row(s) in set (0.000787s)
APERCENTILE
SELECT APERCENTILE(field_name, P[, algo_type])
FROM { tb_name | stb_name } [WHERE clause]
Description: Similar to PERCENTILE
, but a simulated result is returned
Return value type: Double precision floating point
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations
- P is in range [0,100], when P is 0, the result is same as using function MIN; when P is 100, the result is same as function MAX.
- algo_type can only be input as
default
ort-digest
, if it’s not specifieddefault
will be used, i.e.apercentile(column_name, 50)
is same asapercentile(column_name, 50, "default")
. - When
t-digest
is used,t-digest
sampling is used to calculate. It can be used from version 2.2.0.0.
Nested query: It can be used in both the outer query and inner query in a nested query.
taos> SELECT APERCENTILE(current, 20) FROM d1001;
apercentile(current, 20) |
============================
10.300000191 |
Query OK, 1 row(s) in set (0.000645s)
taos> select apercentile (count, 80, 'default') from stb1;
apercentile (c0, 80, 'default') |
==================================
601920857.210056424 |
Query OK, 1 row(s) in set (0.012363s)
taos> select apercentile (count, 80, 't-digest') from stb1;
apercentile (c0, 80, 't-digest') |
===================================
605869120.966666579 |
Query OK, 1 row(s) in set (0.011639s)
LAST_ROW
SELECT LAST_ROW(field_name) FROM { tb_name | stb_name };
Description: The last row of a table or STable
Return value type: Same as the column being operated upon
Applicable column types: Any data type
Applicable table types: table, STable
More explanations:
- When it’s used against a STable, multiple rows with the same and largest timestamp may exist, in this case one of them is returned randomly and it’s not guaranteed that the result is same if the query is run multiple times.
- Can’t be used with
INTERVAL
.
Examples:
taos> SELECT LAST_ROW(current) FROM meters;
last_row(current) |
=======================
12.30000 |
Query OK, 1 row(s) in set (0.001238s)
taos> SELECT LAST_ROW(current) FROM d1002;
last_row(current) |
=======================
10.30000 |
Query OK, 1 row(s) in set (0.001042s)
INTERP [Since version 2.3.1]
SELECT INTERP(field_name) FROM { tb_name | stb_name } [WHERE where_condition] [ RANGE(timestamp1,timestamp2) ] [EVERY(interval)] [FILL ({ VALUE | PREV | NULL | LINEAR | NEXT})];
Description: The value that matches the specified timestamp range is returned, if existing; or an interpolation value is returned.
Return value type: Same as the column being operated upon
Applicable column types: Numeric data types
Applicable table types: table, STable, nested query
More explanations
INTERP
is used to get the value that matches the specified time slice from a column. If no such value exists an interpolation value will be returned based onFILL
parameter.- The input data of
INTERP
is the value of the specified column and awhere
clause can be used to filter the original data. If nowhere
condition is specified then all original data is the input. - The output time range of
INTERP
is specified byRANGE(timestamp1,timestamp2)
parameter, with timestamp1<=timestamp2. timestamp1 is the starting point of the output time range and must be specified. timestamp2 is the ending point of the output time range and must be specified. IfRANGE
is not specified, then the timestamp of the first row that matches the filter condition is treated as timestamp1, the timestamp of the last row that matches the filter condition is treated as timestamp2. - The number of rows in the result set of
INTERP
is determined by the parameterEVERY
. Starting from timestamp1, one interpolation is performed for every time interval specifiedEVERY
parameter. IfEVERY
parameter is not used, the time windows will be considered as no ending timestamp, i.e. there is only one time window from timestamp1. - Interpolation is performed based on
FILL
parameter. No interpolation is performed ifFILL
is not used, that means either the original data that matches is returned or nothing is returned. INTERP
can only be used to interpolate in single timeline. So it must be used withgroup by tbname
when it’s used on a STable. It can’t be used withGROUP BY
when it’s used in the inner query of a nested query.- The result of
INTERP
is not influenced byORDER BY TIMESTAMP
, which impacts the output order only..
Examples: Based on the meters
schema used throughout the documents
- Single point linear interpolation between “2017-07-14 18:40:00” and “2017-07-14 18:40:00:
taos> SELECT INTERP(current) FROM t1 RANGE('2017-7-14 18:40:00','2017-7-14 18:40:00') FILL(LINEAR);
- Get original data every 5 seconds, no interpolation, between “2017-07-14 18:00:00” and “2017-07-14 19:00:00:
taos> SELECT INTERP(current) FROM t1 RANGE('2017-7-14 18:00:00','2017-7-14 19:00:00') EVERY(5s);
- Linear interpolation every 5 seconds between “2017-07-14 18:00:00” and “2017-07-14 19:00:00:
taos> SELECT INTERP(current) FROM t1 RANGE('2017-7-14 18:00:00','2017-7-14 19:00:00') EVERY(5s) FILL(LINEAR);
- Backward interpolation every 5 seconds
taos> SELECT INTERP(current) FROM t1 EVERY(5s) FILL(NEXT);
- Linear interpolation every 5 seconds between “2017-07-14 17:00:00” and “2017-07-14 20:00:00”
taos> SELECT INTERP(current) FROM t1 where ts >= '2017-07-14 17:00:00' and ts <= '2017-07-14 20:00:00' RANGE('2017-7-14 18:00:00','2017-7-14 19:00:00') EVERY(5s) FILL(LINEAR);
INTERP [Since version 2.0.15.0]
SELECT INTERP(field_name) FROM { tb_name | stb_name } WHERE ts='timestamp' [FILL ({ VALUE | PREV | NULL | LINEAR | NEXT})];
Description: The value of a specific column that matches the specified time slice
Return value type: Same as the column being operated upon
Applicable column types: Numeric data type
Applicable table types: table, STable
More explanations:
- Time slice must be specified. If there is no data matching the specified time slice, interpolation is performed based on
FILL
parameter. Conditions such as tags ortbname
can be usedWhere
clause can be used to filter data. - The timestamp specified must be within the time range of the data rows of the table or STable. If it is beyond the valid time range, nothing is returned even with
FILL
parameter. INTERP
can be used to query only single time point once.INTERP
can be used withEVERY
to get the interpolation value every time interval.- Examples:
taos> SELECT INTERP(*) FROM meters WHERE ts='2017-7-14 18:40:00.004';
interp(ts) | interp(current) | interp(voltage) | interp(phase) |
==========================================================================================
2017-07-14 18:40:00.004 | 9.84020 | 216 | 0.32222 |
Query OK, 1 row(s) in set (0.002652s)
If there is no data corresponding to the specified timestamp, an interpolation value is returned if interpolation policy is specified by FILL
parameter; or nothing is returned.
taos> SELECT INTERP(*) FROM meters WHERE tbname IN ('d636') AND ts='2017-7-14 18:40:00.005';
Query OK, 0 row(s) in set (0.004022s)
taos> SELECT INTERP(*) FROM meters WHERE tbname IN ('d636') AND ts='2017-7-14 18:40:00.005' FILL(PREV);
interp(ts) | interp(current) | interp(voltage) | interp(phase) |
==========================================================================================
2017-07-14 18:40:00.005 | 9.88150 | 217 | 0.32500 |
Query OK, 1 row(s) in set (0.003056s)
Interpolation is performed every 5 milliseconds between ['2017-7-14 18:40:00', '2017-7-14 18:40:00.014']
taos> SELECT INTERP(current) FROM d636 WHERE ts>='2017-7-14 18:40:00' AND ts<='2017-7-14 18:40:00.014' EVERY(5a);
ts | interp(current) |
=================================================
2017-07-14 18:40:00.000 | 10.04179 |
2017-07-14 18:40:00.010 | 10.16123 |
Query OK, 2 row(s) in set (0.003487s)
Scalar functions
DIFF
SELECT {DIFF(field_name, ignore_negative) | DIFF(field_name)} FROM tb_name [WHERE clause];
Description: The different of each row with its previous row for a specific column. ignore_negative
can be specified as 0 or 1, the default value is 1 if it’s not specified. 1
means negative values are ignored.
Return value type: Same as the column being operated upon
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations:
- The number of result rows is the number of rows subtracted by one, no output for the first row
- Since version 2.1.30,
DIFF
can be used on STable withGROUP by tbname
Examples:
taos> SELECT DIFF(current) FROM d1001;
ts | diff(current) |
=================================================
2018-10-03 14:38:15.000 | 2.30000 |
2018-10-03 14:38:16.800 | -0.30000 |
Query OK, 2 row(s) in set (0.001162s)
DERIVATIVE
SELECT DERIVATIVE(field_name, time_interval, ignore_negative) FROM tb_name [WHERE clause];
Description: The derivative of a specific column. The time rage can be specified by parameter time_interval
, the minimum allowed time range is 1 second (1s); the value of ignore_negative
can be 0 or 1, 1 means negative values are ignored.
Return value type: Double precision floating point
Applicable column types: Data types except for timestamp, binary, nchar and bool
Applicable table types: table, STable
More explanations:
- It is available since version 2.1.3.0, the number of result rows is the number of total rows in the time range subtracted by one, no output for the first row.
- It can be used together with
GROUP BY tbname
against a STable.
Examples:
taos> select derivative(current, 10m, 0) from t1;
ts | derivative(current, 10m, 0) |
========================================================
2021-08-20 10:11:22.790 | 0.500000000 |
2021-08-20 11:11:22.791 | 0.166666620 |
2021-08-20 12:11:22.791 | 0.000000000 |
2021-08-20 13:11:22.792 | 0.166666620 |
2021-08-20 14:11:22.792 | -0.666666667 |
Query OK, 5 row(s) in set (0.004883s)
SPREAD
SELECT SPREAD(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The difference between the max and the min of a specific column
Return value type: Double precision floating point
Applicable column types: Data types except for binary, nchar, and bool
Applicable table types: table, STable
More explanations: Can be used on a column of TIMESTAMP type, the result is the time range size.
Examples:
taos> SELECT SPREAD(voltage) FROM meters;
spread(voltage) |
============================
5.000000000 |
Query OK, 1 row(s) in set (0.001792s)
taos> SELECT SPREAD(voltage) FROM d1001;
spread(voltage) |
============================
3.000000000 |
Query OK, 1 row(s) in set (0.000836s)
CEIL
SELECT CEIL(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The rounded up value of a specific column
Return value type: Same as the column being used
Applicable data types: Data types except for timestamp, binary, nchar, bool
Applicable table types: table, STable
Applicable nested query: Inner query and outer query
More explanations:
- Can’t be used on any tags of any type
- Arithmetic operation can be performed on the result of
ceil
function - Can’t be used with aggregate functions
FLOOR
SELECT FLOOR(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The rounded down value of a specific column
More explanations: The restrictions are same as those of the CEIL
function.
ROUND
SELECT ROUND(field_name) FROM { tb_name | stb_name } [WHERE clause];
Description: The rounded value of a specific column.
More explanations: The restrictions are same as CEIL
function.
CSUM
SELECT CSUM(field_name) FROM { tb_name | stb_name } [WHERE clause]
Description: The cumulative sum of each row for a specific column. The number of output rows is same as that of the input rows.
Return value type: Long integer for integers; Double for floating points. Timestamp is returned for each row.
Applicable data types: Data types except for timestamp, binary, nchar, and bool
Applicable table types: table, STable
Applicable nested query: Inner query and Outer query
More explanations:
- Can’t be used on tags when it’s used on STable
- Arithmetic operation can’t be performed on the result of
csum
function - Can only be used with aggregate functions
Group by tbname
must be used together on a STable to force the result on a single timeline
Applicable versions: Since 2.3.0.x
MAVG
SELECT MAVG(field_name, K) FROM { tb_name | stb_name } [WHERE clause]
Description: The moving average of continuous k values of a specific column. If the number of input rows is less than k, nothing is returned. The applicable range is k is [1,1000].
Return value type: Double precision floating point
Applicable data types: Data types except for timestamp, binary, nchar, and bool
Applicable nested query: Inner query and Outer query
Applicable table types: table, STable
More explanations:
- Arithmetic operation can’t be performed on the result of
MAVG
. - Can only be used with data columns, can’t be used with tags.
- Can’t be used with aggregate functions.
- Must be used with
GROUP BY tbname
when it’s used on a STable to force the result on each single timeline.
Applicable versions: Since 2.3.0.x
SAMPLE
SELECT SAMPLE(field_name, K) FROM { tb_name | stb_name } [WHERE clause]
Description: k sampling values of a specific column. The applicable range of k is [1,10000]
Return value type: Same as the column being operated plus the associated timestamp
Applicable data types: Any data type except for tags of STable
Applicable table types: table, STable
Applicable nested query: Inner query and Outer query
More explanations:
- Arithmetic operation can’t be operated on the result of
SAMPLE
function - Must be used with
Group by tbname
when it’s used on a STable to force the result on each single timeline
Applicable versions: Since 2.3.0.x
Arithmetic Operations
SELECT field_name [+|-|*|/|%][Value|field_name] FROM { tb_name | stb_name } [WHERE clause];
Description: The sum, difference, product, quotient, or remainder between one or more columns
Return value type: Double precision floating point
Applicable column types: Data types except for timestamp, binary, nchar, bool
Applicable table types: table, STable
More explanations:
- Arithmetic operations can be performed on two or more columns. Parentheses
()
can be used to control the order of precedence. - NULL doesn’t participate in the operation i.e. if one of the operands is NULL then result is NULL
Examples:
taos> SELECT current + voltage * phase FROM d1001;
(current+(voltage*phase)) |
============================
78.190000713 |
84.540003240 |
80.810000718 |
Query OK, 3 row(s) in set (0.001046s)