Having filters (groupBy)

Apache Druid supports two query languages: Druid SQL and native queries. This document describes the native language. For information about functions available in SQL, refer to the SQL documentation.

A having clause is a JSON object identifying which rows from a groupBy query should be returned, by specifying conditions on aggregated values.

It is essentially the equivalent of the HAVING clause in SQL.

Apache Druid supports the following types of having clauses.

Query filters

Query filter HavingSpecs allow all Druid query filters to be used in the Having part of the query.

The grammar for a query filter HavingSpec is:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type" : "filter",
  8. "filter" : <any Druid query filter>
  9. }
  10. }

For example, to use a selector filter:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type" : "filter",
  8. "filter" : {
  9. "type": "selector",
  10. "dimension" : "<dimension>",
  11. "value" : "<dimension_value>"
  12. }
  13. }
  14. }

You can use “filter” HavingSpecs to filter on the timestamp of result rows by applying a filter to the “__time” column.

Numeric filters

The simplest having clause is a numeric filter. Numeric filters can be used as the base filters for more complex boolean expressions of filters.

Here’s an example of a having-clause numeric filter:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "greaterThan",
  8. "aggregation": "<aggregate_metric>",
  9. "value": <numeric_value>
  10. }
  11. }

Equal To

The equalTo filter will match rows with a specific aggregate value. The grammar for an equalTo filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "equalTo",
  8. "aggregation": "<aggregate_metric>",
  9. "value": <numeric_value>
  10. }
  11. }

This is the equivalent of HAVING <aggregate> = <value>.

Greater Than

The greaterThan filter will match rows with aggregate values greater than the given value. The grammar for a greaterThan filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "greaterThan",
  8. "aggregation": "<aggregate_metric>",
  9. "value": <numeric_value>
  10. }
  11. }

This is the equivalent of HAVING <aggregate> > <value>.

Less Than

The lessThan filter will match rows with aggregate values less than the specified value. The grammar for a greaterThan filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "lessThan",
  8. "aggregation": "<aggregate_metric>",
  9. "value": <numeric_value>
  10. }
  11. }

This is the equivalent of HAVING <aggregate> < <value>.

Dimension Selector Filter

dimSelector

The dimSelector filter will match rows with dimension values equal to the specified value. The grammar for a dimSelector filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "dimSelector",
  8. "dimension": "<dimension>",
  9. "value": <dimension_value>
  10. }
  11. }

Logical expression filters

AND

The grammar for an AND filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "and",
  8. "havingSpecs": [
  9. {
  10. "type": "greaterThan",
  11. "aggregation": "<aggregate_metric>",
  12. "value": <numeric_value>
  13. },
  14. {
  15. "type": "lessThan",
  16. "aggregation": "<aggregate_metric>",
  17. "value": <numeric_value>
  18. }
  19. ]
  20. }
  21. }

OR

The grammar for an OR filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "or",
  8. "havingSpecs": [
  9. {
  10. "type": "greaterThan",
  11. "aggregation": "<aggregate_metric>",
  12. "value": <numeric_value>
  13. },
  14. {
  15. "type": "equalTo",
  16. "aggregation": "<aggregate_metric>",
  17. "value": <numeric_value>
  18. }
  19. ]
  20. }
  21. }

NOT

The grammar for a NOT filter is as follows:

  1. {
  2. "queryType": "groupBy",
  3. "dataSource": "sample_datasource",
  4. ...
  5. "having":
  6. {
  7. "type": "not",
  8. "havingSpec":
  9. {
  10. "type": "equalTo",
  11. "aggregation": "<aggregate_metric>",
  12. "value": <numeric_value>
  13. }
  14. }
  15. }