Moving percentiles aggregation
Moving percentiles aggregation
Given an ordered series of percentiles, the Moving Percentile aggregation will slide a window across those percentiles and allow the user to compute the cumulative percentile.
This is conceptually very similar to the Moving Function pipeline aggregation, except it works on the percentiles sketches instead of the actual buckets values.
Syntax
A moving_percentiles
aggregation looks like this in isolation:
{
"moving_percentiles": {
"buckets_path": "the_percentile",
"window": 10
}
}
Table 76. moving_percentiles
Parameters
Parameter Name | Description | Required | Default Value |
---|---|---|---|
| Path to the percentile of interest (see buckets_path Syntax for more details | Required | |
| The size of window to “slide” across the histogram. | Required | |
| Shift of window position. | Optional | 0 |
moving_percentiles
aggregations must be embedded inside of a histogram
or date_histogram
aggregation. They can be embedded like any other metric aggregation:
resp = client.search(
size=0,
aggs={
"my_date_histo": {
"date_histogram": {
"field": "date",
"calendar_interval": "1M"
},
"aggs": {
"the_percentile": {
"percentiles": {
"field": "price",
"percents": [
1,
99
]
}
},
"the_movperc": {
"moving_percentiles": {
"buckets_path": "the_percentile",
"window": 10
}
}
}
}
},
)
print(resp)
response = client.search(
body: {
size: 0,
aggregations: {
my_date_histo: {
date_histogram: {
field: 'date',
calendar_interval: '1M'
},
aggregations: {
the_percentile: {
percentiles: {
field: 'price',
percents: [
1,
99
]
}
},
the_movperc: {
moving_percentiles: {
buckets_path: 'the_percentile',
window: 10
}
}
}
}
}
}
)
puts response
const response = await client.search({
size: 0,
aggs: {
my_date_histo: {
date_histogram: {
field: "date",
calendar_interval: "1M",
},
aggs: {
the_percentile: {
percentiles: {
field: "price",
percents: [1, 99],
},
},
the_movperc: {
moving_percentiles: {
buckets_path: "the_percentile",
window: 10,
},
},
},
},
},
});
console.log(response);
POST /_search
{
"size": 0,
"aggs": {
"my_date_histo": {
"date_histogram": {
"field": "date",
"calendar_interval": "1M"
},
"aggs": {
"the_percentile": {
"percentiles": {
"field": "price",
"percents": [ 1.0, 99.0 ]
}
},
"the_movperc": {
"moving_percentiles": {
"buckets_path": "the_percentile",
"window": 10
}
}
}
}
}
}
A | |
A | |
Finally, we specify a |
Moving percentiles are built by first specifying a histogram
or date_histogram
over a field. You then add a percentile metric inside of that histogram. Finally, the moving_percentiles
is embedded inside the histogram. The buckets_path
parameter is then used to “point” at the percentiles aggregation inside of the histogram (see buckets_path Syntax for a description of the syntax for buckets_path
).
And the following may be the response:
{
"took": 11,
"timed_out": false,
"_shards": ...,
"hits": ...,
"aggregations": {
"my_date_histo": {
"buckets": [
{
"key_as_string": "2015/01/01 00:00:00",
"key": 1420070400000,
"doc_count": 3,
"the_percentile": {
"values": {
"1.0": 151.0,
"99.0": 200.0
}
}
},
{
"key_as_string": "2015/02/01 00:00:00",
"key": 1422748800000,
"doc_count": 2,
"the_percentile": {
"values": {
"1.0": 10.4,
"99.0": 49.6
}
},
"the_movperc": {
"values": {
"1.0": 151.0,
"99.0": 200.0
}
}
},
{
"key_as_string": "2015/03/01 00:00:00",
"key": 1425168000000,
"doc_count": 2,
"the_percentile": {
"values": {
"1.0": 175.25,
"99.0": 199.75
}
},
"the_movperc": {
"values": {
"1.0": 11.6,
"99.0": 200.0
}
}
}
]
}
}
}
The output format of the moving_percentiles
aggregation is inherited from the format of the referenced percentiles aggregation.
Moving percentiles pipeline aggregations always run with skip
gap policy.
shift parameter
By default (with shift = 0
), the window that is offered for calculation is the last n
values excluding the current bucket. Increasing shift
by 1 moves starting window position by 1
to the right.
- To include current bucket to the window, use
shift = 1
. - For center alignment (
n / 2
values before and after the current bucket), useshift = window / 2
. - For right alignment (
n
values after the current bucket), useshift = window
.
If either of window edges moves outside the borders of data series, the window shrinks to include available values only.