Getting started with Watcher

Getting started with Watcher

To complete these steps, you must obtain a license that includes the alerting features. For more information about Elastic license levels, see https://www.elastic.co/subscriptions and License management.

To set up a watch to start sending alerts:

Schedule the watch and define an input

A watch schedule controls how often a watch is triggered. The watch input gets the data that you want to evaluate.

To periodically search log data and load the results into the watch, you could use an interval schedule and a search input. For example, the following Watch searches the logs index for errors every 10 seconds:

  1. PUT _watcher/watch/log_error_watch
  2. {
  3. "trigger" : {
  4. "schedule" : { "interval" : "10s" }
  5. },
  6. "input" : {
  7. "search" : {
  8. "request" : {
  9. "indices" : [ "logs" ],
  10. "body" : {
  11. "query" : {
  12. "match" : { "message": "error" }
  13. }
  14. }
  15. }
  16. }
  17. }
  18. }

Schedules are typically configured to run less frequently. This example sets the interval to 10 seconds so you can easily see the watches being triggered. Since this watch runs so frequently, don’t forget to delete the watch when you’re done experimenting.

If you check the watch history you’ll see that the watch is being triggered every 10 seconds. However, the search isn’t returning any results so nothing is loaded into the watch payload.

For example, the following request retrieves the last ten watch executions (watch records) from the watch history:

  1. GET .watcher-history*/_search?pretty
  2. {
  3. "sort" : [
  4. { "result.execution_time" : "desc" }
  5. ]
  6. }

Add a condition

A condition evaluates the data you’ve loaded into the watch and determines if any action is required. Now that you’ve loaded log errors into the watch, you can define a condition that checks to see if any errors were found.

For example, the following compare condition simply checks to see if the search input returned any hits.

  1. PUT _watcher/watch/log_error_watch
  2. {
  3. "trigger" : { "schedule" : { "interval" : "10s" }},
  4. "input" : {
  5. "search" : {
  6. "request" : {
  7. "indices" : [ "logs" ],
  8. "body" : {
  9. "query" : {
  10. "match" : { "message": "error" }
  11. }
  12. }
  13. }
  14. }
  15. },
  16. "condition" : {
  17. "compare" : { "ctx.payload.hits.total" : { "gt" : 0 }}
  18. }
  19. }

The compare condition lets you easily compare against values in the execution context.

For this compare condition to evaluate to true, you need to add an event to the logs index that contains an error. For example, the following request adds a 404 error to the logs index:

  1. POST logs/event
  2. {
  3. "timestamp": "2015-05-17T18:12:07.613Z",
  4. "request": "GET index.html",
  5. "status_code": 404,
  6. "message": "Error: File not found"
  7. }

Once you add this event, the next time the watch executes its condition will evaluate to true. The condition result is recorded as part of the watch_record each time the watch executes, so you can verify whether or not the condition was met by searching the watch history:

  1. GET .watcher-history*/_search?pretty
  2. {
  3. "query" : {
  4. "bool" : {
  5. "must" : [
  6. { "match" : { "result.condition.met" : true }},
  7. { "range" : { "result.execution_time" : { "from" : "now-10s" }}}
  8. ]
  9. }
  10. }
  11. }

Configure an action

Recording watch records in the watch history is nice, but the real power of Watcher is being able to do something when the watch condition is met. A watch’s actions define what to do when the watch condition evaluates to true. You can send emails, call third-party webhooks, write documents to an Elasticsearch index, or log messages to the standard Elasticsearch log files.

For example, the following action writes a message to the Elasticsearch log when an error is detected.

  1. PUT _watcher/watch/log_error_watch
  2. {
  3. "trigger" : { "schedule" : { "interval" : "10s" }},
  4. "input" : {
  5. "search" : {
  6. "request" : {
  7. "indices" : [ "logs" ],
  8. "body" : {
  9. "query" : {
  10. "match" : { "message": "error" }
  11. }
  12. }
  13. }
  14. }
  15. },
  16. "condition" : {
  17. "compare" : { "ctx.payload.hits.total" : { "gt" : 0 }}
  18. },
  19. "actions" : {
  20. "log_error" : {
  21. "logging" : {
  22. "text" : "Found {{ctx.payload.hits.total}} errors in the logs"
  23. }
  24. }
  25. }
  26. }

Delete the Watch

Since the log_error_watch is configured to run every 10 seconds, make sure you delete it when you’re done experimenting. Otherwise, the noise from this sample watch will make it hard to see what else is going on in your watch history and log file.

To remove the watch, use the delete watch API:

  1. DELETE _watcher/watch/log_error_watch

Required security privileges

To enable users to create and manipulate watches, assign them the watcher_admin security role. Watcher admins can also view watches, watch history, and triggered watches.

To allow users to view watches and the watch history, assign them the watcher_user security role. Watcher users cannot create or manipulate watches; they are only allowed to execute read-only watch operations.

Where to go next