ArangoDB Server Log Options
Log levels and topics
ArangoDB’s log output is grouped into topics. --log.level
can be specified multiple times at startup, for as many topics as needed. The log verbosity and output files can be adjusted per log topic. For example
--log.level startup=trace --log.level queries=trace --log.level info
will log messages concerning startup at trace level, AQL queries at trace level and everything else at info level.
In a configuration file, it is written like this:
[log]
level = startup=trace
level = queries=trace
level = info
The available log levels are:
fatal
: only logs fatal errorserror
: only logs errorswarning
: only logs warnings and errorsinfo
: logs information messages, warnings and errorsdebug
: logs debug and information messages, warnings and errorstrace
: logs trace, debug and information messages, warnings and errors
Note that levels debug
and trace
will be very verbose.
See Log Levels in the Monitoring chapter for a detailed description of the different levels.
Some relevant log topics available in ArangoDB 3 are:
agency
: information about the Agencycollector
: information about the WAL collector’s statecompactor
: information about the collection datafile compactordatafiles
: datafile-related operationsmmap
: information about memory-mapping operations (including msync)performance
: performance-related messagesqueries
: executed AQL queries, slow queriesreplication
: replication-related inforequests
: HTTP requestsstartup
: information about server startup and shutdownthreads
: information about threads
See more log levels
Log outputs
The log option --log.output <definition>
allows directing the global or per-topic log output to different outputs. The output definition <definition>
can be one of
-
for stdin+
for stderrsyslog://<syslog-facility>
syslog://<syslog-facility>/<application-name>
file://<relative-path>
The option can be specified multiple times in order to configure the output for different log topics. To set up a per-topic output configuration, use --log.output <topic>=<definition>
, e.g.
queries=file://queries.txt
logs all queries to the file “queries.txt”.
Any occurrence of $PID
inside a log output value will be replaced at runtime with the actual process id. This enables logging to process-specific files, e.g.
--log.output 'file:///var/log/arangod.log.$PID'
Please note that the dollar sign may need extra escaping when specified from inside shells such as Bash.
The old option --log.file
is still available in 3.0 for convenience reasons. In 3.0 it is a shortcut for the more general option --log.output file://filename
.
The old option --log.requests-file
is still available in 3.0. It is now a shortcut for the more general option --log.output requests=file://...
.
Using --log.output
also allows directing log output to different files based on topics. For example, to log all AQL queries to a file “queries.log” one can use the options:
--log.level queries=trace --log.output queries=file:///path/to/queries.log
To additionally log HTTP request to a file named “requests.log” add the options:
--log.level requests=info --log.output requests=file:///path/to/requests.log
If you specify --log.file-mode octalvalue
then any newly created log file will use “octalvalue” as file mode. Please note that the umask
value will be applied as well.
If you specify --log.file-group name
then any newly created log file will try to use “name” as group name. Please note that you have to be a member of that group. Otherwise the group ownership will not be changed. Please note that this option is only available under Linux and Mac. It is not available under Windows.
Forcing direct output
The option --log.force-direct
can be used to disable logging in an extra logging thread. If set to true
, any log messages are immediately printed in the thread that triggered the log message. This is non-optimal for performance but can aid debugging. If set to false
, log messages are handed off to an extra logging thread, which asynchronously writes the log messages.
Time format
The option --log.time-format
controls the time format used in log output. The possible values for this option are:
Format | Example | Description |
---|---|---|
timestamp | 1553766923000 | unix timestamps, in seconds |
timestamp-millis | 1553766923000.123 | unix timestamps, in seconds, with millisecond precision |
timestamp-micros | 1553766923000.123456 | unix timestamps, in seconds, with microsecond precision |
uptime | 987654 | seconds since server start |
uptime-millis | 987654.123 | seconds since server start, with millisecond precision |
uptime-micros | 987654.123456 | seconds since server start, with microsecond precision |
utc-datestring | 2019-03-28T09:55:23Z | UTC-based date and time in format YYYY-MM-DDTHH:MM:SSZ |
utc-datestring-millis | 2019-03-28T09:55:23.123Z | like utc-datestring , but with millisecond precision |
local-datestring | 2019-03-28T10:55:23 | local date and time in format YYYY-MM-DDTHH:MM:SS |
Escaping
--log.escape value
This option toggles the escaping of log output.
If set to true
, the following characters in the log output are escaped:
- the carriage return character (hex
0d
) - the newline character (hex
0a
) - the tabstop character (hex
09
) - any other characters with an ordinal value less than hex
20
If the option is set to false
, no characters are escaped. Characters with an ordinal value less than hex 20
will not be printed in this mode but will be replaced with a space character (hex 20
).
A side effect of turning off the escaping is that it will reduce the CPU overhead for the logging. However, this will only be noticeable when logging is set to a very verbose level (e.g. debug or trace).
The default value for this option is true
.
Maximum line length
Introduced in: v3.7.9
--log.max-entry-length value
This option can be used to limit the maximum line length for individual log messages that are written into normal logfiles by arangod.
This option does not include audit log messages. See --audit.max-entry-length instead.
Any log messages longer than the specified value will be truncated and the suffix ...
will be added to them.
The purpose of this parameter is to shorten long log messages in case there is lot a lot of space for logfiles, and to keep rogue log messages from overusing resources.
The default value is 128 MB, which is very high and should effectively mean downwards-compatibility with previous arangod versions, which did not restrict the maximum size of log messages.
Color logging
--log.color value
Logging to terminal output is by default colored. Colorful logging can be turned off by setting the value to false.
Source function, file and line number
--log.line-number
If enabled, then log messages will include the function name, file name and line number of the source code that issued the log message. The format is func@FileName.cpp:123
.
Example:
2021-06-08T16:09:31Z [1] INFO [prepare@GreetingsFeature.cpp:43] [e52b0] ArangoDB 3.7.11 [linux] 64bit, using jemalloc, build tags/v3.7.11-0-g5ca39c161b, VPack 0.1.33, RocksDB 6.8.0, ICU 64.2, V8 7.9.317, OpenSSL 1.1.1k 25 Mar 2021
2021-06-08T16:09:31Z [1] INFO [prepare@EnvironmentFeature.cpp:68] [75ddc] detected operating system: Linux version 4.15.0-140-generic (buildd@lgw01-amd64-054) (gcc version 7.5.0 (Ubuntu 7.5.0-3ubuntu1~18.04)) #144-Ubuntu SMP Fri Mar 19 14:12:35 UTC 2021
2021-06-08T16:09:31Z [1] INFO [prepare@EnvironmentFeature.cpp:251] [25362] {memory} Available physical memory: 67513589760 bytes, available cores: 32
2021-06-08T16:09:31Z [1] WARNING [prepare@EnvironmentFeature.cpp:426] [3e451] {memory} It is recommended to set NUMA to interleaved.
2021-06-08T16:09:31Z [1] WARNING [prepare@EnvironmentFeature.cpp:428] [b25a4] {memory} put 'numactl --interleave=all' in front of your command
2021-06-08T16:09:31Z [1] INFO [prepare@AuthenticationFeature.cpp:190] [43396] {authentication} Jwt secret not specified, generating...
2021-06-08T16:09:31Z [1] INFO [prepare@EngineSelectorFeature.cpp:187] [144fe] using storage engine 'rocksdb'
2021-06-08T16:09:31Z [1] INFO [reportRole@ClusterFeature.cpp:410] [3bb7d] {cluster} Starting up with role SINGLE
2021-06-08T16:09:31Z [1] INFO [start@FileDescriptorsFeature.cpp:90] [a1c60] {syscall} file-descriptors (nofiles) hard limit is 1048576, soft limit is 1048576
2021-06-08T16:09:31Z [1] INFO [start@AuthenticationFeature.cpp:222] [3844e] {authentication} Authentication is turned off, authentication for unix sockets is turned on
2021-06-08T16:09:32Z [1] INFO [start@IResearchFeature.cpp:972] [c1b63] {arangosearch} ArangoSearch maintenance: [5..5] commit thread(s), [5..5] consolidation thread(s)
2021-06-08T16:09:32Z [1] INFO [dump@EndpointList.cpp:222] [6ea38] using endpoint 'http+tcp://0.0.0.0:8529' for non-encrypted requests
2021-06-08T16:09:32Z [1] INFO [start@BootstrapFeature.cpp:370] [cf3f4] ArangoDB (version 3.7.11 [linux]) is ready for business. Have fun!
Prefix
Log prefix: --log.prefix
This option specifies a prefix for log messages.
Example: arangod ... --log.prefix "-->"
2020-07-23T09:46:03Z --> [17493] INFO ...
Process ID, Thread ID and Name
Log Process identifier: --log.process
(introduced in 3.8.0)
Log thread identifier: --log.thread
Log thread name: --log.thread-name
When log output is generated, the process ID is emitted as part of the log information by default. This can be turned off by adjusting the --log.process
option.
The thread ID is not emitted by default, but it can be enabled by setting the option --log.thread
.
To also log thread names, it is possible to set the --log.thread-name
option. By default --log.thread-name
is set to false
.
Here is an example that only contains the process ID (19355 in this case):
2010-09-20T13:04:01Z [19355] ... ready for business
And here is an example that also contains the thread ID in addition:
2010-09-20T13:04:17Z [19371-18446744072487317056] ... ready for business
And another example with process and thread identifier logging disabled, but thread name logging turned on:
2010-09-20T13:04:29Z [main] ... ready for business
IDs
Log IDs: --log.ids true
Since ArangoDB 3.5, each log invocation in the ArangoDB source code contains a unique log ID, which can be used to quickly find the location in the source code that produced a specific log message. These log IDs are shown by default, unless the option --log.ids
is set to false
.
Log IDs are printed as 5-digit hexadecimal identifiers in square brackets between the log level and the log topic, e.g.
2020-06-22T21:16:48Z [39028] INFO [144fe] {general} using storage engine 'rocksdb'
(where 144fe
is the log ID).
Role
Log role: --log.role
When set to true
, this option will make the ArangoDB logger print a single character with the server’s role into each logged message. The roles are:
- U: Undefined / unclear (used at startup)
- S: Single server
- C: Coordinator
- P: Primary / DB-Server
- A: Agent
The default value for this option is false
, so no roles will be logged.
Hostname
Log hostname: --log.hostname
This option specifies an optional hostname to be logged at the beginning of each log message (for regular logging) or inside the hostname
attribute (for JSON-based logging).
The default value is the empty string, meaning no hostnames will be logged. Setting this option to a value of auto
will automatically determine the hostname and use that value.
Example: arangod ... --log.hostname "auto"
JSON log output
Introduced in: v3.8.0
Toggle JSON log output: --log.use-json-format
This option can be used to switch log output to JSON format. Each log message then produces a separate line with JSON-encoded log data, which can be consumed by applications.
The attributes produced for each log message JSON object are:
Key | Value |
---|---|
time | date/time of log message, in format specified by —log.time-format |
prefix | only emitted if —log.prefix is set |
pid | process id, only emitted if —log.process is set |
tid | thread id, only emitted if —log.thread is set |
thread | thread name, only emitted if —log.thread-name is set |
role | server role (1 character), only emitted if —log.role is set |
level | log level (e.g. “WARN” , “INFO” ) |
file | source file name of log message, only emitted if —log.line-number is set |
line | source file line of log message, only emitted if —log.line-number is set |
function | source file function name, only emitted if —log.line-number is set |
topic | log topic name |
id | log id (5 digit hexadecimal string), only emitted if —log.ids is set |
hostname | hostname if —log.hostname is set |
message | the actual log message payload |
Log API Access
Introduced in: v3.4.11, v3.5.6, v3.6.5, v3.7.1
/_admin/log
control: --log.api-enabled
Credentials data is not written to log files. Nevertheless, some logged data might be sensitive depending on the context of the deployment. For example, if request logging is switched on, user requests and corresponding data might end up in log files. Therefore, a certain care with log files is recommended.
Since the database server offers an API to control logging and query logging data, this API has to be secured properly. By default, the API is accessible for admin users (administrative access to the _system
database). However, one can lock this down further.
The possible values for this option are:
true
: The API/_admin/log
is accessible for admin users.jwt
: The API/_admin/log
is accessible only for the superuser (authentication with JWT token and empty username).false
: The API/_admin/log
is not accessible at all.
The default value is true
.
Logging to memory buffers
Introduced in: v3.8.0
Log to memory: --log.in-memory
This option can be used to toggle storing log messages in memory, from which they can be consumed via the /_admin/log
HTTP API and via the Web UI. By default, this option is turned on, so log messages are consumable via the API and UI. Turning this option off will disable that functionality, save a tiny bit of memory for the in-memory log buffers and prevent potential log information leakage via these means.
Log level control for in-memory log messages: --log.in-memory-level
This option can be used to control which log messages are preserved in memory (in case --log.in-memory
is set to true). The default value is info
, meaning all log messages of types info
, warning
, error
and fatal
will be stored by an instance in memory. By setting this option to warning
, only warning, error and fatal log messages will be preserved in memory, and by setting the option to error
only error and fatal messages will be kept. This option is useful because the number of in-memory log messages is limited to the latest 2048 messages, and these slots are by default shared between informational, warning and error messages.