rqlite Design
Learn about the design and implementation of the database
rqlite has been in development since 2014, and its design and implementation has evolved substantially during that time. The distributed consensus system has changed, the API has improved enormously, and support for automatic clustering and node-discovery was introduced along the way.
High-level design
The diagram below shows a high-level view of a rqlite node, as it’s currently implemented.
Design presentations
There have also been a series of presentations to various groups – both industry and academic.
- Presentation given at the GoSF April 2016 Meetup.
- Presentation given to the University of Pittsburgh, April 2018.
- Presentation given to the Carnegie Mellon Database Group, September 2021. There is also a video recording of the talk.
- Presentation given to Hacker Nights NYC, March 2022.
- Build your own Distributed System using Go given at GopherCon 2023. While not specifically about rqlite, it explains the key principles behind building a system such as rqlite.
Blog posts
The most important design articles, linked below, show how the database has evolved through the years:
- Introduction to replicating SQLite with Raft (2014)
- Moving to an upgraded Raft consensus system (2016)
- Building an rqlite discovery service using AWS Lamda (2017)
- Moving from JSON to Protocol Buffers for internal data structures (2020)
- Comparing disk usage across database releases (2021)
- 7 years of open-source database development - lessons learned (2021)
- The evolution of a distributed database design (2021)
- Static linking and smaller Docker images (2021)
- Designing node discovery and automatic clustering (2022)
- Evaluating rqlite consistency with Jepsen-style testing (2022)
- Trading durability for write performance (2022)
- How rqlite exposed a bug in SQLite (2022)
- 9 years of open-source database development (2023)
- Adding large data set support (2023)
You can find many other details on rqlite from the rqlite blog.
Other design details
Raft
The Raft layer always creates a file – it creates the Raft log. This log stores the set of committed SQLite commands, in the order which they were executed. This log is authoritative record of every change that has happened to the system. It may also contain some read-only queries as entries, depending on read-consistency choices. Since every node in an rqlite cluster applies the entries log in exactly the same way, this guarantees that the SQLite database is the same on every node.
SQLite
SQLite runs in WAL mode and with SYNCHRONOUS=off. In normal operation this configuration risks database corruption in the event of crash, but does provide substantially better write performance. However, since the SQLite database is completely recreated everytime rqlited
starts, using the information stored in the Raft log, corruption is a non-issue.
Log Compaction and Truncation
rqlite automatically performs log compaction, so that disk usage due to the log remains bounded. After a configurable number of changes rqlite snapshots the SQLite database, and truncates the Raft log. This is a technical feature of the Raft consensus system, and most users of rqlite need not be concerned with this.
Last modified December 7, 2023: More design details (330a10e)