Network Configuration Reference
Network configuration is critical for building a high performance CephStorage Cluster. The Ceph Storage Cluster does not perform request routing ordispatching on behalf of the Ceph Client. Instead, Ceph Clients makerequests directly to Ceph OSD Daemons. Ceph OSD Daemons perform data replicationon behalf of Ceph Clients, which means replication and other factors imposeadditional loads on Ceph Storage Cluster networks.
Our Quick Start configurations provide a trivial Ceph configuration file thatsets monitor IP addresses and daemon host names only. Unless you specify acluster network, Ceph assumes a single “public” network. Ceph functions justfine with a public network only, but you may see significant performanceimprovement with a second “cluster” network in a large cluster.
It is possible to run a Ceph Storage Cluster with two networks: a public(front-side) network and a cluster (back-side) network. However, this approachcomplicates network configuration (both hardware and software) and does not usually have a significant impact on overall performance. For this reason, we generally recommend that dual-NIC systems either be configured with two IPs on the same network, or bonded.
If, despite the complexity, one still wishes to use two networks, eachCeph Node will need to have more than one NIC. See HardwareRecommendations - Networks for additional details.
IP Tables
By default, daemons bind to ports within the 6800:7300
range. You mayconfigure this range at your discretion. Before configuring your IP tables,check the default iptables
configuration.
sudo iptables -L
Some Linux distributions include rules that reject all inbound requestsexcept SSH from all network interfaces. For example:
- REJECT all -- anywhere anywhere reject-with icmp-host-prohibited
You will need to delete these rules on both your public and cluster networksinitially, and replace them with appropriate rules when you are ready toharden the ports on your Ceph Nodes.
Monitor IP Tables
Ceph Monitors listen on ports 3300
and 6789
bydefault. Additionally, Ceph Monitors always operate on the publicnetwork. When you add the rule using the example below, make sure youreplace {iface}
with the public network interface (e.g., eth0
,eth1
, etc.), {ip-address}
with the IP address of the publicnetwork and {netmask}
with the netmask for the public network.
- sudo iptables -A INPUT -i {iface} -p tcp -s {ip-address}/{netmask} --dport 6789 -j ACCEPT
MDS and Manager IP Tables
A Ceph Metadata Server or Ceph Manager listens on the firstavailable port on the public network beginning at port 6800. Note that thisbehavior is not deterministic, so if you are running more than one OSD or MDSon the same host, or if you restart the daemons within a short window of time,the daemons will bind to higher ports. You should open the entire 6800-7300range by default. When you add the rule using the example below, make sureyou replace {iface}
with the public network interface (e.g., eth0
,eth1
, etc.), {ip-address}
with the IP address of the public networkand {netmask}
with the netmask of the public network.
For example:
- sudo iptables -A INPUT -i {iface} -m multiport -p tcp -s {ip-address}/{netmask} --dports 6800:7300 -j ACCEPT
OSD IP Tables
By default, Ceph OSD Daemons bind to the first available ports on a Ceph Nodebeginning at port 6800. Note that this behavior is not deterministic, so if youare running more than one OSD or MDS on the same host, or if you restart thedaemons within a short window of time, the daemons will bind to higher ports.Each Ceph OSD Daemon on a Ceph Node may use up to four ports:
One for talking to clients and monitors.
One for sending data to other OSDs.
Two for heartbeating on each interface.
When a daemon fails and restarts without letting go of the port, the restarteddaemon will bind to a new port. You should open the entire 6800-7300 port rangeto handle this possibility.
If you set up separate public and cluster networks, you must add rules for boththe public network and the cluster network, because clients will connect usingthe public network and other Ceph OSD Daemons will connect using the clusternetwork. When you add the rule using the example below, make sure you replace{iface}
with the network interface (e.g., eth0
, eth1
, etc.),{ip-address}
with the IP address and {netmask}
with the netmask of thepublic or cluster network. For example:
- sudo iptables -A INPUT -i {iface} -m multiport -p tcp -s {ip-address}/{netmask} --dports 6800:7300 -j ACCEPT
Tip
If you run Ceph Metadata Servers on the same Ceph Node as theCeph OSD Daemons, you can consolidate the public network configuration step.
Ceph Networks
To configure Ceph networks, you must add a network configuration to the[global]
section of the configuration file. Our 5-minute Quick Startprovides a trivial Ceph configuration file that assumes one public networkwith client and server on the same network and subnet. Ceph functions just finewith a public network only. However, Ceph allows you to establish much morespecific criteria, including multiple IP network and subnet masks for yourpublic network. You can also establish a separate cluster network to handle OSDheartbeat, object replication and recovery traffic. Don’t confuse the IPaddresses you set in your configuration with the public-facing IP addressesnetwork clients may use to access your service. Typical internal IP networks areoften 192.168.0.0
or 10.0.0.0
.
Tip
If you specify more than one IP address and subnet mask foreither the public or the cluster network, the subnets within the networkmust be capable of routing to each other. Additionally, make sure youinclude each IP address/subnet in your IP tables and open ports for themas necessary.
Note
Ceph uses CIDR notation for subnets (e.g., 10.0.0.0/24
).
When you have configured your networks, you may restart your cluster or restarteach daemon. Ceph daemons bind dynamically, so you do not have to restart theentire cluster at once if you change your network configuration.
Public Network
To configure a public network, add the following option to the [global]
section of your Ceph configuration file.
- [global]
- # ... elided configuration
- public network = {public-network/netmask}
Cluster Network
If you declare a cluster network, OSDs will route heartbeat, object replicationand recovery traffic over the cluster network. This may improve performancecompared to using a single network. To configure a cluster network, add thefollowing option to the [global]
section of your Ceph configuration file.
- [global]
- # ... elided configuration
- cluster network = {cluster-network/netmask}
We prefer that the cluster network is NOT reachable from the public networkor the Internet for added security.
Ceph Daemons
The monitor daemons are each configured to bind to a specific IP address. These addresses are normally configured by your deployment tool. Other components in the Ceph system discover the monitors via the mon host
configuration option, normally specified in the [global]
section of the ceph.conf
file.
- [global]
- mon host = 10.0.0.2, 10.0.0.3, 10.0.0.4
The mon host
value can be a list of IP addresses or a name that islooked up via DNS. In the case of a DNS name with multiple A or AAAArecords, all records are probed in order to discover a monitor. Onceone monitor is reached, all other current monitors are discovered, sothe mon host
configuration option only needs to be sufficiently upto date such that a client can reach one monitor that is currently online.
The MGR, OSD, and MDS daemons will bind to any available address anddo not require any special configuration. However, it is possible tospecify a specific IP address for them to bind to with the publicaddr
(and/or, in the case of OSD daemons, the cluster addr
)configuration option. For example,
- [osd.0]
- public addr = {host-public-ip-address}
- cluster addr = {host-cluster-ip-address}
Network Config Settings
Network configuration settings are not required. Ceph assumes a public networkwith all hosts operating on it unless you specifically configure a clusternetwork.
Public Network
The public network configuration allows you specifically define IP addressesand subnets for the public network. You may specifically assign static IPaddresses or override public network
settings using the public addr
setting for a specific daemon.
public network
- Description
The IP address and netmask of the public (front-side) network(e.g.,
192.168.0.0/24
). Set in[global]
. You may specifycomma-separated subnets.Type
{ip-address}/{netmask} [, {ip-address}/{netmask}]
Required
No
Default
- N/A
public addr
- Description
The IP address for the public (front-side) network.Set for each daemon.
Type
IP Address
Required
No
Default
- N/A
Cluster Network
The cluster network configuration allows you to declare a cluster network, andspecifically define IP addresses and subnets for the cluster network. You mayspecifically assign static IP addresses or override cluster network
settings using the cluster addr
setting for specific OSD daemons.
cluster network
- Description
The IP address and netmask of the cluster (back-side) network(e.g.,
10.0.0.0/24
). Set in[global]
. You may specifycomma-separated subnets.Type
{ip-address}/{netmask} [, {ip-address}/{netmask}]
Required
No
Default
- N/A
cluster addr
- Description
The IP address for the cluster (back-side) network.Set for each daemon.
Type
Address
Required
No
Default
- N/A
Bind
Bind settings set the default port ranges Ceph OSD and MDS daemons use. Thedefault range is 6800:7300
. Ensure that your IP Tables configurationallows you to use the configured port range.
You may also enable Ceph daemons to bind to IPv6 addresses instead of IPv4addresses.
ms bind port min
- Description
The minimum port number to which an OSD or MDS daemon will bind.
Type
32-bit Integer
Default
6800
Required
- No
ms bind port max
- Description
The maximum port number to which an OSD or MDS daemon will bind.
Type
32-bit Integer
Default
7300
Required
- No.
ms bind ipv6
- Description
Enables Ceph daemons to bind to IPv6 addresses. Currently themessenger either uses IPv4 or IPv6, but it cannot do both.
Type
Boolean
Default
false
Required
- No
public bind addr
- Description
In some dynamic deployments the Ceph MON daemon might bindto an IP address locally that is different from the
public addr
advertised to other peers in the network. The environment must ensurethat routing rules are set correctly. Ifpublic bind addr
is setthe Ceph MON daemon will bind to it locally and usepublic addr
in the monmaps to advertise its address to peers. This behavior is limitedto the MON daemon.Type
IP Address
Required
No
Default
- N/A
TCP
Ceph disables TCP buffering by default.
ms tcp nodelay
- Description
Ceph enables
ms tcp nodelay
so that each request is sentimmediately (no buffering). Disabling Nagle’s algorithmincreases network traffic, which can introduce latency. If youexperience large numbers of small packets, you may trydisablingms tcp nodelay
.Type
Boolean
Required
No
Default
true
ms tcp rcvbuf
- Description
The size of the socket buffer on the receiving end of a networkconnection. Disable by default.
Type
32-bit Integer
Required
No
Default
0
ms tcp read timeout
- Description
If a client or daemon makes a request to another Ceph daemon anddoes not drop an unused connection, the
ms tcp read timeout
defines the connection as idle after the specified numberof seconds.Type
Unsigned 64-bit Integer
Required
No
Default
900
15 minutes.