This tutorial involves creating a new sharded cluster that consists of a mongos, the config server replica set, and two shard replica sets.
Each member of a sharded cluster must be able to connect to all other members in the cluster. This includes all shards and config servers. Ensure that network and security systems, including all interfaces and firewalls, allow these connections.
Tip
When possible, use a logical DNS hostname instead of an IP address, particularly when configuring replica set members or sharded cluster members. Logical DNS hostnames avoid configuration changes when IP addresses change.
If you use either localhost or its IP address as the hostname portion of any host identifier, you must use that identifier as the host setting for any other MongoDB component in the cluster.
For example, the sh.addShard() method takes a host parameter for the hostname of the target shard. If you set host to localhost, you must then use localhost as the host for all other shards in the cluster.
This tutorial does not include the required steps for configuring Internal/Membership Authentication or Role-Based Access Control.
In production environments, sharded clusters should employ at minimum x.509 security for internal authentication and client access.
The following steps deploy a config server replica set.
For a production deployment, deploy a config server replica set with at least three members. For testing purposes, you can create a single-member replica set.
Note
The config server replica set must not use the same name as any of the shard replica sets.
For this tutorial, the config server replica set members are associated with the following hosts:
Config Server Replica Set Member | Hostname |
---|---|
Member 0 | cfg1.example.net |
Member 1 | cfg2.example.net |
Member 2 | cfg3.example.net |
When starting each mongod, specify the mongod settings either via a configuration file or the command line.
If using a configuration file, set:
- sharding.clusterRole to configsvr,
- replication.replSetName to the desired name of the config server replica set,
- net.bindIp to the hostname/IP address or comma-delimited list of hostnames or IP addresses that remote clients (including the other members of the config server replica set as well as other members of the sharded cluster) can use to connect to the instance,
Warning
Before binding to a non-localhost (e.g. publicly accessible) IP address, ensure you have secured your cluster from unauthorized access. For a complete list of security recommendations, see Security Checklist. At minimum, consider enabling authentication and hardening network infrastructure.
- other settings as appropriate to your deployment, such as storage.dbPath and net.port. For more information on the configuration file, see configuration options.
Start the mongod with the --config option set to the configuration file path.
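If using the command line options, start the mongod with the --configsvr, --replSet, --bind_ip, and other options as appropriate to your deployment. For example:
mongod --configsvr --replSet <replSetName> --dbpath <path> --bind_ip localhost,<hostname(s)|ip address(es)>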
Warning
Before binding to a non-localhost (e.g. publicly accessible) IP address, ensure you have secured your cluster from unauthorized access. For a complete list of security recommendations, see Security Checklist. At minimum, consider enabling authentication and hardening network infrastructure.
For more information on startup parameters, see the mongod reference page.
From the mongo shell, run the rs.initiate() method.
rs.initiate() can take an optional replica set configuration document. In the replica set configuration document, include:
- The _id set to the replica set name specified in either the replication.replSetName or the --replSet option.
- The configsvr field set to true for the config server replica set.
- The members array with a document for each member of the replica set.
Important
Run rs.initiate() on one and only one mongod instance for the replica set.
See Replica Set Configuration for more information on replica set configuration documents.
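As a minimal sketch of the configuration document described above, the following assumes a config server replica set named configReplSet (a hypothetical name) and members listening on port 27019 (an assumption; use the port you configured). The hostnames are the ones listed in the table earlier. The typeof guard only lets the snippet load without error outside the mongo shell; inside the shell the call runs as usual.

```javascript
// Replica set configuration document for the config server replica set.
// "configReplSet" and port 27019 are assumptions for illustration only.
var csrsConfig = {
  _id: "configReplSet",   // must match replication.replSetName or --replSet
  configsvr: true,        // marks this as the config server replica set
  members: [
    { _id: 0, host: "cfg1.example.net:27019" },
    { _id: 1, host: "cfg2.example.net:27019" },
    { _id: 2, host: "cfg3.example.net:27019" }
  ]
};

// Run this on exactly one member. rs exists only inside the mongo shell.
if (typeof rs !== "undefined") {
  printjson(rs.initiate(csrsConfig));
}
```

After the call returns, rs.status() in the same shell can be used to check which member became primary.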
Once the config server replica set (CSRS) is initiated and up, proceed to creating the shard replica sets.
For a production deployment, use a replica set with at least three members. For testing purposes, you can create a single-member replica set.
Note
Shard replica sets must not use the same name as the config server replica set.
For each shard, use the following steps to create the shard replica set:
When starting each mongod, specify the mongod settings either via a configuration file or the command line.
If using a configuration file, set:
- replication.replSetName to the desired name of the replica set,
- sharding.clusterRole to shardsvr,
- net.bindIp to the IP address or a comma-delimited list of IP addresses that remote clients (including the other members of the config server replica set as well as other members of the sharded cluster) can use to connect to the instance,
Warning
Before binding to a non-localhost (e.g. publicly accessible) IP address, ensure you have secured your cluster from unauthorized access. For a complete list of security recommendations, see Security Checklist. At minimum, consider enabling authentication and hardening network infrastructure.
- other settings as appropriate to your deployment, such as storage.dbPath and net.port. For more information on the configuration file, see configuration options.
Start the mongod with the --config option set to the configuration file path.
From the mongo shell, run the rs.initiate() method.
rs.initiate() can take an optional replica set configuration document. In the replica set configuration document, include:
- The _id field set to the replica set name specified in either the replication.replSetName or the --replSet option.
- The members array with a document for each member of the replica set.
The following example initiates a three-member replica set.
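A minimal sketch of such a call, assuming a shard replica set named shard1ReplSet, hosts s1-mongo1 through s1-mongo3.example.net, and port 27018. All of these names and the port are hypothetical placeholders, not values from this tutorial. The typeof guard only lets the snippet load outside the mongo shell.

```javascript
// Replica set configuration document for one shard.
// The set name, hostnames, and port below are illustrative assumptions.
var shard1Config = {
  _id: "shard1ReplSet",   // must match replication.replSetName or --replSet
  members: [
    { _id: 0, host: "s1-mongo1.example.net:27018" },
    { _id: 1, host: "s1-mongo2.example.net:27018" },
    { _id: 2, host: "s1-mongo3.example.net:27018" }
  ]
};

// Run this on exactly one member of the shard replica set.
if (typeof rs !== "undefined") {
  printjson(rs.initiate(shard1Config));
}
```

Unlike the config server document, this one has no configsvr field; the shard role comes from the sharding.clusterRole setting of each mongod.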
Important
Run rs.initiate() on one and only one mongod instance for the replica set.
Start a mongos for the Sharded Cluster
Start a mongos using either a configuration file or a command line parameter to specify the config servers.
If using a configuration file, set sharding.configDB to the config server replica set name and at least one member of the replica set in <replSetName>/<host:port> format.
Warning
Before binding to a non-localhost (e.g. publicly accessible) IP address, ensure you have secured your cluster from unauthorized access. For a complete list of security recommendations, see Security Checklist. At minimum, consider enabling authentication and hardening network infrastructure.
Start the mongos, specifying the --config option and the path to the configuration file.
For more information on the configuration file, see configuration options.
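If using command line parameters, start the mongos and specify the --configdb, --bind_ip, and other options as appropriate to your deployment. For example:
mongos --configdb <replSetName>/<host:port>,<host:port> --bind_ip localhost,<hostname(s)|ip address(es)>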
Warning
Before binding to a non-localhost (e.g. publicly accessible) IP address, ensure you have secured your cluster from unauthorized access. For a complete list of security recommendations, see Security Checklist. At minimum, consider enabling authentication and hardening network infrastructure.
Include any other options as appropriate for your deployment.
At this point, your sharded cluster consists of the mongos and the config servers. You can now connect to the sharded cluster using the mongo shell.
Connect a mongo shell to the mongos. Specify the host and port on which the mongos is running:
mongo --host <hostname> --port <port>
Once you have connected the mongo shell to the mongos, continue to the next procedure to add shards to the cluster.
In the mongo shell connected to the mongos, use the sh.addShard() method to add each shard to the cluster.
The following operation adds a single shard replica set to the cluster:
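A minimal sketch, assuming a shard replica set named shard1ReplSet with three members on port 27018. The set name, hostnames, port, and the shardConnectionString helper are all hypothetical and exist only for illustration. sh.addShard() takes the shard as a single string in <replSetName>/<host:port>,<host:port> form.

```javascript
// Hypothetical helper: joins a replica set name and its member addresses
// into the "<replSetName>/<host:port>,<host:port>" string sh.addShard() expects.
function shardConnectionString(replSetName, hosts) {
  return replSetName + "/" + hosts.join(",");
}

// Illustrative values; replace with your own shard replica set.
var shard1 = shardConnectionString("shard1ReplSet", [
  "s1-mongo1.example.net:27018",
  "s1-mongo2.example.net:27018",
  "s1-mongo3.example.net:27018"
]);

// sh exists only inside a mongo shell; connect that shell to the mongos first.
if (typeof sh !== "undefined") {
  printjson(sh.addShard(shard1));
}
```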
Repeat these steps until the cluster includes all desired shards.
Before you can shard a collection, you must enable sharding for the collection’s database. Enabling sharding for a database does not redistribute data but makes it possible to shard the collections in that database.
From the mongo shell connected to the mongos, use the sh.enableSharding() method to enable sharding on the target database. Enabling sharding on a database makes it possible to shard collections within that database.
Once you enable sharding for a database, MongoDB assigns a primary shard for that database where MongoDB stores all data in that database.
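As a minimal sketch, the following enables sharding on a database named records. The database name is a hypothetical placeholder; sh.enableSharding() takes the database name as a string.

```javascript
// Hypothetical database name used for illustration.
var targetDatabase = "records";

// sh exists only inside a mongo shell; connect that shell to the mongos first.
if (typeof sh !== "undefined") {
  printjson(sh.enableSharding(targetDatabase));
}
```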
Important
Before you can shard a collection you must first enable sharding for the database where the collection resides.
To shard a collection, connect to the mongos from the mongo shell and use the sh.shardCollection() method.
Sharding and Indexes
If the collection already contains data, you must create an index that supports the shard key before sharding the collection. If the collection is empty, MongoDB creates the index as part of sh.shardCollection().
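MongoDB provides two strategies to shard collections:
- Hashed sharding partitions data by a hash of the shard key value.
- Ranged sharding partitions data into contiguous ranges of shard key values.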
Your selection of shard key affects the efficiency of sharding, as well as your ability to take advantage of certain sharding features such as zones. To learn how to choose an effective shard key, see Choosing a Shard Key.
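The two strategies differ only in the shard key document passed to sh.shardCollection(). The sketch below assumes sharding is already enabled on a database named records; the collection names and field names are hypothetical and chosen only to illustrate the shape of each call.

```javascript
// Hashed sharding: the key field maps to the string "hashed".
// Namespace and field name are illustrative assumptions.
var hashedTarget = { ns: "records.users", key: { userId: "hashed" } };

// Ranged sharding: the key fields map to 1, as in an ascending index.
var rangedTarget = {
  ns: "records.orders",
  key: { customerId: 1, orderDate: 1 }
};

// sh exists only inside a mongo shell; connect that shell to the mongos first.
if (typeof sh !== "undefined") {
  printjson(sh.shardCollection(hashedTarget.ns, hashedTarget.key));
  printjson(sh.shardCollection(rangedTarget.ns, rangedTarget.key));
}
```

The first argument is the full namespace in <database>.<collection> form, and the second is the shard key document.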
Starting in version 4.0, the mongo shell provides the method convertShardKeyToHashed(). This method uses the same hashing function as the hashed index and can be used to see what the hashed value would be for a key.
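A brief sketch of calling the method on a single key value. The sample value is hypothetical, and the guard simply leaves the result as null when the snippet is loaded outside a mongo shell that provides convertShardKeyToHashed().

```javascript
// Hypothetical shard key value to inspect.
var sampleKey = "exampleUser42";

// Stays null unless the mongo shell helper is available.
var hashedValue = null;
if (typeof convertShardKeyToHashed !== "undefined") {
  hashedValue = convertShardKeyToHashed(sampleKey);
  print("hashed value: " + hashedValue);
}
```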