Hi,
We got a 4.5 cluster of about 50 nodes, it is quite full with data. We’re going to replace some core switches in our local network, so connectivity between all nodes will be disrupted for some minutes or hours. We’re afraid the cluster can become unusable, so we’d like to gracefully stop it before we break the network.
Is there a way to stop all nodes “at where they are” so they don’t try to elect new master, failover all others, rebalance etc?
Thanks,
Kirill
Thanks Dave,
We did like you suggested and it generally worked, the cluster has booted up after shutdown and all data seems to be in place.
The only issue we got is some nodes did not terminate for about 20 minutes, they were rendered as “pend” in Web UI and “service couchbase-server status” was showing the service was still running. As our maintenance window was running out we had to force kill such nodes (kill -9 respective processes).
How quick does a node typically shut down (given there’s no requests coming in to it)?
Thanks,
Kirill
is this server shutdown process applicable to stop and start node when Clusters are in K8s Pod’s ?
As I am getting bunch of errors while running this inside POD:
root@-cluster-0001:/# systemctl stop couchbase-server
Failed to connect to bus: No such file or directory
root@-cluster-0001:/# service couchbase-server stop
couchbase-server: unrecognized service
And I am on Ubuntu release:
cat /etc/os-release
NAME=“Ubuntu”
VERSION=“16.04.6 LTS (Xenial Xerus)”
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME=“Ubuntu 16.04.6 LTS”
VERSION_ID=“16.04”
HOME_URL=“http://www.ubuntu.com/”
SUPPORT_URL=“http://help.ubuntu.com/”
BUG_REPORT_URL=“http://bugs.launchpad.net/ubuntu/”
VERSION_CODENAME=xenial
UBUNTU_CODENAME=xenial