CouchBase Node Goes down frequently

Hey Folks,
We have a scenario here… we have 3 couchbase node under the same network, we have seen that one of the node(same everytime) goes down quite often.Checked the logs and we see following logs
1 - Haven’t heard from a higher priority node or a master, so I’m taking over.
2 - Enabled auto-failover with timeout 120 auto_failover 000 ns_xxxxxxxxxxxxxxxxxx
3 - Shutting down bucket “default” on ‘ns_xxxxxxxxxxxxxxx’ for server shutdown ns_memcached 000
4 - Node ‘ns_1@xxxxxxxxxxxxxxx’ saw that node ‘ns_1@xxxxxxxxxxxxxxx’ went down Details: [{nodedown_reason, connection_closed}] …ns_node_disco_005
5 - Service ‘ns_server’ exited with status 1. Restarting. Messages: [os_mon] win32 supervisor port (win32sysinfo): Error writing to pipe {“Kernel pid terminated”,application_controller,"{application_terminated,os_mon,shutdown}"}
Crash dump was written to: erl_crash.dump
Kernel pid terminated (application_controller) ({application_terminated,os_mon,shutdown})

After few seconds log says that it is able to contact the node…

6 - Couchbase Server has started on web port 8091 on node ‘ns_1@XXXXXXXXX’. Version: “4.5.0-2601-enterprise”.
7 - Node ‘ns_1@xxxxxxxxxx’ saw that node ‘ns_1@xxxxxxxxxx’ came up. Tags:
8 - Default,core and mobile buckets were loaded after node was up.

Any suggestions is appreciated…

Regards,
Shan

Unavailability of the VM were CB was installed could be an issue? however i dont see that server went down anytime.

Hi @shantanu_das89,

Are you running 4.5 on Windows 10 with Anniversary Update by any chance? Also, have you tried the latest 4.6 release?

Hi Matthew,
Yes it is 4.5 on windows 2012, no we havn’t tried 4.6 yet i believe it is still in developer preview, keeping that aside can some body tell me about the reason on why this issue has occurred so frequently.

The issue is on production environment which is making it even worse.

shan

@shantanu_das89,

A few things that might help:

a) 4.6 has been released, so you might want to give that a try to see if it helps. There are a number of issues that have been fixed (see 4.6 release notes), but I don’t have enough information to know if any of them apply to your problem.

b) Since you are using Enterprise edition, you may want to contact support for help.

c) One of the most common issues with Windows deployments is “port exhaustion”, so you may want to look into that. That link refers to BizTalk, but it can apply to other applications, including Couchbase.