Connection timeouts during statistics

markgaudreau · September 11, 2014, 6:00pm

Hi,

Every few minutes, I get connection timeouts on my couchbase installation. The timeouts are very regular (exactly every two minutes) and seem to happen during some statistics operation (I found this looking at the /opt/couchbase/var/lib/couchbase/logs/stats logfile).

Is this a known issue? Is there anyway to juste disable the statistics to see if it corrects to problem?

Thank you!

ingenthr · October 8, 2014, 12:14am

What kind of statistics operations? Do you mean stats in the Couchbase Web Console?

markgaudreau · October 8, 2014, 1:01pm

Actually, I don’t know exactly what the server is doing at the moment of the timeouts. But, as I said, the timeouts occur exactly when statistics are logged in “/opt/couchbase/var/lib/couchbase/logs/stats”. I get something like :

[ns_doctor:debug,2014-10-06T15:23:39.054,ns_1@:ns_doctor<0.15098.40
4>:ns_doctor:handle_info:167]Current node statuses:
[{'ns_1@,
[{last_heard,{1412,623414,52371}},
{outgoing_replications_safeness_level,
[{,green},
[…]

and the timeouts at the exact same time. I upgraded the servers where I have these problems (more ram and cpus) and the problems are a lot less frequent. But I still get timeouts every once in a while.

I figured that if I had a way to disabled these statistics (that I don’t use), maybe the timeouts would completely disappear or I would at least have something to work on.

Also, I have these every once in a while in the same logfile :

[stats:warn,2014-10-06T15:25:25.680,ns_1@:<0.26253.449>:stats_colle
ctor:latest_tick:240]Dropped 1 ticks

Maybe that’s completely normal though…

misttar · January 15, 2015, 9:22pm

@markgaudreau Did you ever figure out a solution to this problem, other then upgrade the ram/cpu of your server? It appears I am having the exact same problem.

markgaudreau · January 16, 2015, 1:22pm

Actually, upgrading RAM/CPU and reinstalling everything (after the upgrade) solved everything for me. I don’t have these problems anymore.

misttar · January 28, 2015, 5:46pm

Upgrade our nodes from m3.medmium (1 vCPU) to c3.xlarge (4 vCPU) and the issue went away as well. Our average response time is now 10-20ms.

My take away from this:

Couchbase doesn’t do well with a small number of vCPU.

SureshJoshi · July 16, 2015, 3:01pm

I think I’m currently hitting this problem. It manifests as Cloudflare giving me a 524 error, but after a LOT of digging (weeks), I found that every 2 minutes, CB hits 100% CPU for about 2 seconds - during that 2 seconds, any requests get a timeout.

Is it just increasing the number of CPUs? Could it be anything else?

EDIT: Seems to be coming from Beam.smp specifically

misttar · July 16, 2015, 5:17pm

After digging in, we found two things that were contributing. View indexing and Stats collection. If you are seeing spikes every 2m exactly, it is almost certainly the stats collection.

For our situation, increasing the number CPUs was all it took.

SureshJoshi · July 17, 2015, 12:21am

Okay, I’ve doubled the number of cores, so here’s hoping!

Thanks!

Topic		Replies	Views
Couchbase stats timeouts Couchbase Server	0	557	January 20, 2022
Statistics gathering interval Couchbase Server	1	539	March 24, 2020
Intermittent time-outs (client 2.0.5, server 2.2.0 community edition) Node.js SDK	2	3094	May 19, 2015
Couchbase server throwing timeout after server is idle for a few hours Couchbase Server	2	2044	March 9, 2016
Connection Timeouts Node.js SDK	0	1949	September 17, 2014

Connection timeouts during statistics

Related topics