Team,
We are seeing high CPU utilization (peaking at 13-14 of 16 CPUs) and a very high load average (>25) on a few nodes in the cluster, consumed mainly by the indexer service.
Node1: INDEX_1(id, time)
Node2: INDEX_2(id, time) → duplicate/backup index
Peak load average:
top - 23:03:32 up 4 days, 17:38, 2 users, load average: 57.55, 46.01, 28.72
 PID USER     PR NI  VIRT  RES   SHR S %CPU %MEM   TIME+ COMMAND
6982 couchba+ 20  0 11.2g 1.5g 13000 R 1206  9.9 6560:23 indexer
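To put the top figures in context against the 16 cores on the node, here is a minimal Python sketch. It assumes top is running in its default mode, where 100% CPU equals one full core (the 1206 reading suggests this); the variable names are only illustrative.

```python
# Figures copied from the top output above.
CORES = 16
load_1min = 57.55          # 1-minute load average at peak
indexer_cpu_pct = 1206     # %CPU reported for the indexer process

# Load average per core: values well above 1.0 mean runnable work is queuing.
load_per_core = load_1min / CORES

# Assumption: 100% in top corresponds to one fully used core.
indexer_cores = indexer_cpu_pct / 100
indexer_share = indexer_cores / CORES

print(f"load per core: {load_per_core:.2f}")
print(f"indexer uses about {indexer_cores:.1f} of {CORES} cores ({indexer_share:.0%})")
```

By this reading the indexer alone accounts for roughly 12 of the 16 cores at peak, and the run queue is several times deeper than the core count.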
We created the duplicate index to distribute the load, and the load is being distributed, but Node1, where the original index resides, shows high CPU usage during our load run and crashes after a few hours, even though requests/sec are the same across both nodes.
Any thoughts or suggestions would be much appreciated.
Couchbase version: Community Edition 6.0.0 build 1693
Node configuration: 16-core CPU, 16 GB memory, SSD.
Output of the stats API: curl -X GET -u USER:PASSWD "http://localhost:9102/api/v1/stats?pretty=true"
Node 1:
"data_size": -13205983,
"disk_size": 539983872,
"frag_percent": 102,
"items_count": 3742,
"num_docs_indexed": 28115581,
"num_docs_pending": 0,
"num_docs_queued": 39,
"num_requests": 2547447,
"num_rows_returned": 312173501,
"recs_in_mem": 0,
"recs_on_disk": 0,
"resident_percent": 0,
"scan_bytes_read": 20563134922,
"total_scan_duration": 166302472291019
Node 2:
"data_size": -12836315,
"disk_size": 573878272,
"frag_percent": 102,
"items_count": 3743,
"num_docs_indexed": 28121280,
"num_docs_pending": 1,
"num_docs_queued": 17,
"num_requests": 2548545,
"num_rows_returned": 313383368,
"recs_in_mem": 0,
"recs_on_disk": 0,
"resident_percent": 0,
"scan_bytes_read": 20628740525,
"total_scan_duration": 75286299992726
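
For an easier side-by-side comparison, the counters above can be reduced to per-request figures. This is a minimal Python sketch using only the numbers posted here; it assumes total_scan_duration is reported in nanoseconds, which we have not confirmed, and the helper name per_request is just illustrative.

```python
# Counters copied from the stats output above.
stats = {
    "Node 1": {
        "num_requests": 2547447,
        "num_rows_returned": 312173501,
        "total_scan_duration": 166302472291019,
    },
    "Node 2": {
        "num_requests": 2548545,
        "num_rows_returned": 313383368,
        "total_scan_duration": 75286299992726,
    },
}

def per_request(node):
    """Return (average scan time in ms, average rows returned) per request."""
    s = stats[node]
    # Assumption: total_scan_duration is in nanoseconds, so divide by 1e6 for ms.
    avg_ms = s["total_scan_duration"] / s["num_requests"] / 1e6
    avg_rows = s["num_rows_returned"] / s["num_requests"]
    return avg_ms, avg_rows

for node in stats:
    avg_ms, avg_rows = per_request(node)
    print(f"{node}: {avg_ms:.1f} ms per scan, {avg_rows:.1f} rows per scan")
```

Under that assumption, both nodes serve nearly the same number of requests and rows per request, but the average scan on Node 1 takes a little over twice as long as on Node 2, which matches the CPU imbalance we are seeing.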