We are frequently seeing the indexer restart, to the point that the process becomes unavailable, all indexes start showing as stale, and eventually the node fails over.
Sharing a few of the errors:
1st Type of Error:
9,135729, 3837 committed:true
2022-06-29T14:05:23.834+00:00 [Info] StorageMgr::handleCreateSnapshot Added New Snapshot Index: 13519174020281430894 PartitionId: 0 SliceId: 0 Crc64: 9751788094360975075 (SnapshotInfo: seqnos: 135729, 135729, 3837 committed:true) SnapType FORCE_COMMIT SnapAligned true SnapCreateDur 11.26834ms SnapOpenDur 39.69us
fatal error: unexpected signal during runtime execution
[signal SIGSEGV: segmentation violation code=0x1 addr=0x20 pc=0x7f01£888¢73b]

runtime stack:
runtime.throw(0x12ad0€5, 0x2a)
    /home/couchbase/.cbdepscache/exploded/x86_64/go-1.16.5/go/src/runtime/panic.go:1117 +0x72
runtime.sigpanic()
    /home/couchbase/.cbdepscache/exploded/x86_64/go-1.16.5/go/src/runtime/signal_unix.go:718 +0x2e5

goroutine 116565 [syscall]:
runtime.cgocall(0xfe4e62, 0xc002639768, 0xc002639700)
    /home/couchbase/.cbdepscache/exploded/x86_64/go-1.16.5/go/src/runtime/cgocall.go:154 +0x56 fp=0xc002639738 sp=0xc002639700 pc=0x409796
2nd Type of Error:
inioscancoordinator::nandleAdaindexinstanced+xclu48ubalscancoordinator(4/5260y4/1360y/513/+401690043455/65/685navik
efnum index forestdb navik 2c6cf1c29dlale89ffclf0f14791519d false (‘reference number"] N1QL SINGLE
“document type"awb generated”)
[false] false false
10.128.0.61:80911 false false 0 false 0 (true 0 0 0)
default
default 0 0 false 0 0 0 0 0 0 0 O lI (I O 0) 4 0 1 0x0036c6ac0 I] 0 0 false forestab
map[0: (10 0 1:91051} 0x0003783850 1 0x005646800 0xc0036d€2c0)
0?
2022-06-29T11:39:45.241+00:00 [Info] Indexer::initPartnInstance Initialized Partition:
Index: 18327922870596519179 Partition: PartitionId: 0 Endpoints: [:9105]
2022-06-29T11:39:45.241+00:00 [INFO][FDB] Forestdb opened database file /opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] doc_length body checksum mismatch error in a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7' crc 6c != 39 (crc in doc) keylen 14128 metalen 12850 bodylen 875771186 bodylen_ondisk 859256632 offset 32643467
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] Error in reading a stale region info document from a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7': revnum 2, offset 32643467
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] doc_length body checksum mismatch error in a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7' crc e9 != 30 (crc in doc) keylen 12322 metalen 11298 bodylen 740438050 bodylen_ondisk 740438050 offset 32874935
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] Error in reading a stale region info document from a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7': revnum 3, offset 32874935
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] doc_length body checksum mismatch error in a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7' crc 9 != 37 (crc in doc) keylen 14646 metalen 11313 bodylen 825833009 bodylen_ondisk 861613149 offset 33075307
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] Error in reading a stale region info document from a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7': revnum 4, offset 33075307
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] doc_length body checksum mismatch error in a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7' crc 71 != 33 (crc in doc) keylen 13104 metalen 13875 bodylen 741882160 bodylen_ondisk 959788852 offset 33276011
2022-06-29T11:39:45.242+00:00 [ERRO][FDB] Error in reading a stale region info document from a database file '/opt/couchbase/var/lib/couchbase/data/@2i/navik_navik_phone_filter_index_1_18327922870596519179_0.index/data.fdb.7': revnum 5, offset 33276011
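To make the checksum mismatch entries easier to compare, here is a minimal Python sketch that pulls the numeric fields out of one such line. The helper name `parse_mismatch` and the regular expression are my own assumptions based only on the lines pasted above, not anything provided by ForestDB or Couchbase, and the sample string is a shortened copy of one of the entries.

```python
import re

# Pattern for the numeric fields of a checksum-mismatch entry.
# "bodylen[_ ]ondisk" accepts either separator, because the pasted
# logs may have lost underscores. (Assumed format, derived from the
# log lines in this post only.)
FIELDS = re.compile(
    r"crc (?P<crc>[0-9a-f]+) != (?P<crc_in_doc>[0-9a-f]+) \(crc in doc\) "
    r"keylen (?P<keylen>\d+) metalen (?P<metalen>\d+) "
    r"bodylen (?P<bodylen>\d+) bodylen[_ ]ondisk (?P<bodylen_ondisk>\d+)\s+"
    r"offset (?P<offset>\d+)"
)

def parse_mismatch(line):
    """Return the fields of one mismatch line as a dict, or None if no match."""
    match = FIELDS.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    # The crc values are hex strings; the remaining fields are decimal integers.
    for name in ("keylen", "metalen", "bodylen", "bodylen_ondisk", "offset"):
        fields[name] = int(fields[name])
    return fields

# Shortened copy of one entry from the log above (path omitted).
sample = (
    "[ERRO][FDB] doc_length body checksum mismatch error in a database file "
    "'...' crc e9 != 30 (crc in doc) keylen 12322 metalen 11298 "
    "bodylen 740438050 bodylen ondisk 740438050 offset 32874935"
)

print(parse_mismatch(sample))
```

Running this over every mismatch line would give a small table of offsets and lengths per entry, which may be easier to share than the raw log.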
3rd Type of Error:
101 115 116 97 109 112 0 0
6 50 48 50 50 45 48 54 45 50 57 84 49 48 58 53 56 58 51 53 46 54 49
50 45 48 54
B 240671467) </ud> in Slice: 0. Error: Encoded secondary key is too long (> 13824). Skipped.
Previously we had indexer.settings.max_array_seckey_size=10024. After changing it to 51200 we still face this issue, and some docs are still being skipped from the index.
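In case it helps with reading the 3rd error: the decimal values printed before the message look like raw bytes of the skipped entry. Below is a minimal Python sketch for turning them back into text, assuming they are space-separated decimal byte values. `decode_dump` is just an illustrative helper name, and only the three fragments visible in the log above are used.

```python
def decode_dump(text):
    """Turn space-separated decimal byte values into a printable string."""
    raw = bytes(int(token) for token in text.split())
    # Show non-printable bytes as '.' so control characters remain visible.
    return "".join(chr(b) if 32 <= b < 127 else "." for b in raw)

# The three fragments visible in the 3rd error above.
dump_lines = [
    "101 115 116 97 109 112 0 0",
    "6 50 48 50 50 45 48 54 45 50 57 84 49 48 58 53 56 58 51 53 46 54 49",
    "50 45 48 54",
]

for line in dump_lines:
    print(decode_dump(line))
```

For these fragments the decoded text contains `estamp` and `2022-06-29T10:58:35.61`, so the visible part of the dump appears to include a timestamp value.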