The Couchbase Hadoop connector imports only ~1.8 million records into HDFS, whereas the bucket holds ~2.25 million records.
The Sqoop job completes successfully and reports no failure. The only error message we can find in the logs is: "ERROR vbucket.VBucketNodeLocator: Critical reconfiguration error: Server list from Configuration and Nodes are out of synch. causing serverName1:11210 to be removed". The surrounding log output is:
16/06/13 11:10:04 INFO client.CouchbaseClient: CouchbaseConnectionFactory{bucket=‘bucketName’, nodes=[http://serverName:8091/pools/], order=RANDOM, opTimeout=2500, opQueue=16384, opQueueBlockTime=10000, obsPollInt=10, obsPollMax=500, obsTimeout=5000, viewConns=10, viewTimeout=75000, viewWorkers=1, configCheck=10, reconnectInt=1100, failureMode=Redistribute, hashAlgo=NATIVE_HASH, authWaitTime=2500}
16/06/13 11:10:04 INFO client.CouchbaseClient: viewmode property isn’t defined. Setting viewmode to production mode
16/06/13 11:10:04 INFO client.CouchbaseConnection: Shut down Couchbase client
16/06/13 11:10:04 INFO client.ViewConnection: I/O reactor terminated
16/06/13 11:10:04 WARN split.JobSplitWriter: Max block location exceeded for split: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 splitsize: 103 maxsize: 10
Environment: CDH 5.5.x, Couchbase 3.1 (5-node cluster with 2 replicas), Couchbase Hadoop plugin.
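
As a rough sanity check on the numbers above, the sketch below compares the missing share of records with the share a single node would hold in a 5-node cluster. It assumes active vBuckets are spread evenly across nodes, which is only an approximation, and the variable names are illustrative. A match would be consistent with one node's data being skipped (for example, the node removed in the error message), but it does not prove that this is the cause.

```python
# Approximate counts taken from the description above.
bucket_records = 2_250_000
imported_records = 1_800_000
nodes = 5

imported_fraction = imported_records / bucket_records
missing_fraction = 1 - imported_fraction

# Assumption: active vBuckets are evenly distributed,
# so each node serves roughly 1/nodes of the active data.
per_node_share = 1 / nodes

print(f"imported: {imported_fraction:.0%}")
print(f"missing: {missing_fraction:.0%}")
print(f"share held by one node: {per_node_share:.0%}")
```

With these approximate counts, the missing share and the single-node share come out the same, so checking whether serverName1 contributed any records to the import may be a useful next step.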