Suppose I have a data conversion process that reads from a SQL database and pumps documents into a remote Couchbase 4.0 beta cluster, so the ping time is a bit steep (28 ms, versus under 1 ms if I were local).
Throughput is great when my data generation system is local to the Couchbase cluster. But when I'm pumping data from one network (say, my office network) to another (say, an Azure store), I'm thinking I should probably just pump the data into a local cluster node, then export it and batch-import it into the Azure store.
But I'm wondering if there is something better than either of these two ideas that I haven't considered. The flaw in my backup/restore idea is that I don't think I can run it incrementally. The flaw in my big for-loop of bucket.Upsert() calls is that it's ping-time limited: each document costs a full round trip, so the 28 ms turnaround caps me at roughly 35 upserts a second (1 / 0.028 s ≈ 36).
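For concreteness, here is roughly the shape of my current loop. I'm assuming the Go SDK (gocb v1) since that's what `bucket.Upsert()` comes from; the `rows` reader and the `id`/`body` columns are placeholders for whatever the SQL side actually produces:

```go
package loader

import (
	"database/sql"
	"log"

	"gopkg.in/couchbase/gocb.v1"
)

// Sequential loader: each Upsert blocks on a full network round trip,
// so a 28 ms RTT caps throughput near 1/0.028 ≈ 35 ops/sec no matter
// how fast the SQL side can produce rows.
func loadSequential(bucket *gocb.Bucket, rows *sql.Rows) {
	for rows.Next() {
		var id string
		var body []byte
		if err := rows.Scan(&id, &body); err != nil {
			log.Fatal(err)
		}
		if _, err := bucket.Upsert(id, body, 0); err != nil {
			log.Printf("upsert %s: %v", id, err)
		}
	}
}
```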
I guess I could create a pool of workers and maybe get 4 or 8 times more upserts per second even with the 28 ms ping time, but maybe there's ANOTHER alternative I haven't looked at?
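To sketch what I mean by a worker pool (again assuming gocb v1; the host name, bucket name, worker count, and the fake document generator are all placeholders): each worker keeps one request in flight, so throughput should scale roughly linearly with the pool size until the pipe or the cluster saturates.

```go
package main

import (
	"fmt"
	"log"
	"sync"

	"gopkg.in/couchbase/gocb.v1"
)

func main() {
	// Hypothetical connection details; substitute the real Azure node.
	cluster, err := gocb.Connect("couchbase://azure-node.example.com")
	if err != nil {
		log.Fatal(err)
	}
	bucket, err := cluster.OpenBucket("default", "")
	if err != nil {
		log.Fatal(err)
	}

	type doc struct {
		ID   string
		Body interface{}
	}
	docs := make(chan doc, 256)

	// 8 workers, each blocking on its own round trip, gives roughly
	// 8 * 35 ≈ 280 upserts/sec at a 28 ms RTT.
	const workers = 8
	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for d := range docs {
				if _, err := bucket.Upsert(d.ID, d.Body, 0); err != nil {
					log.Printf("upsert %s: %v", d.ID, err)
				}
			}
		}()
	}

	// Stand-in for the SQL reader: generate some fake documents.
	for i := 0; i < 1000; i++ {
		docs <- doc{ID: fmt.Sprintf("doc-%d", i), Body: map[string]int{"n": i}}
	}
	close(docs)
	wg.Wait()
}
```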