Question about GSI and High Availability and USE INDEX in N1QL

atom_yang · August 24, 2016, 1:16pm

Global Secondary Indexes do not provide automatic built-in replicas.So I can place two indexes on two separate nodes (nodeA and nodeB) on the same fields by

     CREATE INDEX Index1_beer_name 
     ON `beer-sample`(name) 
     WHERE type="beer" USING GSI WITH {"nodes":["nodeA:8091"]};
     CREATE INDEX Index2_beer_name 
     ON `beer-sample`(name)
     WHERE type="beer" USING GSI WITH {"nodes":["nodeB:8091"]};

And I also can use USE INDEX in N1QL to hint which index should be used in N1QL.such as

SELECT name FRPM default WHERE type = "beer";

or

SELECT name FRPM default USE INDEX ( Index1_beer_name ) WHERE type = "beer";

or

SELECT name FRPM default USE INDEX ( Index2_beer_name ) WHERE type = "beer";

I have some questions:

If I don’t hint index in N1QL, and assume that this N1QL will use the index that created above,Which Index will be choose by QueryService? And what is the strategy?
If I have 2 Query Services, and I want to use USE INDEX to hint index in N1QL(because my application have several N1QL,and I created a lot of index,If I don’t hint index in N1QL,the query plan might not use th

isha · August 24, 2016, 11:26pm

At the prepare time, query chooses the suitable index. If you’ve given a index name, it will use it. At runtime, we load balance the index scans among the equivalent indexes based on the index definition. In this case both indexes qualify and so it will randomly use one of the 2 indexes (when no index hints are given).
When you use index hints it forces the query to use that index but only if the index qualifies. In the event the index doesn’t exist, you get an error. If the index exists but doesn’t qualify for the given query, it reverts to a primary scan in 4.5.0 and 4.5.1.

geraldss · August 25, 2016, 6:42pm

The index hint does not prevent the indexer from load balancing. And you can put multiple indexes in your hint, including the replicas.

atom_yang · August 25, 2016, 8:54pm

Thank you very much. it is useful.

kstorch · February 1, 2017, 2:06pm

Just to be sure that I understand this right. Given the example above, if I add the replica in the index hint and use the statement

SELECT name FRPM default USE INDEX ( Index1_beer_name, Index2_beer_name ) WHERE type = "beer";

load balancing will work fine, I suppose. But

So if I add the replica in the index hint, I will get an error if nodeA with the Index1_beer_name is down?
If yes: Is there any possibility to use index hints and make use of the index replicas for higher availability? Or is the best practice to do without index hints if availability is important?

geraldss · February 1, 2017, 3:40pm

Hi @kstorch,

For now, you are best off avoiding index hints if you want HA.

The problem is that the indexer currently cannot distinguish between a non-existent index and an unavailable index. Both are reported as errors and the query errors out.

If you avoid index hints, you don’t run into this issue.

kstorch · February 2, 2017, 8:22am

Thanks for the fast and helpful answer, even if the content is not really good news for us. Are there any plans to make GSI more usable in situations where availability is relevant in the foreseeable future?

The need of different names for index replicas on other nodes means our application must have quite detailed information on the Couchbase infrastructure and work with it quite deep in the application logic. At least if we want/must use index hints or otherwise refer to specific indices. The CREATE INDEX syntax WITH {“nodes”:[“nodename”]} suggests that someone thought about the possibility of creating an index with replicas under the same name on several nodes. That would be great, as the index replication could be transparent to the application. Is this intuitively expected feature postponed, dropped or not planned at all?

geraldss · February 2, 2017, 3:20pm

Copying colleagues @prasad @siri @jliang @keshav_m

Siri · February 2, 2017, 5:22pm

Yes, your observation is correct! We intend to provide easier methods to create and manage index replicas which is the reason it is an array. It should be available very soon. Until that point, as @geraldss indicates, it is best to avoid hints in HA situations.

synesty · May 15, 2017, 9:32am

Any news on that?

We would like to use GSI Index Replication on multiple nodes (for HA), but would be able to use index hints in some queries (because they are faster with index hint in some specific cases).

Our wishlist would be:

easier way replicate GSI indexes on multiple notes without having to worry about unique index names
a way to use USE INDEX hints in N1QL queries without having to worry about on which node the index is.

We are on 4.6. CE.

kstorch · May 15, 2017, 9:37am

I think https://blog.couchbase.com/gsi-replicas-rebalancing/ will answer your questions.

synesty · August 11, 2017, 12:03pm

Thanks. There are also great questions and answers in the comments section of the blog post now.

Topic		Replies	Views
N1ql How to use a GSI when doing a join SQL++ query , n1ql	13	3294	June 7, 2016
Performance N1ql on cluster SQL++	25	3361	December 8, 2016
Create GSI on a Couchbase cluster Node.js SDK n1ql	4	1853	August 6, 2016
Picking correct index when running N1QL queries SQL++	3	1398	March 3, 2017
Can a n1ql query span multiple indexes? Couchbase Server	7	3310	October 28, 2018

Question about GSI and High Availability and USE INDEX in N1QL

Related topics