N1QL for array elements

ankitlalan · February 13, 2020, 4:47am

I have a json response stored to couchbase as document. I’m trying to retrieve the results by ‘businessDate’ and ‘buyerName’ using the index created as follow but unable to retrieve the results.

What is the correct way to perform N1QL query?

CREATE INDEX idx on bucketName(businessDate,buyerName)
My search queries:
select * from bucketName where businessDate=‘2020-02-11’ and buyerName like ‘Lalan%’;
select * from bucketName where buyerName = ‘Lalan Securities Private Limited’;

Secondly, I have to perform search using node.js, do I need to get the document first and then perform search on it or can I make a search within a document? My results are stored in one single array.

Please provide suggestions.

[{
	"bucketName": {
		"data": [{
			"businessDate": "2020-02-11",
			"buyerName": "Lalan Securities Private Limited",
			
		},
		{
			"businessDate": "2020-02-12",
			"buyerName": "Ankit Securities (PVT) Ltd.",
			
		}],
		"timestamp": 1581486674956
	}
}]

ankitlalan · February 13, 2020, 5:10am

I have my answers now.
There are two methods to perform queries.

Using Unnest, and
Using Array

For the above data we can use the query:

select
   d 
from
   bucketName t unnest data as d 
where
   d.buyerName= "Lalan Securities Private Limited" 
   and d.businessDate=" 2020-02-11";

Option 2, using Array:

select
   ARRAY data FOR data in bucketName.data 
   when
      data.buyerName= "Lalan Securities Private Limited" 
      and data.businessDate= "2020-02-11"
      END
      as data 
from
   bucketName

In the above query, we are not first flattening the array through an UNNEST operation. Instead, we are using one of the collection operators to find array items that meet our criteria.

vsr1 · February 13, 2020, 6:29am

CREATE INDEX ix1 ON  bucketName (DISTINCT ARRAY d.businessDate FOR d IN data END);


select
   ARRAY data FOR data in bucketName.data 
   when
      data.buyerName= "Lalan Securities Private Limited" 
      and data.businessDate= "2020-02-11"
      END
      as data 
from
   bucketName
WHERE ANY d IN data SATISFIES d.businessDate= "2020-02-11" AND d.buyerName= "Lalan Securities Private Limited" END;

ankitlalan · February 13, 2020, 8:11am

Is it really needed to put the search arguments twice, first with the WHEN clause and later with the SATISFIES clause? Will not this impact on performance?

Also, I’d be thankful to you if you would help me rewriting the CREATE INDEX query for two arguments. I’ll search based on “buyerName” and “businessDate”. Following query did not result but errored out “index not found.”

CREATE INDEX ix1 ON  bucketName (DISTINCT ARRAY {d.stockSymbol, d.businessDate} FOR d IN data END);

vsr1 · February 13, 2020, 2:21pm

It needs both places. Projection expression starts with ARRAY and ends with END. Where caluse expression starts with ANY and end with END. You can’t use where clause derived expression in projection vice versa.

Whenever you reference data it comes from source i.e. bucket. If you don’t want that way use UNNEST.

The index will not qualify you check this out https://docs.couchbase.com/server/current/n1ql/n1ql-language-reference/indexing-arrays.html.

If you indexed {d.stockSymbol, d.businessDate} then UNNEST where clause or ANY … SATISFIES clause must have
{d.stockSymbol, d.businessDate} = {
“businessDate”: “2020-02-11”,
“buyerName”: “Lalan Securities Private Limited”}

This type of comparison is very hard always not possible. So index one search value and do indexscan and then apply post Fetch other predicates.

ankitlalan · February 14, 2020, 4:32am

Great!
Thank you for the explanation in detail. I have results now.

CREATE INDEX `ix` ON `bucketName`((distinct (array {"businessDate": (`d`.`businessDate`), "stockSymbol": (`d`.`stockSymbol`)} for `d` in `data` end)))

Following query provided result:

    select timestamp,
       ARRAY data FOR data in test.data 
       when
          {data.stockSymbol,data.businessDate}={"stockSymbol":"ADBL","businessDate":"2020-02-11"}
          END
          as data 
    from
       bucketName
    WHERE
       ANY d IN data SATISFIES {d.stockSymbol,d.businessDate}={"stockSymbol":"ADBL","businessDate":"2020-02-11"}
       END;

I’m going to mark this as solved but opening a new question to implement the same query using node.js. Please let me know if you have any solution to it as well.

Thank you.
Ankit Lalan

vsr1 · February 14, 2020, 5:06am

If you are looking equality only

CREATE INDEX `ix` ON `bucketName`((distinct (array  [d.stockSymbol , d.businessDate ] for `d` in `data` end)));
select timestamp,
       ARRAY data FOR data in test.data 
       when  dtata.stockSymbol = "ADBL" AND d.businessDate = "2020-02-11"  END
          AS  data 
    from
       bucketName
    WHERE
       ANY d IN data SATISFIES [d.stockSymbol,d.businessDate] = ["ADBL","2020-02-11"]  END;

Check this post for explanation Creating a covered index with array, how to access array elements in select?
Composite index from an array of objects

ankitlalan · February 14, 2020, 5:17am

That works as well!
Below is the explanation of the search result.

{
	"plan": {
		"#operator": "Sequence",
		"~children": [{
			"#operator": "DistinctScan",
			"scan": {
				"#operator": "IndexScan3",
				"index": "ix1",
				"index_id": "73d87faba62f6479",
				"index_projection": {
					"primary_key": true
				},
				"keyspace": "bucketName",
				"namespace": "default",
				"spans": [{
					"exact": true,
					"range": [{
						"high": "[\"ADBL\", \"2020-02-11\"]",
						"inclusion": 3,
						"low": "[\"ADBL\", \"2020-02-11\"]"
					}]
				}],
				"using": "gsi"
			}
		},
		{
			"#operator": "Fetch",
			"keyspace": "bucketName",
			"namespace": "default"
		},
		{
			"#operator": "Parallel",
			"~child": {
				"#operator": "Sequence",
				"~children": [{
					"#operator": "Filter",
					"condition": "any `d` in (`bucketName`.`data`) satisfies ([(`d`.`stockSymbol`), (`d`.`businessDate`)] = [\"ADBL\", \"2020-02-11\"]) end"
				},
				{
					"#operator": "InitialProject",
					"result_terms": [{
						"expr": "(`bucketName`.`timestamp`)"
					},
					{
						"as": "data",
						"expr": "array `data` for `data` in (`bucketName`.`data`) when (((`data`.`stockSymbol`) = \"ADBL\") and ((`data`.`businessDate`) = \"2020-02-11\")) end"
					}]
				},
				{
					"#operator": "FinalProject"
				}]
			}
		}]
	},
	"text": "select\n   timestamp,\n   ARRAY data FOR data in bucketName.data \n   when\n      data.stockSymbol = \"ADBL\" \n      AND data.businessDate = \"2020-02-11\" \n      END\n      AS data \nfrom\n   bucketName \nWHERE\n   ANY d IN data SATISFIES [d.stockSymbol,d.businessDate] = [\"ADBL\",\"2020-02-11\"] \n   END;"
}

ankitlalan · February 14, 2020, 5:41am

Hello,

When we use N1QL queries with multiple documents in one bucket, how does the search results out? Does it always look for the newly upsert document or results out by scanning all documents and providing relevant result?

I have two documents, my search query will result from both documents or from the last updated one? If latter, what happens with the old documents? How can I traverse to all documents?

Query

select
   timestamp,
   ARRAY data FOR data in test.data 
   when
      data.stockSymbol = "ADBL" 
      AND data.businessDate = "2020-02-11" 
      END
      AS data 
from
   test 
WHERE
   ANY d IN data SATISFIES [d.stockSymbol,d.businessDate] = ["ADBL","2020-02-11"] 
   END;

Actually, I’m from RDBMS background so considering bucket as tables and Documents as rows. Also having a perception that for every subject area, I need to create buckets and upsert to particular bucket only. Am I correct?

vsr1 · February 14, 2020, 12:56pm

From clause has bucket, Every document in the bucket is considered. If you don’t want you can specify where clause filter out.
Index is maintained asynchronously, few ms behind. If you need you can use scan_consistency. You should read the documentation.

Topic		Replies	Views
How to query documents which are arrays SQL++ n1ql	1	820	March 7, 2018
How to get an array of objects using n1ql query SQL++ query , n1ql	1	2659	December 10, 2019
N1QL query for objects inside a object Couchbase Server n1ql	4	194	September 26, 2024
Indexes for n1QL queries using UNNEST and how to make it covered SQL++ n1ql , index	1	1274	June 6, 2018
Query syntax for arrays SQL++	3	3546	February 28, 2017

N1QL for array elements

Related topics