Unable to Count the documents using spark structured streaming

vinaya · April 14, 2020, 9:17am

I am trying to use Couchbase as the streaming source for Spark Structured Streaming, using the Spark connector:
val records = spark.readStream
  .format("com.couchbase.spark.sql")
  .schema(schema)
  .load()

And I have this query:
records
  .groupBy("type")
  .count()
  .writeStream
  .outputMode("complete")
  .format("console")
  .start()
  .awaitTermination()

This query does not give the correct output. The output table comes back empty:

Batch: 0

20/04/14 14:28:00 INFO CodeGenerator: Code generated in 10.538654 ms
20/04/14 14:28:00 INFO WriteToDataSourceV2Exec: Data source writer org.apache.spark.sql.execution.streaming.sources.MicroBatchWriter@17fe0ec7 committed.
+--------+-----+
|type    |count|
+--------+-----+
+--------+-----+

However, if I use the connector to fetch the documents without streaming, like
val cdr = spark.read.couchbase(EqualTo("type", "cdr"))
then cdr.count() gives the correct output (count = 28).

Please let me know why this is not working with structured streaming.
