I am launching Spark-Kafka Streaming with 4 artists (1 core each). And the theme of the kafka theme had 50 sections.
In the pre-sale part of the streaming java program, I connect to the oracle and do some work. Apache DBCP2 is used for connection pooling.
Spark-streaming program performs 4 database connections - there may be 1 for each artist. But My Expectation - since there are 50 partitions, 50 threads must be executed and there are 50 database connections.
How to increase parallelism without increasing the number of cores.
source
share