Any Other Options other than Spark IN Query

ranju goel Tue, 22 Jun 2021 04:00:47 -0700

Hi All,

  Please suggest what options Spark offers, other than IN queries, for
fetching data from the database.
When I execute an IN query, all the data is fetched by a single executor into
a single partition, and the load is not distributed to the other executors.
Are there any alternatives to IN that I could use?


Code Snippet

JavaRDD<Column1> theBatchRDD =
    CassandraJavaUtil.javaFunctions(mySession.sparkContext())
        .cassandraTable(keyspace, "table", CassandraJavaUtil.mapRowTo(Column1.class))
        .select("column1")
        .where("key in ?", keyslist);


Configurations:

Total Executors : 3


Regards

User
