Hi All,

  Please suggest what are the other possible options in Spark other than IN
Queries for fetching the data from db.
If I am executing IN Query , all data fetched to single executor in single
partition and load does not distribute to other executors.
Please suggest are there other possibilities to use other than IN.

Code Snippet

JavaRDD<Column1> theBatchRDD =
CassandraJavaUtil.*javaFunctions*(*mySession*.sparkContext())


                        .cassandraTable(keyspace,”table” ,
CassandraJavaUtil.*mapRowTo*(Column1.*class*)).select("column1").where("key
in ?", keyslist);


Configurations:

Total Executors : 3


Regards

User

Reply via email to