Hi, is it a good idea to have 2 threads in pyspark worker? - main thread responsible for receive and send data over socket while the other thread is calling user functions to process data?
since CPU is idle (?) during network I/O, this should improve concurrency quite a bit. can expert answer the question? what are the pros and cons here? thanks, Renyi.