Running a spark job on local machine and profiler results indicate that
highest time spent in
*sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.* Screenshot of
profiler result can be seen here : https://jpst.it/10i-V

Spark job(program) is performing IO (sc.wholeTextFile method of spark
apis), Reads files from local file system and analyses the text to obtain
tokens.

Any thoughts and suggestions?

Thanks.

Reply via email to