I have a spark code that works well over a sample of data in local mode, but when I pass the same code on a cluster with the entire dataset I receive GC limited exceed error. In that section is possible to submit the code and have some hints in order to solve my problem? Thanks a lot for the attention
Marco