Hello Team, in spark DAG UI , we have Stages tab. Once you click on each stage you can view the tasks.
In each task we have a column "ShuffleWrite Size/Records " that column prints wrong data when it gets the data from cache/persist . it typically will show the wrong record number though the data size is correct for e.g 3.2G/ 7400 which is wrong . please advise.
