Github user srowen commented on the issue:
https://github.com/apache/spark/pull/21456
OK, that's good evidence this is causing a lot of garbage. Although yes
ideally we figure out just why there are so many of these stream objects, I can
see optimizing this as it is. Yes I understand the intern issue here, but, not
the need for normalization. It sounded like the strings were already mostly
identical?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]