Github user jiangxb1987 commented on the issue: https://github.com/apache/spark/pull/20667

If the same `BlockManagerId` is created multiple times, this cache ensures we always use the first instance that was created, which lets the remaining duplicate `BlockManagerId` instances be garbage collected shortly afterwards. The downside is that we have to retain every distinct `BlockManagerId` ever created. Since this code was added a long time ago, and it is hard to measure the performance with and without the cache, we'd like to keep it for now.
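
For readers unfamiliar with the pattern, the trade-off described above can be sketched as a small interning cache. This is only an illustration under stated assumptions, not Spark's actual code: `EndpointId`, `EndpointIdCache`, and `getCached` are hypothetical names, and `EndpointId` stands in for any immutable value class with `equals`/`hashCode`, such as `BlockManagerId`.

```java
import java.util.Objects;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in for BlockManagerId: an immutable value
// whose equality is defined by its fields.
final class EndpointId {
    final String executorId;
    final String host;
    final int port;

    EndpointId(String executorId, String host, int port) {
        this.executorId = executorId;
        this.host = host;
        this.port = port;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof EndpointId)) return false;
        EndpointId other = (EndpointId) o;
        return port == other.port
            && executorId.equals(other.executorId)
            && host.equals(other.host);
    }

    @Override
    public int hashCode() {
        return Objects.hash(executorId, host, port);
    }
}

// Hypothetical interning cache: maps each distinct value to the
// first instance seen with that value.
final class EndpointIdCache {
    private static final ConcurrentHashMap<EndpointId, EndpointId> CACHE =
        new ConcurrentHashMap<>();

    // Returns the first instance registered for an equal id. A later
    // duplicate is never stored, so it becomes unreachable once the
    // caller drops it. Entries are never evicted, so the map grows
    // with the number of distinct ids.
    static EndpointId getCached(EndpointId id) {
        EndpointId previous = CACHE.putIfAbsent(id, id);
        return previous != null ? previous : id;
    }

    static int size() {
        return CACHE.size();
    }
}
```

Passing two equal ids through `getCached` returns the same object both times, and the map holds a single entry. The second instance is not referenced by the cache, which is the memory saving mentioned above; the unbounded growth of the map is the downside.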