Github user jerryshao commented on the issue:
https://github.com/apache/spark/pull/19184
After discussing this offline with @mridulm: the patch here cannot address
the issue in `getSortedIterator`, which uses a PriorityQueue, but it does
solve the problem for `getIterator(...)`, which doesn't require a merge. In
this specific case, `ExternalAppendOnlyUnsafeRowArray` uses `getIterator`
and runs into the "too many open files" issue. So this fix solves the
problem for `getIterator` and does no harm to `getSortedIterator`. Maybe we
could accept it as a partial/point fix here.
What do you think, @viirya @kiszk @maropu @jiangxb1987?
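
To illustrate the difference, here is a minimal, self-contained sketch. It is not the Spark code: `LazyReader`, `readChained`, `readMerged` and the counters are hypothetical names, and it assumes the fix works by deferring the open of each spill file until the reader is first used. Under that assumption, a chained read keeps one reader open at a time, while a PriorityQueue-based merge must pull one record from every reader up front, so all of them end up open together.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.PriorityQueue;

class SpillReaderSketch {
    // Counters standing in for open file descriptors (illustration only).
    static int openNow = 0;
    static int peakOpen = 0;

    static void resetCounters() { openNow = 0; peakOpen = 0; }

    // Hypothetical stand-in for a spill-file reader that opens lazily.
    static final class LazyReader {
        private final int[] records;
        private int pos = 0;
        private boolean opened = false;
        private boolean closed = false;

        LazyReader(int... sortedRecords) { this.records = sortedRecords; }

        boolean hasNext() {
            if (closed) return false;
            if (!opened) {                 // "open the file" on first use only
                opened = true;
                openNow++;
                peakOpen = Math.max(peakOpen, openNow);
            }
            if (pos < records.length) return true;
            closed = true;                 // exhausted: "close the file"
            openNow--;
            return false;
        }

        int next() {
            if (!hasNext()) throw new NoSuchElementException();
            return records[pos++];
        }
    }

    // Pairs the current head record of a reader with the reader itself.
    static final class Head {
        final int value;
        final LazyReader reader;
        Head(int value, LazyReader reader) { this.value = value; this.reader = reader; }
    }

    static List<LazyReader> sampleReaders() {
        return Arrays.asList(new LazyReader(1, 4), new LazyReader(2, 5), new LazyReader(3, 6));
    }

    // getIterator-style read: drain the readers one after another, no merge.
    static List<Integer> readChained(List<LazyReader> readers) {
        List<Integer> out = new ArrayList<>();
        for (LazyReader r : readers) {
            while (r.hasNext()) out.add(r.next());
        }
        return out;
    }

    // getSortedIterator-style read: merge via a PriorityQueue of head records.
    static List<Integer> readMerged(List<LazyReader> readers) {
        PriorityQueue<Head> pq = new PriorityQueue<>((a, b) -> Integer.compare(a.value, b.value));
        for (LazyReader r : readers) {     // priming touches every reader
            if (r.hasNext()) pq.add(new Head(r.next(), r));
        }
        List<Integer> out = new ArrayList<>();
        while (!pq.isEmpty()) {
            Head h = pq.poll();
            out.add(h.value);
            if (h.reader.hasNext()) pq.add(new Head(h.reader.next(), h.reader));
        }
        return out;
    }

    public static void main(String[] args) {
        resetCounters();
        System.out.println(readChained(sampleReaders()) + " peak open = " + peakOpen);
        resetCounters();
        System.out.println(readMerged(sampleReaders()) + " peak open = " + peakOpen);
    }
}
```

With the three sample readers, the chained read peaks at one open reader and the merged read peaks at three, which is why lazy opening alone would help `getIterator` but not `getSortedIterator`.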