rrusso2007 opened a new pull request #24672: [SPARK-27801] 
InMemoryFileIndex.listLeafFiles should use listLocatedStatus for 
DistributedFileSystem
URL: https://github.com/apache/spark/pull/24672
 
 
   ## What changes were proposed in this pull request?
   
   InMemoryFileIndex.listLeafFiles should use listLocatedStatus for 
DistributedFileSystem. DistributedFileSystem overrides listLocatedStatus so 
that block locations are returned by the same namenode call that lists the 
directory, which saves a separate getBlockLocations call per file (potentially 
thousands of calls).
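
   A minimal sketch of the idea, not the actual diff in this pull request. The helper name `listWithLocations` is hypothetical; the Hadoop calls used (`FileSystem.listLocatedStatus`, `FileSystem.listStatus`) are the public FileSystem API, and the snippet assumes the Hadoop client and HDFS jars are on the classpath:

```scala
import org.apache.hadoop.fs.{FileStatus, FileSystem, Path}
import org.apache.hadoop.hdfs.DistributedFileSystem
import scala.collection.mutable.ArrayBuffer

// Hypothetical helper for illustration only; not the code in this patch.
def listWithLocations(fs: FileSystem, path: Path): Seq[FileStatus] = fs match {
  case _: DistributedFileSystem =>
    // listLocatedStatus returns a RemoteIterator[LocatedFileStatus]; each
    // entry already carries its block locations, so no per-file lookup
    // is needed afterwards.
    val iter = fs.listLocatedStatus(path)
    val buf = ArrayBuffer.empty[FileStatus]
    while (iter.hasNext) buf += iter.next()
    buf.toSeq
  case _ =>
    // Other file systems keep the plain listing; block locations, if
    // needed, would still be fetched per file by the caller.
    fs.listStatus(path).toSeq
}
```

   With this shape, a caller can check whether an entry is a `LocatedFileStatus` and read `getBlockLocations` from it directly, falling back to `fs.getFileBlockLocations(status, 0, status.getLen)` only for plain `FileStatus` entries.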
   
   ## How was this patch tested?
   
   Ran the test suite.
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

