[GitHub] spark pull request #19788: [SPARK-9853][Core] Optimize shuffle fetch of cont...

jerryshao Thu, 23 Nov 2017 19:15:52 -0800

Github user jerryshao commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19788#discussion_r152891920
  
    --- Diff: 
core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala ---
    @@ -196,12 +196,14 @@ private[spark] class IndexShuffleBlockResolver(
       override def getBlockData(blockId: ShuffleBlockId): ManagedBuffer = {
         // The block is actually going to be a range of a single map output 
file for this map, so
         // find out the consolidated file, then the offset within that from 
our index
    +    logDebug(s"Fetch block data for $blockId")
         val indexFile = getIndexFile(blockId.shuffleId, blockId.mapId)
     
         val in = new DataInputStream(Files.newInputStream(indexFile.toPath))
         try {
           ByteStreams.skipFully(in, blockId.reduceId * 8)
           val offset = in.readLong()
    +      ByteStreams.skipFully(in, (blockId.length - 1) * 8)
    --- End diff --
    
    Also if length is "1", then this will always be Zero.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #19788: [SPARK-9853][Core] Optimize shuffle fetch of cont...

Reply via email to