[jira] [Assigned] (SPARK-17839) UnsafeSorterSpillReader should use Nio's directbuffer to read the spill files in order to avoid additional copy

2016-10-09 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-17839:


Assignee: (was: Apache Spark)

> UnsafeSorterSpillReader should use Nio's directbuffer to read the spill files 
> in order to avoid additional copy
> ---
>
> Key: SPARK-17839
> URL: https://issues.apache.org/jira/browse/SPARK-17839
> Project: Spark
>  Issue Type: Improvement
>  Components: Shuffle
>Affects Versions: 2.0.1
>Reporter: Sital Kedia
>Priority: Minor
>
> Currently we use BufferedInputStream to read the shuffle file which copies 
> the file content from os buffer cache to the user buffer. This adds 
> additional latency in reading the spill files. We made a change to use java 
> nio's direct buffer to read the spill files and for certain jobs spilling 
> significant amount of data, we see between 5 - 7% speedup.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-17839) UnsafeSorterSpillReader should use Nio's directbuffer to read the spill files in order to avoid additional copy

2016-10-09 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-17839:


Assignee: Apache Spark

> UnsafeSorterSpillReader should use Nio's directbuffer to read the spill files 
> in order to avoid additional copy
> ---
>
> Key: SPARK-17839
> URL: https://issues.apache.org/jira/browse/SPARK-17839
> Project: Spark
>  Issue Type: Improvement
>  Components: Shuffle
>Affects Versions: 2.0.1
>Reporter: Sital Kedia
>Assignee: Apache Spark
>Priority: Minor
>
> Currently we use BufferedInputStream to read the shuffle file which copies 
> the file content from os buffer cache to the user buffer. This adds 
> additional latency in reading the spill files. We made a change to use java 
> nio's direct buffer to read the spill files and for certain jobs spilling 
> significant amount of data, we see between 5 - 7% speedup.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org