[jira] [Assigned] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-11-04 Thread Davies Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Davies Liu reassigned SPARK-7542:
-

Assignee: Davies Liu

> Support off-heap sort buffer in UnsafeExternalSorter
> 
>
> Key: SPARK-7542
> URL: https://issues.apache.org/jira/browse/SPARK-7542
> Project: Spark
>  Issue Type: New Feature
>  Components: Spark Core
>Affects Versions: 1.4.0
>Reporter: Josh Rosen
>Assignee: Davies Liu
>
> {{UnsafeExternalSorter}}, introduced in SPARK-7081, uses on-heap {{long[]}} 
> arrays as its sort buffers.  When records are small, the sorting array might 
> be as large as the data pages, so it would be useful to be able to allocate 
> this array off-heap (using our unsafe LongArray).  Unfortunately, we can't 
> currently do this because TimSort calls {{allocate()}} to create data buffers 
> but doesn't call any corresponding cleanup methods to free them.
> We should look into extending TimSort with buffer freeing methods, then 
> consider switching to LongArray in UnsafeShuffleSortDataFormat.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-08-10 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-7542:
---

Assignee: (was: Apache Spark)

 Support off-heap sort buffer in UnsafeExternalSorter
 

 Key: SPARK-7542
 URL: https://issues.apache.org/jira/browse/SPARK-7542
 Project: Spark
  Issue Type: New Feature
  Components: Spark Core
Affects Versions: 1.4.0
Reporter: Josh Rosen

 {{UnsafeExternalSorter}}, introduced in SPARK-7081, uses on-heap {{long[]}} 
 arrays as its sort buffers.  When records are small, the sorting array might 
 be as large as the data pages, so it would be useful to be able to allocate 
 this array off-heap (using our unsafe LongArray).  Unfortunately, we can't 
 currently do this because TimSort calls {{allocate()}} to create data buffers 
 but doesn't call any corresponding cleanup methods to free them.
 We should look into extending TimSort with buffer freeing methods, then 
 consider switching to LongArray in UnsafeShuffleSortDataFormat.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-08-10 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-7542:
---

Assignee: Apache Spark

 Support off-heap sort buffer in UnsafeExternalSorter
 

 Key: SPARK-7542
 URL: https://issues.apache.org/jira/browse/SPARK-7542
 Project: Spark
  Issue Type: New Feature
  Components: Spark Core
Affects Versions: 1.4.0
Reporter: Josh Rosen
Assignee: Apache Spark

 {{UnsafeExternalSorter}}, introduced in SPARK-7081, uses on-heap {{long[]}} 
 arrays as its sort buffers.  When records are small, the sorting array might 
 be as large as the data pages, so it would be useful to be able to allocate 
 this array off-heap (using our unsafe LongArray).  Unfortunately, we can't 
 currently do this because TimSort calls {{allocate()}} to create data buffers 
 but doesn't call any corresponding cleanup methods to free them.
 We should look into extending TimSort with buffer freeing methods, then 
 consider switching to LongArray in UnsafeShuffleSortDataFormat.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org