[GitHub] spark pull request: [SPARK-12635][SQL] Add ColumnarBatch, an in me...

kiszk Thu, 07 Jan 2016 00:28:58 -0800

Github user kiszk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10628#discussion_r49050418
  
    --- Diff: 
sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/OffHeapColumnVector.java
 ---
    @@ -0,0 +1,162 @@
    +package org.apache.spark.sql.execution.vectorized;
    +
    +import org.apache.spark.sql.types.DataType;
    +import org.apache.spark.sql.types.DoubleType;
    +import org.apache.spark.sql.types.IntegerType;
    +import org.apache.spark.unsafe.Platform;
    +
    +import java.nio.ByteBuffer;
    +import java.nio.DoubleBuffer;
    +
    +/**
    + * Column data backed using offheap memory.
    + */
    +public final class OffHeapColumnVector extends ColumnVector {
    +  // The data stored in these two allocations need to maintain binary 
compatible. We can
    +  // directly pass this buffer to external components.
    +  private long nulls;
    +  private long data;
    +
    +  protected OffHeapColumnVector(int capacity, DataType type) {
    +    super(capacity, type);
    +    this.nulls = Platform.allocateMemory(capacity);
    --- End diff --
    
    Is is better to first get a area for this.data and to next get a area for 
this.nulls for this.nulls from the combined allocated memory?  It expects to 
this.data is aligned with a cache line boundary.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: [SPARK-12635][SQL] Add ColumnarBatch, an in me...

Reply via email to