[GitHub] spark pull request #19327: [WIP] Implement stream-stream outer joins.

tdas Fri, 22 Sep 2017 17:11:40 -0700

Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19327#discussion_r140612077
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/SymmetricHashJoinStateManager.scala ---
    @@ -329,6 +392,27 @@ class SymmetricHashJoinStateManager(
           }
         }
     
    +    /** Get all the values for key and all indices, in a (value, index) tuple. */
    +    def getAllWithIndex(key: UnsafeRow, numValues: Long): Iterator[(UnsafeRow, Long)] = {
    --- End diff --
    
    We can probably convert getAll to this. It does not make sense to have both,
    especially since both are equally efficient if you return
    Iterator[KeyWithIndexAndValue].
    
    Also, I think the iterator() method can be removed. It is effectively
    unused (it is used only in StateManager.iterator, which itself is not
    really being used).
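
    A minimal sketch of the merged accessor suggested above, under loud
    assumptions: String stands in for UnsafeRow, an in-memory map stands in
    for the state store, and KeyWithIndexToValueSketch, put, and Demo are
    hypothetical names, not the code in this PR. KeyWithIndexAndValue here is
    a simplified stand-in for the type named in the comment. It only
    illustrates how one getAll returning Iterator[KeyWithIndexAndValue] could
    serve callers that need indices and callers that do not.

```scala
// Simplified stand-ins: String replaces UnsafeRow, and an in-memory map
// replaces the state store. All names below are hypothetical.
import scala.collection.mutable

// Mutable holder reused across iteration, so no tuple is allocated per row.
final class KeyWithIndexAndValue(
    var key: String = null,
    var valueIndex: Long = -1L,
    var value: String = null) {
  def withNew(k: String, i: Long, v: String): this.type = {
    key = k; valueIndex = i; value = v
    this
  }
}

final class KeyWithIndexToValueSketch {
  private val store = mutable.Map.empty[(String, Long), String]

  def put(key: String, index: Long, value: String): Unit =
    store((key, index)) = value

  /** Single accessor: yields each stored value for `key` with its index. */
  def getAll(key: String, numValues: Long): Iterator[KeyWithIndexAndValue] = {
    val reused = new KeyWithIndexAndValue()
    Iterator.iterate(0L)(_ + 1).takeWhile(_ < numValues).flatMap { i =>
      // Skip indices with no stored value; reuse the holder for the rest.
      store.get((key, i)).map(v => reused.withNew(key, i, v))
    }
  }
}

object Demo {
  def main(args: Array[String]): Unit = {
    val s = new KeyWithIndexToValueSketch
    s.put("k", 0L, "a"); s.put("k", 1L, "b")
    // Callers that only need values project them out of the holder.
    println(s.getAll("k", 2L).map(_.value).toList)
    // Callers that need indices read both fields from the same iterator.
    println(s.getAll("k", 2L).map(e => (e.value, e.valueIndex)).toList)
  }
}
```

    Because the holder is reused, each element has to be read (or copied)
    before the iterator advances; that is the trade-off for avoiding a tuple
    allocation per row.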


