[GitHub] spark pull request: [SPARK-2788] [STREAMING] Add location filterin...

sjbrunst Sun, 07 Sep 2014 13:24:46 -0700

Github user sjbrunst commented on a diff in the pull request:

    https://github.com/apache/spark/pull/1717#discussion_r17217079
  
    --- Diff: 
external/twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterInputDStream.scala
 ---
    @@ -42,6 +44,7 @@ class TwitterInputDStream(
         @transient ssc_ : StreamingContext,
         twitterAuth: Option[Authorization],
         filters: Seq[String],
    +    locations: Seq[Seq[Double]],
    --- End diff --
    
    Good question. It definitely is confusing. I went with ``Seq[Seq[Double]]`` 
because the ``FilterQuery`` created in TwitterInputDStream.scala requires a 
``double[][]`` 
(http://twitter4j.org/javadoc/twitter4j/FilterQuery.html#locations-double:A:A-).
 This way the only change I have to make to the input is to change between 
Scala sequences and Java arrays.
    
    The ``Location`` case class you described still does not remove all 
ambiguity, because the ``FilterQuery`` requires the south-west corner then the 
north-east corner for the boundary, and that would not prevent someone from 
giving them in the wrong order and getting unexpected results. If we're going 
to define a ``case class`` anyways, I think it would be better to make 
something like ``case class Boundary(west: Double, south: Double, east: Double, 
north: Double)``. Then the locations parameter would be of type 
``Seq[Boundary]``, and I can convert it to a ``double[][]`` just before passing 
it to the ``FilterQuery`` in TwitterInputDStream.scala. Should I go ahead and 
implement that?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: [SPARK-2788] [STREAMING] Add location filterin...

Reply via email to