[ 
https://issues.apache.org/jira/browse/BAHIR-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15998509#comment-15998509
 ] 

ASF GitHub Bot commented on BAHIR-117:
--------------------------------------

Github user c-w commented on the issue:

    https://github.com/apache/bahir/pull/43
  
    Getting the relevant excerpts from the logs about the failure:
    
    07:49:06 [INFO] Reactor Summary:
    07:49:06 [INFO] 
    07:49:06 [INFO] Apache Bahir - Parent POM .......................... 
SUCCESS [  4.278 s]
    07:49:06 [INFO] Apache Bahir - Spark SQL Cloudant DataSource ....... 
SUCCESS [ 16.878 s]
    07:49:06 [INFO] Apache Bahir - Spark Streaming Akka ................ 
SUCCESS [ 27.164 s]
    **07:49:06 [INFO] Apache Bahir - Spark SQL Streaming Akka ............ 
FAILURE [01:16 min]**
    07:49:06 [INFO] Apache Bahir - Spark Streaming MQTT ................ SKIPPED
    07:49:06 [INFO] Apache Bahir - Spark SQL Streaming MQTT ............ SKIPPED
    07:49:06 [INFO] Apache Bahir - Spark Streaming Twitter ............. SKIPPED
    07:49:06 [INFO] Apache Bahir - Spark Streaming ZeroMQ .............. SKIPPED
    07:49:06 [INFO] Apache Bahir - Spark Extensions Distribution ....... SKIPPED
    
    07:49:06 # A fatal error has been detected by the Java Runtime Environment:
    07:49:06 #
    07:49:06 #  SIGSEGV (0xb) at pc=0x00007f100f8f4988, pid=5934, 
tid=139706863331072
    07:49:06 #
    07:49:06 # JRE version: OpenJDK Runtime Environment (8.0_91-b14) (build 
1.8.0_91-b14)
    07:49:06 # Java VM: OpenJDK 64-Bit Server VM (25.91-b14 mixed mode 
linux-amd64 compressed oops)
    07:49:06 # Problematic frame:
    **07:49:06 # C  [librocksdbjni8312626221310549185.so+0x1f8988] 
rocksdb::GetColumnFamilyID(rocksdb::ColumnFamilyHandle\*)+0x8**
    
    Not sure what to make of this... running the tests locally on Windows 10 
with JDK 1.8.0_131 and Maven 3.5.0 everything passes ([full build 
log](https://pastebin.com/dsCNg1XN)). Any ideas, @lresende?


> Expand filtering options for TwitterInputDStream
> ------------------------------------------------
>
>                 Key: BAHIR-117
>                 URL: https://issues.apache.org/jira/browse/BAHIR-117
>             Project: Bahir
>          Issue Type: Improvement
>          Components: Spark Streaming Connectors
>            Reporter: Clemens Wolff
>
> Currently, the TwitterInputDStream only supports filtering by keywords [1] 
> which corresponds to the "track" option in the Twitter API [2]. The Twitter 
> API supports many more ways to receive a filtered stream (e.g. get Tweets in 
> a particular location [3]). It would be very useful to expose these 
> additional filtering options in this library.
> Proposal: add a new public method to TwitterUtils which follows the same 
> interface as createStream [4] but which takes a FilterQuery [5] object as 
> argument. In this way, we give full filtering flexibility to our users.
> I'm currently working on Project Fortis, a social data analysis platform for 
> the United Nations [6]. The extra filtering options would be very useful for 
> my project so I'm happy to implement this and create a pull request.
> [1] 
> https://github.com/apache/bahir/blob/fd4c35fc9f7ebb57464d231cf5d66e7bc4096a1b/streaming-twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterInputDStream.scala#L44
> [2] https://dev.twitter.com/streaming/overview/request-parameters#track
> [3] https://dev.twitter.com/streaming/overview/request-parameters#locations
> [4] 
> https://github.com/apache/bahir/blob/fd4c35fc9f7ebb57464d231cf5d66e7bc4096a1b/streaming-twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterUtils.scala#L39
> [5] http://twitter4j.org/javadoc/twitter4j/FilterQuery.html
> [6] https://fortis-web.azurewebsites.net/#/site/ocha/



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to