[ https://issues.apache.org/jira/browse/BAHIR-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15998509#comment-15998509 ]
ASF GitHub Bot commented on BAHIR-117: -------------------------------------- Github user c-w commented on the issue: https://github.com/apache/bahir/pull/43 Getting the relevant excerpts from the logs about the failure: 07:49:06 [INFO] Reactor Summary: 07:49:06 [INFO] 07:49:06 [INFO] Apache Bahir - Parent POM .......................... SUCCESS [ 4.278 s] 07:49:06 [INFO] Apache Bahir - Spark SQL Cloudant DataSource ....... SUCCESS [ 16.878 s] 07:49:06 [INFO] Apache Bahir - Spark Streaming Akka ................ SUCCESS [ 27.164 s] **07:49:06 [INFO] Apache Bahir - Spark SQL Streaming Akka ............ FAILURE [01:16 min]** 07:49:06 [INFO] Apache Bahir - Spark Streaming MQTT ................ SKIPPED 07:49:06 [INFO] Apache Bahir - Spark SQL Streaming MQTT ............ SKIPPED 07:49:06 [INFO] Apache Bahir - Spark Streaming Twitter ............. SKIPPED 07:49:06 [INFO] Apache Bahir - Spark Streaming ZeroMQ .............. SKIPPED 07:49:06 [INFO] Apache Bahir - Spark Extensions Distribution ....... SKIPPED 07:49:06 # A fatal error has been detected by the Java Runtime Environment: 07:49:06 # 07:49:06 # SIGSEGV (0xb) at pc=0x00007f100f8f4988, pid=5934, tid=139706863331072 07:49:06 # 07:49:06 # JRE version: OpenJDK Runtime Environment (8.0_91-b14) (build 1.8.0_91-b14) 07:49:06 # Java VM: OpenJDK 64-Bit Server VM (25.91-b14 mixed mode linux-amd64 compressed oops) 07:49:06 # Problematic frame: **07:49:06 # C [librocksdbjni8312626221310549185.so+0x1f8988] rocksdb::GetColumnFamilyID(rocksdb::ColumnFamilyHandle\*)+0x8** Not sure what to make of this... running the tests locally on Windows 10 with JDK 1.8.0_131 and Maven 3.5.0 everything passes ([full build log](https://pastebin.com/dsCNg1XN)). Any ideas, @lresende? > Expand filtering options for TwitterInputDStream > ------------------------------------------------ > > Key: BAHIR-117 > URL: https://issues.apache.org/jira/browse/BAHIR-117 > Project: Bahir > Issue Type: Improvement > Components: Spark Streaming Connectors > Reporter: Clemens Wolff > > Currently, the TwitterInputDStream only supports filtering by keywords [1] > which corresponds to the "track" option in the Twitter API [2]. The Twitter > API supports many more ways to receive a filtered stream (e.g. get Tweets in > a particular location [3]). It would be very useful to expose these > additional filtering options in this library. > Proposal: add a new public method to TwitterUtils which follows the same > interface as createStream [4] but which takes a FilterQuery [5] object as > argument. In this way, we give full filtering flexibility to our users. > I'm currently working on Project Fortis, a social data analysis platform for > the United Nations [6]. The extra filtering options would be very useful for > my project so I'm happy to implement this and create a pull request. > [1] > https://github.com/apache/bahir/blob/fd4c35fc9f7ebb57464d231cf5d66e7bc4096a1b/streaming-twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterInputDStream.scala#L44 > [2] https://dev.twitter.com/streaming/overview/request-parameters#track > [3] https://dev.twitter.com/streaming/overview/request-parameters#locations > [4] > https://github.com/apache/bahir/blob/fd4c35fc9f7ebb57464d231cf5d66e7bc4096a1b/streaming-twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterUtils.scala#L39 > [5] http://twitter4j.org/javadoc/twitter4j/FilterQuery.html > [6] https://fortis-web.azurewebsites.net/#/site/ocha/ -- This message was sent by Atlassian JIRA (v6.3.15#6346)