Sorry about that! Here’s a link to the notebook that doesn’t require registration.
https://www.zepl.com/viewer/notebooks/bm90ZTovL3N0ZXZlYmxhY2ttb24vYXBhY2hlLXplcHBlbGluLWRhc2hib2FyZC84YjQ5YmY3MWIxYTU0ZTE2YjlkMDQyMTliMzNlMjQzYS9ub3RlLmpzb24

In this notebook we used the %spark interpreter to collect the data, but most of the work is done in Scala in the driver process. The Streams code base is Java and does not depend on Spark or any other framework external to the jar file.

The easiest integration I can think of, given the Python/Java language gap, would use Docker: Streams could prepare a Docker container packaged with all the necessary code, and Kibble installations could use it to run ad hoc or scheduled data processes. The data collected could be written as newline-delimited JSON on container-mounted volumes, or directly to an Elasticsearch index.

Docker’s not really necessary, though: if the system where Kibble’s running has a JRE configured and a local Streams distribution, that could work too.

Steve

On Dec 2, 2017 at 2:10 PM, Daniel Gruno <[email protected]> wrote:

> On 12/02/2017 09:07 PM, Steve Blackmon wrote:
>> Hi Kibble Team,
>>
>> I've been checking out the code and the demo site this weekend.
>>
>> I'm interested in joining the team and integrating some of the data
>> sources maintained in http://streams.apache.org
>>
>> Specifically, activity streams from the social media presences of
>> projects and contributors (who opt in), as well as statistics derived
>> from them, could make a nice addition to Kibble.
>>
>> Here's an example: analysis of Twitter accounts of Apache projects
>> using Streams and Zeppelin:
>>
>> https://www.zepl.com/UvGWgAZb7/spaces/Sb9ElZuDD/8b49bf71b1a54e16b9d04219b33e243a
>>
>> Cheers,
>> Steve Blackmon
>> [email protected]
>
> Hi Steve,
> I like the idea, but I am unable to see the link you shared; it shows a
> 404 for me :(.
>
> Having said that, looking into the social media space is definitely
> something worth doing!
>
> With regards,
> Daniel.
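
For what it's worth, the Kibble side of the Docker approach described above could look roughly like the sketch below. It is only an illustration under assumptions: the image name passed in, the /data mount point inside the container, and the *.json file naming are all hypothetical, and nothing like this exists in Streams or Kibble today. It runs a container with a host directory mounted as a volume, then reads back whatever newline-delimited JSON the container wrote there.

```python
import json
import subprocess
from pathlib import Path
from typing import Iterator, List


def docker_command(image: str, output_dir: Path) -> List[str]:
    """Build a `docker run` command that mounts output_dir into the container."""
    return [
        "docker", "run", "--rm",
        # /data is an assumed mount point; a real image would define its own.
        "-v", f"{output_dir.resolve()}:/data",
        image,
    ]


def read_ndjson(path: Path) -> Iterator[dict]:
    """Yield one parsed JSON document per non-empty line of the file."""
    with path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def collect(image: str, output_dir: Path) -> List[dict]:
    """Run the container, then load every *.json file found in output_dir."""
    output_dir.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if the container exits non-zero.
    subprocess.run(docker_command(image, output_dir), check=True)
    documents: List[dict] = []
    for path in sorted(output_dir.glob("*.json")):  # file naming is an assumption
        documents.extend(read_ndjson(path))
    return documents
```

A scheduled job could call collect() with whatever image Streams ends up publishing and hand the resulting documents to Kibble's own indexing code. Writing straight to Elasticsearch from inside the container would make the read-back step unnecessary.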
