Sorry about that!  Here’s a link to the notebook that doesn’t require
registration.

https://www.zepl.com/viewer/notebooks/bm90ZTovL3N0ZXZlYmxhY2ttb24vYXBhY2hlLXplcHBlbGluLWRhc2hib2FyZC84YjQ5YmY3MWIxYTU0ZTE2YjlkMDQyMTliMzNlMjQzYS9ub3RlLmpzb24

In this notebook we used the %spark interpreter to collect the data, but
most of the work is done as scala in the driver process.  The streams code
base is java and not dependent on spark or other frameworks external to the
jar file.

The easiest integration I can think of given the python/java language gap
would use docker - Streams could prepare a docker container packaged with
all the necessary code, and Kibble installations could use it to run ad-hoc
or scheduled data processes.  The data collected could be written as
new-line delimited json on container mounted volumes,  or directly to an
elasticsearch index.

Docker’s not really necessary though, if the system where Kibble’s running
has a JRE configured and a streams distribution local that could work too.

Steve

On Dec 2, 2017 at 2:10 PM, Daniel Gruno <[email protected]> wrote:


On 12/02/2017 09:07 PM, Steve Blackmon wrote:

Hi Kibble Team,

I've been checking out the code and the demo site this weekend.

I'm interested in joining the team and integrating some of the data
sources maintained in http://streams.apache.org

Specifically, activity streams from the social media presences of
projects and contributors (who opt in) as well as statistics derived
from them could make a nice addition to Kibble.

Here's an example: analysis of Twitter accounts of Apache project
using Streams and Zeppelin:
https://www.zepl.com/UvGWgAZb7/spaces/Sb9ElZuDD/8b49bf71b1a54e16b9d04219b33e243a

Cheers,

Steve Blackmon
[email protected]


Hi Steve,
I like the idea, but I am unable to see the link you shared, it shows a
404 for me :(. Having said that, looking into the social media space is
definitely something worth doing!

With regards,
Daniel.

Reply via email to