[
https://issues.apache.org/jira/browse/SAMOA-16?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513912#comment-14513912
]
ASF GitHub Bot commented on SAMOA-16:
-------------------------------------
Github user gdfm commented on the pull request:
https://github.com/apache/incubator-samoa/pull/11#issuecomment-96609902
Thanks, I managed to make it run on Flink.
I am testing the VHT algorithm with ```bin/samoa flink
target/SAMOA-Flink-0.3.0-SNAPSHOT.jar "PrequentialEvaluation -d /tmp/dump.csv
-i 1000000 -f 100000 -l (classifiers.trees.VerticalHoeffdingTree -p 4) -s
(generators.RandomTreeGenerator -c 2 -o 10 -u 10)"```
I had to increase the task manager slots considerably to make it run. I see
many "Filter" tasks in Flink's dashboard. Is that normal?
The accuracy of the VHT is reasonable, and comparable with what we get with
Storm. It's lower than with the local execution engine, but that's an
algorithmic problem we are working on.
I also feel that's a bit slow. I didn't get the output from the console
telling me how long it took, but it took a while and the fan of my laptop
started spinning. Do you think there is margin for improvement in this respect?
> Add an adapter for Apache Flink-Streaming
> -----------------------------------------
>
> Key: SAMOA-16
> URL: https://issues.apache.org/jira/browse/SAMOA-16
> Project: SAMOA
> Issue Type: New Feature
> Reporter: Paris Carbone
> Assignee: Gianmarco De Francisci Morales
>
> Apache Flink-Streaming is a new system for distributed stream processing
> built for unique and flexible high level stream transformations. A Flink
> adapter for Samoa should be able to translate a Samoa Task topology into
> Flink streaming transformations. Some of the challenges are the compositional
> topology support, circle detection and their translation to Flink iterations.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)