[jira] [Created] (HAMA-983) Hama runner for DataFlow

2016-02-14 Thread Edward J. Yoon (JIRA)
Edward J. Yoon created HAMA-983:
---

 Summary: Hama runner for DataFlow
 Key: HAMA-983
 URL: https://issues.apache.org/jira/browse/HAMA-983
 Project: Hama
 Issue Type: Bug
 Reporter: Edward J. Yoon


As you already know, Apache Beam provides a unified programming model for both 
batch and streaming inputs.

The APIs mostly deal with filtering and transforming data, so we'll need to 
implement a data processing runner along the lines of 
https://github.com/dapurv5/MapReduce-BSP-Adapter/blob/master/src/main/java/org/apache/hama/mapreduce/examples/WordCount.java

Also, implementing a similarity join could be fun. According to 
http://www.ruizhang.info/publications/TPDS2015-Heads_Join.pdf, Apache Hama is 
the clear winner over Apache Hadoop and Apache Spark.

Since a similarity join consists of transformation, aggregation, and partition 
computations, I think it's possible to implement it using the Apache Beam APIs.
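
To make the three kinds of computation concrete, here is a minimal sketch in 
plain Java (8 or later). It is a toy Jaccard self-join over token sets, not the 
algorithm from the paper above, and it uses neither Beam nor Hama APIs. The 
class name SimilarityJoinSketch, the join method, and its parameters are all 
made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

/** Hypothetical toy example: Jaccard self-join in three phases. */
class SimilarityJoinSketch {

    /** Returns "i,j" -> Jaccard similarity for record pairs i < j at or above the threshold. */
    static Map<String, Double> join(List<Set<String>> records, int numPartitions, double threshold) {
        List<Map<String, List<Integer>>> partitions = new ArrayList<>();
        for (int p = 0; p < numPartitions; p++) {
            partitions.add(new TreeMap<>());
        }

        // Transformation: explode each record into (token, recordId) pairs.
        // Partition: route each pair by token hash, so that all records
        // sharing a token meet in the same partition.
        for (int id = 0; id < records.size(); id++) {
            for (String token : records.get(id)) {
                int p = Math.floorMod(token.hashCode(), numPartitions);
                partitions.get(p).computeIfAbsent(token, t -> new ArrayList<>()).add(id);
            }
        }

        // Aggregation: count the tokens shared by each candidate pair.
        // Ids are appended in ascending order, so every key has i < j.
        Map<String, Integer> overlap = new TreeMap<>();
        for (Map<String, List<Integer>> partition : partitions) {
            for (List<Integer> ids : partition.values()) {
                for (int a = 0; a < ids.size(); a++) {
                    for (int b = a + 1; b < ids.size(); b++) {
                        overlap.merge(ids.get(a) + "," + ids.get(b), 1, Integer::sum);
                    }
                }
            }
        }

        // Final filter: Jaccard = |A ∩ B| / |A ∪ B|, keep pairs at or above the threshold.
        Map<String, Double> result = new TreeMap<>();
        for (Map.Entry<String, Integer> e : overlap.entrySet()) {
            String[] pair = e.getKey().split(",");
            int i = Integer.parseInt(pair[0]);
            int j = Integer.parseInt(pair[1]);
            int union = records.get(i).size() + records.get(j).size() - e.getValue();
            double similarity = (double) e.getValue() / union;
            if (similarity >= threshold) {
                result.put(e.getKey(), similarity);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Set<String>> records = Arrays.<Set<String>>asList(
                new HashSet<>(Arrays.asList("a", "b", "c")),
                new HashSet<>(Arrays.asList("b", "c", "d")),
                new HashSet<>(Arrays.asList("x", "y")));
        System.out.println(join(records, 4, 0.5));
    }
}
```

In the sample data, records 0 and 1 share 2 of their 4 distinct tokens, so 
their similarity is 0.5; record 2 shares nothing and never becomes a candidate. 
In a real runner each phase would presumably map to a Beam transform rather 
than to in-memory collections.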





[VOTE] Apache Hama 0.7 release (RC2)

2016-02-14 Thread Edward J. Yoon
Hi all,

I just created the second release candidate (RC2) for the Apache Hama 0.7.1
release. This RC fixes a newly reported bug in the graph module. It was
compiled with Java 7.

RC2 is available at:
http://people.apache.org/~edwardyoon/dist/0.7.1-RC2/

Tags:
https://github.com/apache/hama/tree/0.7.1-RC2

Please try it in your environment, run the tests, verify the checksum files,
etc., and vote.
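
For the checksum step, a small helper like the following could be used. This
is only a sketch: the class name ChecksumCheck and the command-line arguments
are hypothetical, and it assumes the published checksum file contains the hex
digest somewhere in its text (possibly split by spaces or next to a file
name). Use whichever digest algorithm matches the checksum file you
downloaded.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

/** Hypothetical helper: compare a file's digest with a published checksum. */
class ChecksumCheck {

    /** Streams the file through the given digest algorithm and returns lowercase hex. */
    static String hexDigest(Path file, String algorithm)
            throws IOException, NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance(algorithm);
        try (InputStream in = Files.newInputStream(file)) {
            byte[] buf = new byte[8192];
            int n;
            while ((n = in.read(buf)) != -1) {
                md.update(buf, 0, n);
            }
        }
        StringBuilder sb = new StringBuilder();
        for (byte b : md.digest()) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    /**
     * Loose comparison (an assumption about the checksum file layout):
     * strip all whitespace, lowercase, and look for the digest as a substring.
     */
    static boolean matches(String actualHex, String publishedText) {
        String normalized = publishedText.replaceAll("\\s+", "").toLowerCase();
        return normalized.contains(actualHex.toLowerCase());
    }

    // Usage (arguments are hypothetical): java ChecksumCheck <artifact> <algorithm> <checksum-file>
    public static void main(String[] args) throws Exception {
        if (args.length != 3) {
            System.err.println("usage: ChecksumCheck <artifact> <algorithm> <checksum-file>");
            return;
        }
        String actual = hexDigest(Paths.get(args[0]), args[1]);
        String published = new String(
                Files.readAllBytes(Paths.get(args[2])), StandardCharsets.UTF_8);
        System.out.println(matches(actual, published) ? "OK" : "MISMATCH: " + actual);
    }
}
```

This checks integrity only; it does not verify any signature on the release.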

Thanks~

--
Best Regards, Edward J. Yoon