Jay created SAMOA-47:
------------------------
Summary: Integrate Avro Streams with SAMOA
Key: SAMOA-47
URL: https://issues.apache.org/jira/browse/SAMOA-47
Project: SAMOA
Issue Type: New Feature
Components: SAMOA-API, SAMOA-Instances
Reporter: Jay
Priority: Minor
The current SAMOA readers can only support data streams in ARFF format. Hence
SAMOA as a distributed streaming machine learning framework is limited in scope
since end users may have to transform their data to ARFF . Apache Avro is a
data serialization system that handles data streams in compact binary format
and is typically used in conjunction with with Big Data eco-system tools. Avro
allows two encodings for the data: Binary & JSON. Hence an Avro support may
allow users with JSON data also to use SAMOA seamlessly.
The GOAL is to build support for Avro Streams into SAMOA by adding Avro File
Stream Handler, Avro Loader to read records & transform to instances and a
user option to switch between JSON/Binary encodings. The input format with
representation of meta-data for both JSON/Binary data to be finalized along
with build.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)