Jay created SAMOA-47:
------------------------

             Summary: Integrate Avro Streams with SAMOA
                 Key: SAMOA-47
                 URL: https://issues.apache.org/jira/browse/SAMOA-47
             Project: SAMOA
          Issue Type: New Feature
          Components: SAMOA-API, SAMOA-Instances
            Reporter: Jay
            Priority: Minor


The current SAMOA readers support data streams only in ARFF format. As a
distributed streaming machine learning framework, SAMOA is therefore limited
in scope, since end users may first have to convert their data to ARFF.
Apache Avro is a data serialization system that handles data streams in a
compact binary format and is typically used in conjunction with Big Data
ecosystem tools. Avro allows two encodings for the data: binary and JSON.
Avro support may therefore also allow users with JSON data to use SAMOA
seamlessly.

The goal is to build support for Avro streams into SAMOA by adding an Avro
file stream handler, an Avro loader that reads records and transforms them
into instances, and a user option to switch between the JSON and binary
encodings. The input format, including the representation of metadata for
both JSON and binary data, is to be finalized as the feature is built.
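
As a rough illustration of the "transform records to instances" step and the
encoding option, here is a minimal sketch. It is not SAMOA or Avro code: the
class name AvroRecordSketch, the Encoding enum, and both methods are
hypothetical. A plain Map stands in for a decoded Avro record so that the
example needs no Avro dependency, and the attribute metadata layout (field
name mapped to its list of nominal labels, empty for numeric fields) is only
one possible choice, since the issue leaves the metadata format open. Storing
a nominal value as the index of its label is an assumption borrowed from
ARFF-style instances.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical sketch; none of these names come from SAMOA or Avro. */
final class AvroRecordSketch {

    /** The proposed user option: which Avro encoding the input stream uses. */
    enum Encoding { BINARY, JSON }

    /** Parses the option case-insensitively; unknown values throw IllegalArgumentException. */
    static Encoding parseEncoding(String option) {
        return Encoding.valueOf(option.trim().toUpperCase(Locale.ROOT));
    }

    /**
     * Turns one decoded record into a dense numeric vector.
     * attributes maps each field name (in instance order) to its nominal
     * labels; an empty list marks a numeric field. Missing or null fields
     * become NaN.
     */
    static double[] toValues(Map<String, Object> record,
                             Map<String, List<String>> attributes) {
        double[] values = new double[attributes.size()];
        int i = 0;
        for (Map.Entry<String, List<String>> attr : attributes.entrySet()) {
            Object raw = record.get(attr.getKey());
            if (raw == null) {
                values[i] = Double.NaN;                    // missing value
            } else if (raw instanceof Number) {
                values[i] = ((Number) raw).doubleValue();  // numeric field
            } else {
                // Nominal field: store the index of the label (assumption).
                int index = attr.getValue().indexOf(raw.toString());
                if (index < 0) {
                    throw new IllegalArgumentException(
                        "Unknown label '" + raw + "' for field " + attr.getKey());
                }
                values[i] = index;
            }
            i++;
        }
        return values;
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the attribute order stable across records.
        Map<String, List<String>> attributes = new LinkedHashMap<>();
        attributes.put("sepal_length", Collections.<String>emptyList());
        attributes.put("species", Arrays.asList("setosa", "virginica"));

        Map<String, Object> record = new HashMap<>();
        record.put("sepal_length", 5.1);
        record.put("species", "virginica");

        System.out.println(parseEncoding("json"));
        System.out.println(Arrays.toString(toValues(record, attributes)));
    }
}
```

In a real loader the Map would presumably be replaced by records decoded with
Avro's own reader for the selected encoding, and the resulting vector would
be wrapped in a SAMOA instance; both of those steps are outside this sketch.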



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
