[ 
https://issues.apache.org/jira/browse/SAMOA-47?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15043266#comment-15043266
 ] 

ASF GitHub Bot commented on SAMOA-47:
-------------------------------------

GitHub user jayadeepj opened a pull request:

    https://github.com/apache/incubator-samoa/pull/41

    SAMOA-47: Avro documentation

    Detailed instructions on the Avro input format required for SAMOA & how to 
execute SAMOA with Avro data sources.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jayadeepj/incubator-samoa gh-pages

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-samoa/pull/41.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #41
    
----
commit 103d55b0479fbce8f591f55479fd760db41ee063
Author: jayadeepj <[email protected]>
Date:   2015-12-05T11:28:19Z

    SAMOA-47: Avro documentation

commit 73a58cabefa889c1c538bb22cb55c57dd309cf89
Author: jayadeepj <[email protected]>
Date:   2015-12-05T11:48:43Z

    SAMOA-47: Avro documentation

----


> Integrate Avro Streams with SAMOA
> ---------------------------------
>
>                 Key: SAMOA-47
>                 URL: https://issues.apache.org/jira/browse/SAMOA-47
>             Project: SAMOA
>          Issue Type: New Feature
>          Components: SAMOA-API, SAMOA-Instances
>            Reporter: jayadeepj
>            Priority: Minor
>              Labels: patch
>
> The current SAMOA readers can only support data streams in ARFF format. Hence 
> SAMOA as a distributed streaming machine learning framework is limited in 
> scope since end users may have to transform their data to ARFF . Apache Avro 
> is a data serialization system that handles data streams in compact binary 
> format and is typically used in conjunction with with Big Data eco-system 
> tools. Avro allows two encodings for the data: Binary & JSON. Hence an Avro 
> support may allow users with JSON data also to use SAMOA seamlessly.
> The GOAL is to build support for Avro Streams into SAMOA by adding Avro File 
> Stream Handler, Avro Loader to read records & transform to instances and  a 
> user option to switch between JSON/Binary encodings. The input format with 
> representation of meta-data for both JSON/Binary data to be finalized along 
> with build.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to