[ 
https://issues.apache.org/jira/browse/HIVE-5687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Roshan Naik updated HIVE-5687:
------------------------------

    Attachment: 5687-api-spec4.docx
                HIVE-5687.v2.patch

Revising API in patch and Spec to handle mapping of incoming data format to 
corresponding cols in table (RecordWriter interface). Adding out of the box 
support for Delimited text formats. More formats are pluggable.

Added support for auto creation of new partitions for streaming clients

> Streaming support in Hive
> -------------------------
>
>                 Key: HIVE-5687
>                 URL: https://issues.apache.org/jira/browse/HIVE-5687
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Roshan Naik
>            Assignee: Roshan Naik
>         Attachments: 5687-api-spec4.docx, 5687-draft-api-spec.pdf, 
> 5687-draft-api-spec2.pdf, 5687-draft-api-spec3.pdf, HIVE-5687.patch, 
> HIVE-5687.v2.patch
>
>
> Implement support for Streaming data into HIVE.
> - Provide a client streaming API 
> - Transaction support: Clients should be able to periodically commit a batch 
> of records atomically
> - Immediate visibility: Records should be immediately visible to queries on 
> commit
> - Should not overload HDFS with too many small files
> Use Cases:
>  - Streaming logs into HIVE via Flume
>  - Streaming results of computations from Storm



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to