[ 
https://issues.apache.org/jira/browse/FLINK-20239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235583#comment-17235583
 ] 

Seth Wiesman commented on FLINK-20239:
--------------------------------------

[~lirui] I just looked through the hive docs and I agree with [~dwysakowicz] 
it's a bit confusing. 

My proposal would be to merge the two documents. The new FileSink documentation 
is a great example of how to talk about a connector that supports both batch 
and streaming use cases[1]. I would honestly encourage you to use the FileSink 
docs as a guide and tweak the specifics for Hive. 

I would have a single page with a comprehensive section on reading and a 
comprehensive section on writing. 

For reading, you would say something like "Flink supports reading data from 
Hive in both BATCH and STREAMING modes. When run as a BATCH application, Flink 
will query the table at the point in time when the query is executed. STREAMING 
reads will continuously monitor the table and incrementally fetch new data as 
it is made available."

And then something similar for writing. 

Within each section, you can have subsections about specific batch and 
streaming configurations where relevant. 

[1] 
https://ci.apache.org/projects/flink/flink-docs-master/dev/connectors/file_sink.html
 

> Confusing pages: "Hive Read & Write" and "Hive Streaming"
> ---------------------------------------------------------
>
>                 Key: FLINK-20239
>                 URL: https://issues.apache.org/jira/browse/FLINK-20239
>             Project: Flink
>          Issue Type: Improvement
>          Components: Connectors / Hive, Documentation
>    Affects Versions: 1.12.0
>            Reporter: Dawid Wysakowicz
>            Priority: Critical
>             Fix For: 1.12.0
>
>
> The two pages describe how to read & write from Hive. It is not very clear 
> what is the relation between the two pages. Moreover the {{Hive Streaming}} 
> is way more comprehensive.
> Personally I found the {{Hive Read & Write}} page not helpful and bloated 
> with irrelevant sections such as e.g. Formats, Limit pushdown which often 
> contain a single sentence.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to