[
https://issues.apache.org/jira/browse/FLINK-20239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235367#comment-17235367
]
Rui Li commented on FLINK-20239:
--------------------------------
bq. Hive streaming has also streaming/non-streaming mode
That is to tell users how to enable streaming read. By default, a hive table
can only be read in a batch manner.
bq. what is the difference between the reading/writing from "Hive Read & Write"
and "Hive Streaming"
Like I said, that's for batch and streaming respectively.
bq. lastly I can't see any value in the "Hive Read & Write". It just states a
few rather standard SQL statements.
I guess that was because these standard SQL statements are supported
incrementally, e.g. "insert overwrite" was not supported in the first release.
So we need to let users know such limitations. But I agree those examples are
too verbose. We can just list what're supported. Or since we have supported all
common batch cases, perhaps it's not helpful to list them at all?
bq. I don't like the artificially bloated sections in the "Hive Read & Write"
Agreed. Actually I believe most of the optimizations are transparent to users
and shouldn't be mentioned there.
> Confusing pages: "Hive Read & Write" and "Hive Streaming"
> ---------------------------------------------------------
>
> Key: FLINK-20239
> URL: https://issues.apache.org/jira/browse/FLINK-20239
> Project: Flink
> Issue Type: Improvement
> Components: Connectors / Hive, Documentation
> Affects Versions: 1.12.0
> Reporter: Dawid Wysakowicz
> Priority: Critical
> Fix For: 1.12.0
>
>
> The two pages describe how to read & write from Hive. It is not very clear
> what is the relation between the two pages. Moreover the {{Hive Streaming}}
> is way more comprehensive.
> Personally I found the {{Hive Read & Write}} page not helpful and bloated
> with irrelevant sections such as e.g. Formats, Limit pushdown which often
> contain a single sentence.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)