[ 
https://issues.apache.org/jira/browse/IMPALA-8102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16758589#comment-16758589
 ] 

ASF subversion and git services commented on IMPALA-8102:
---------------------------------------------------------

Commit 79e735a46df258395ea518a5cf6e22e851a91119 in impala's branch 
refs/heads/master from Tim Armstrong
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=79e735a ]

IMPALA-8102: update Impala/HBase docs

Provide pointers to Kudu, which is generally better for analytics.

Remove or reword advice that encourages people to use HBase for
analytics.

Remove incorrect information about joins resulting in single-row HBase
lookups - this simply doesn't happen.

Change-Id: If1d5f014722d35eab9b60f7a4e8479738f1bed5b
Reviewed-on: http://gerrit.cloudera.org:8080/12315
Tested-by: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Reviewed-by: Alex Rodoni <arod...@cloudera.com>


> Impala/HBase recommendations need update
> ----------------------------------------
>
>                 Key: IMPALA-8102
>                 URL: https://issues.apache.org/jira/browse/IMPALA-8102
>             Project: IMPALA
>          Issue Type: Documentation
>          Components: Docs
>            Reporter: Tim Armstrong
>            Assignee: Tim Armstrong
>            Priority: Major
>
> https://impala.apache.org/docs/build/html/topics/impala_hbase.html hasn't 
> been updated for a while. The recommendations are a bit out of date: 
> HBase is generally not the best format for analytic workloads, yet that page 
> seems to encourage using it.
> E.g.
> {quote}If you have join queries that do aggregation operations on large fact 
> tables and join the results against small dimension tables, consider using 
> Impala for the fact tables and HBase for the dimension tables.{quote}
> Assigning to myself to figure out what the best practice is, but I think we 
> need to include:
> * A statement that Kudu offers significantly better performance for analytical 
> workloads with mutable data
> * A statement that HDFS tables are also preferable unless data is frequently 
> mutated
> * A pointer to the Kudu docs



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org
