Tim Armstrong created IMPALA-8102:
-------------------------------------

             Summary: Impala/HBase recommendations need update
                 Key: IMPALA-8102
                 URL: https://issues.apache.org/jira/browse/IMPALA-8102
             Project: IMPALA
          Issue Type: Documentation
          Components: Docs
            Reporter: Tim Armstrong
            Assignee: Tim Armstrong


https://impala.apache.org/docs/build/html/topics/impala_hbase.html hasn't been 
updated for a while. The recommendations are a bit out of date - generally 
HBase is not the best format for analytic workloads yet that page seems to 
encourage using it.

E.g.
{quote}If you have join queries that do aggregation operations on large fact 
tables and join the results against small dimension tables, consider using 
Impala for the fact tables and HBase for the dimension tables.{quote}

Assigning to myself to figure out what the best practice is, but I think we 
need to include:
* A statement Kudu offers significantly better performance for analytical 
workloads with mutable data
* A statement that HDFS tables are also preferable unless data is frequently 
mutated
* A pointer to the Kudu docs



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to