This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hive-site.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 94f4894  deploy: 25291cb5dd5a01175f3c2772bf81723353dee12b
94f4894 is described below

commit 94f48943b0dad854d6d48112b1c5f4c42fd6c4f8
Author: zhangbutao <[email protected]>
AuthorDate: Wed Jan 8 13:53:11 2025 +0000

    deploy: 25291cb5dd5a01175f3c2772bf81723353dee12b
---
 docs/latest/index.xml                              |  2 +-
 .../manual-installation_283118363/index.html       | 63 ++++++++++++++++++++++
 index.xml                                          |  2 +-
 3 files changed, 65 insertions(+), 2 deletions(-)

diff --git a/docs/latest/index.xml b/docs/latest/index.xml
index fc4131a..d6e30c3 100644
--- a/docs/latest/index.xml
+++ b/docs/latest/index.xml
@@ -206,7 +206,7 @@ UDFs xpath, xpath_short, xpath_int, xpath_long, 
xpath_float, xpath_double, xpath
 Type Postfix Example TINYINT Y 100Y SMALLINT S 100S BIGINT L 100L String types 
String literals can be expressed with either single quotes (') or double quotes 
(&amp;quot;).</description></item><item><title>Apache Hive : 
LLAP</title><link>https://hive.apache.org/docs/latest/llap_62689557/</link><pubDate>Thu,
 12 Dec 2024 00:00:00 
+0000</pubDate><guid>https://hive.apache.org/docs/latest/llap_62689557/</guid><description>Apache
 Hive : LLAP Overview Persistent Daemon Execution Engine Query Fr [...]
 For configuration of LLAP, see the LLAP Section of Configuration Properties.
 Overview Hive has become significantly faster thanks to various features and 
improvements that were built by the community in recent years, including Tez 
and Cost-based-optimization.</description></item><item><title>Apache Hive : 
Locking</title><link>https://hive.apache.org/docs/latest/locking_27362050/</link><pubDate>Thu,
 12 Dec 2024 00:00:00 
+0000</pubDate><guid>https://hive.apache.org/docs/latest/locking_27362050/</guid><description>Apache
 Hive : Locking Hive Concurrency Model Hive Co [...]
-Managed (Internal) External Introduction This document lists some of the 
differences between the two but the fundamental difference is that Hive assumes 
that it owns the data for managed tables. That means that the data, its 
properties and data layout will and can only be changed via Hive command. The 
data still lives in a normal file system and nothing is stopping you from 
changing it without telling Hive about 
it.</description></item><item><title>Apache Hive : Manual Installation</titl 
[...]
+Managed (Internal) External Introduction This document lists some of the 
differences between the two but the fundamental difference is that Hive assumes 
that it owns the data for managed tables. That means that the data, its 
properties and data layout will and can only be changed via Hive command. The 
data still lives in a normal file system and nothing is stopping you from 
changing it without telling Hive about 
it.</description></item><item><title>Apache Hive : Manual Installation</titl 
[...]
 1. Map Join Optimization 1.1 Using Distributed Cache to Propagate Hashtable 
File 1.2 Removing JDBM 1.3 Performance Evaluation 2. Converting Join into Map 
Join Automatically 2.1 New Join Execution Flow 2.2 Resolving the Join Operation 
at Run Time 2.3 Backup Task 2.4 Performance Evaluation 1. Map Join Optimization 
1.1 Using Distributed Cache to Propagate Hashtable File Previously, when 2 
large data tables need to do a join, there will be 2 different Mappers to sort 
these tables based on th [...]
 Materialized views support is introduced in Hive 3.0.0.
 Introduction This page documents the work done for the supporting materialized 
views in Apache Hive.</description></item><item><title>Apache Hive : 
Materialized views in 
Hive</title><link>https://hive.apache.org/docs/latest/materialized-views-in-hive_283118346/</link><pubDate>Thu,
 12 Dec 2024 00:00:00 
+0000</pubDate><guid>https://hive.apache.org/docs/latest/materialized-views-in-hive_283118346/</guid><description>Apache
 Hive : Materialized views in Hive Objectives Traditionally, one of t [...]
diff --git a/docs/latest/manual-installation_283118363/index.html 
b/docs/latest/manual-installation_283118363/index.html
index e2cbd2f..f13f4e2 100644
--- a/docs/latest/manual-installation_283118363/index.html
+++ b/docs/latest/manual-installation_283118363/index.html
@@ -131,6 +131,7 @@ ASF
 <li><a href=#extra-hadoop-configurations-to-make-everything-working>Extra 
hadoop configurations to make everything working</a></li>
 <li><a href=#installing-hive-from-a-tarball>Installing Hive from a 
Tarball</a></li>
 <li><a href=#installing-from-source-code>Installing from Source Code</a></li>
+<li><a 
href=#installing-with-old-version-hadoop(greater-than-or-equal-3.1.0)>Installing
 with old version hadoop(greater than or equal 3.1.0)</a></li>
 <li><a href=#next-steps>Next Steps</a></li>
 <li><a href=#beeline-cli>Beeline CLI</a></li>
 <li><a href=#hive-metastore>Hive Metastore</a></li>
@@ -354,6 +355,68 @@ insert into test values ('Hello, from Hive!');
 </ul>
 <p>That directory should contain all the files necessary to run Hive. You can 
run it from there or copy it to a different location, if you prefer.</p>
 <p>From now, you can follow the steps described in the section Installing Hive 
from a Tarball</p>
+<h2 id=installing-with-old-version-hadoopgreater-than-or-equal-310>Installing 
with old version hadoop(greater than or equal 3.1.0)</h2>
+<p>Although we normally require hive4 to rely on a
+hadoop 3.3.6+ cluster environment.
+However, in practice, in an ON YARN environment,
+we can package all the hadoop related dependencies into
+tez&hive so that they do not need to rely on the lib
+of the original hadoop cluster environment at runtime.
+In this way, we can run HIVE4 in a lower version of hadoop,
+provided that the base APIs of the hadoop 3.x series are common to
+each other.</p>
+<p>The steps are as follows:</p>
+<p>1.Compile TEZ to get tez.tar.gz which contains all hadoop related 
dependencies(not tez minimal tarball),
+run <code>mvn clean install -DskipTests=true -Dmaven.javadoc.skip=true -Pdist 
-Paws -Pazure</code>.
+For more detail,see:<code>https://tez.apache.org/install.html</code>.
+After compiling to get tez.tar.gz, users should set the following properties 
in tez-site.xml:</p>
+<div class=highlight><pre tabindex=0 
style=color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4><code
 class=language-xml data-lang=xml>    <span 
style=color:#f92672>&lt;property&gt;</span>
+        <span style=color:#f92672>&lt;name&gt;</span>tez.lib.uris<span 
style=color:#f92672>&lt;/name&gt;</span><span 
style=color:#75715e>&lt;!--Example, replace with actual hdfs path--&gt;</span>
+        <span 
style=color:#f92672>&lt;value&gt;</span>/apps/apache-tez-0.10.4-bin.tar.gz<span 
style=color:#f92672>&lt;/value&gt;</span>
+    <span style=color:#f92672>&lt;/property&gt;</span>
+    <span style=color:#f92672>&lt;property&gt;</span>
+        <span 
style=color:#f92672>&lt;name&gt;</span>tez.lib.uris.classpath<span 
style=color:#f92672>&lt;/name&gt;</span> <span style=color:#75715e>&lt;!--only 
use tez self lib,do not use any old version hadoop cluster&#39;s 
lib--&gt;</span>
+       <span 
style=color:#f92672>&lt;value&gt;</span>$PWD/tezlib/*,$PWD/tezlib/lib/*<span 
style=color:#f92672>&lt;/value&gt;</span>
+    <span style=color:#f92672>&lt;/property&gt;</span>
+    <span style=color:#f92672>&lt;property&gt;</span>
+        <span 
style=color:#f92672>&lt;name&gt;</span>tez.use.cluster.hadoop-libs<span 
style=color:#f92672>&lt;/name&gt;</span><span style=color:#75715e>&lt;!--only 
use tez self lib,do not use any old version hadoop cluster&#39;s 
lib--&gt;</span>
+        <span style=color:#f92672>&lt;value&gt;</span>false<span 
style=color:#f92672>&lt;/value&gt;</span>
+    <span style=color:#f92672>&lt;/property&gt;</span>
+
+    <span style=color:#f92672>&lt;property&gt;</span>
+        <span style=color:#f92672>&lt;name&gt;</span>tez.am.launch.env<span 
style=color:#f92672>&lt;/name&gt;</span><span 
style=color:#75715e>&lt;!--Example, replace with actual native-lib install 
path.Reuse old version hadoop cluster&#39;s native lib is ok.--&gt;</span>
+        <span 
style=color:#f92672>&lt;value&gt;</span>LD_LIBRARY_PATH=/usr/hadoop/3.1.0/hadoop/lib/native<span
 style=color:#f92672>&lt;/value&gt;</span>
+        <span style=color:#f92672>&lt;description&gt;</span>Users can set up 
environment variables individually, including but not limited to: JAVA_HOME, 
LD_LIBRARY_PATH.<span style=color:#f92672>&lt;/description&gt;</span>
+    <span style=color:#f92672>&lt;/property&gt;</span>
+    
+    <span style=color:#f92672>&lt;property&gt;</span>
+        <span style=color:#f92672>&lt;name&gt;</span>tez.task.launch.env<span 
style=color:#f92672>&lt;/name&gt;</span><span 
style=color:#75715e>&lt;!--Example, replace with actual native-lib install 
path.Reuse old version hadoop cluster&#39;s native lib is ok.--&gt;</span>
+        <span 
style=color:#f92672>&lt;value&gt;</span>LD_LIBRARY_PATH=/usr/hadoop/3.1.0/hadoop/lib/native<span
 style=color:#f92672>&lt;/value&gt;</span>
+        <span style=color:#f92672>&lt;description&gt;</span>Users can set up 
environment variables individually, including but not limited to: JAVA_HOME, 
LD_LIBRARY_PATH.<span style=color:#f92672>&lt;/description&gt;</span>
+    <span style=color:#f92672>&lt;/property&gt;</span>
+</code></pre></div><p>2.Upload tez to the specified HDFS path in 
<code>tez.lib.uris</code>.(Please remember, do not use the minimal tarball for 
installation.)</p>
+<div class=highlight><pre tabindex=0 
style=color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4><code
 class=language-shell data-lang=shell><span style=color:#75715e>## DO not 
upload minimal tarball !!!</span>
+<span style=color:#f92672>[</span>hadoop@hive opt<span 
style=color:#f92672>]</span><span style=color:#75715e># hdfs dfs -put 
apache-tez-0.10.4-bin.tar.gz /apps/</span>
+</code></pre></div><p>3.Download the high version of the Hadoop package(Please 
ensure that the HADOOP version on which TEZ depends is the same as the HADOOP 
version you have downloaded.).Unzip HIVE, HADOOP, and TEZ all in the 
installation path.</p>
+<div class=highlight><pre tabindex=0 
style=color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4><code
 class=language-shell data-lang=shell><span style=color:#75715e>## In this 
example, we have installed HIVE-4.0.1 and TEZ-0.10.4 on an Hadoop 3.1.0 
cluster.users should install HIVE,HADOOP and TEZ into actual directories.</span>
+<span style=color:#f92672>[</span>hadoop@hive opt<span 
style=color:#f92672>]</span><span style=color:#75715e># cd /opt</span>
+<span style=color:#f92672>[</span>hadoop@hive opt<span 
style=color:#f92672>]</span><span style=color:#75715e># ll</span>
+drwxr-xr-x <span style=color:#ae81ff>11</span> hive hadoop      <span 
style=color:#ae81ff>4096</span> Nov  <span style=color:#ae81ff>7</span> 13:59 
apache-hive-4.0.1-bin
+drwxr-xr-x  <span style=color:#ae81ff>3</span> hive hadoop      <span 
style=color:#ae81ff>4096</span> Nov  <span style=color:#ae81ff>7</span> 13:59 
apache-tez-0.10.4-bin
+drwxr-xr-x <span style=color:#ae81ff>10</span> hive hadoop      <span 
style=color:#ae81ff>4096</span> Nov  <span style=color:#ae81ff>7</span> 13:59 
hadoop-3.3.6
+lrwxrwxrwx  <span style=color:#ae81ff>1</span> hive hadoop        <span 
style=color:#ae81ff>30</span> Nov  <span style=color:#ae81ff>7</span> 13:59 
hive-4.0.1 -&gt; apache-hive-4.0.1-bin
+lrwxrwxrwx  <span style=color:#ae81ff>1</span> hive hadoop        <span 
style=color:#ae81ff>21</span> Nov  <span style=color:#ae81ff>7</span> 13:59 tez 
-&gt; apache-tez-0.10.4-bin
+</code></pre></div><p>edit <code>hive-env.sh</code></p>
+<div class=highlight><pre tabindex=0 
style=color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4><code
 class=language-shell data-lang=shell><span style=color:#75715e># Set 
HADOOP_HOME to point to a specific hadoop install directory</span>
+HADOOP_HOME<span style=color:#f92672>=</span><span 
style=color:#e6db74>${</span>HADOOP_HOME<span 
style=color:#66d9ef>:-</span>/opt/hadoop-3.3.6<span style=color:#e6db74>}</span>
+export HIVE_HOME<span style=color:#f92672>=</span><span 
style=color:#e6db74>${</span>HIVE_HOME<span 
style=color:#66d9ef>:-</span>/opt/hive-4.0.1<span style=color:#e6db74>}</span>
+export TEZ_HOME<span style=color:#f92672>=</span>/opt/tez
+</code></pre></div><p>Copy old version hadoop conf into hadoop3.3.6+:</p>
+<div class=highlight><pre tabindex=0 
style=color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4><code
 class=language-shell data-lang=shell>cp /usr/hadoop/3.1.0/hadoop/conf/*  
/opt/hadoop3.3.6/conf/
+</code></pre></div><p>Put <code>tez-site.xml</code> into hive conf dir:</p>
+<div class=highlight><pre tabindex=0 
style=color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4><code
 class=language-shell data-lang=shell>mv tez-site.xml  /opt/hive-4.0.1/conf/
+</code></pre></div><p>After completing the above steps, users should be able 
to start the HMS service and HS2 service normally, and submit TEZ computing 
tasks without any issues.</p>
+<p>Through the above steps, we can run Hive4+tez in any Hadoop3 environment. 
Users do not need to upgrade the cluster&rsquo;s original hive/hadoop/tez.</p>
 <h2 id=next-steps>Next Steps</h2>
 <p>You can begin using Hive as soon as it is installed, it should be work on 
you computer. There are some extra information in the following sections.</p>
 <h2 id=beeline-cli>Beeline CLI</h2>
diff --git a/index.xml b/index.xml
index 6c2ad82..ad3ace9 100644
--- a/index.xml
+++ b/index.xml
@@ -207,7 +207,7 @@ UDFs xpath, xpath_short, xpath_int, xpath_long, 
xpath_float, xpath_double, xpath
 Type Postfix Example TINYINT Y 100Y SMALLINT S 100S BIGINT L 100L String types 
String literals can be expressed with either single quotes (') or double quotes 
(&amp;quot;).</description></item><item><title>Apache Hive : 
LLAP</title><link>https://hive.apache.org/docs/latest/llap_62689557/</link><pubDate>Thu,
 12 Dec 2024 00:00:00 
+0000</pubDate><guid>https://hive.apache.org/docs/latest/llap_62689557/</guid><description>Apache
 Hive : LLAP Overview Persistent Daemon Execution Engine Query Fr [...]
 For configuration of LLAP, see the LLAP Section of Configuration Properties.
 Overview Hive has become significantly faster thanks to various features and 
improvements that were built by the community in recent years, including Tez 
and Cost-based-optimization.</description></item><item><title>Apache Hive : 
Locking</title><link>https://hive.apache.org/docs/latest/locking_27362050/</link><pubDate>Thu,
 12 Dec 2024 00:00:00 
+0000</pubDate><guid>https://hive.apache.org/docs/latest/locking_27362050/</guid><description>Apache
 Hive : Locking Hive Concurrency Model Hive Co [...]
-Managed (Internal) External Introduction This document lists some of the 
differences between the two but the fundamental difference is that Hive assumes 
that it owns the data for managed tables. That means that the data, its 
properties and data layout will and can only be changed via Hive command. The 
data still lives in a normal file system and nothing is stopping you from 
changing it without telling Hive about 
it.</description></item><item><title>Apache Hive : Manual Installation</titl 
[...]
+Managed (Internal) External Introduction This document lists some of the 
differences between the two but the fundamental difference is that Hive assumes 
that it owns the data for managed tables. That means that the data, its 
properties and data layout will and can only be changed via Hive command. The 
data still lives in a normal file system and nothing is stopping you from 
changing it without telling Hive about 
it.</description></item><item><title>Apache Hive : Manual Installation</titl 
[...]
 1. Map Join Optimization 1.1 Using Distributed Cache to Propagate Hashtable 
File 1.2 Removing JDBM 1.3 Performance Evaluation 2. Converting Join into Map 
Join Automatically 2.1 New Join Execution Flow 2.2 Resolving the Join Operation 
at Run Time 2.3 Backup Task 2.4 Performance Evaluation 1. Map Join Optimization 
1.1 Using Distributed Cache to Propagate Hashtable File Previously, when 2 
large data tables need to do a join, there will be 2 different Mappers to sort 
these tables based on th [...]
 Materialized views support is introduced in Hive 3.0.0.
 Introduction This page documents the work done for the supporting materialized 
views in Apache Hive.</description></item><item><title>Apache Hive : 
Materialized views in 
Hive</title><link>https://hive.apache.org/docs/latest/materialized-views-in-hive_283118346/</link><pubDate>Thu,
 12 Dec 2024 00:00:00 
+0000</pubDate><guid>https://hive.apache.org/docs/latest/materialized-views-in-hive_283118346/</guid><description>Apache
 Hive : Materialized views in Hive Objectives Traditionally, one of t [...]

Reply via email to