[
https://issues.apache.org/jira/browse/HIVE-24177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xiaohang.li updated HIVE-24177:
-------------------------------
Description:
When I use a map join, Hive fails with "Execution failed with exit status: 2".
Below is my test case:
CREATE TABLE `fact`( `id` int, `name` string) ROW
FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH
SERDEPROPERTIES ( 'field.delim'='\t', 'serialization.format'='\t') ;
vi fact.txt
1 lee
2 lee2
load data local inpath 'fact.txt' into table fact;
create table dim_table( id int, name string)ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH SERDEPROPERTIES (
'field.delim'='\t', 'serialization.format'='\t');
vi dim.txt
1 lee
2 test
3 test3
4 test4
5 test5
6 test6
7 test
8 test
9 test
10 test
load data local inpath 'dim.txt' into table dim_table;
Then I create a lower-case UDF:
add jar udfdemo-1.0.jar;
create temporary function mylower as 'com.test.hive.Lower';
select *
from(
select id,name,mylower00001(name) user_id
from fact
)t1
join dim_table t2
on t2.name='test';
The error is shown below:
2020-09-18 14:39:16 Starting to launch local task to process map join; maximum memory = 14913372160
2020-09-18 14:39:16 Starting to launch local task to process map join; maximum memory = 14913372160
Execution failed with exit status: 2
Obtaining error information
Task failed!
Task ID: Stage-4
The log4j log shows the following exception:
java.lang.RuntimeException: java.lang.ClassNotFoundException:
com.test.hive.Lower
was:
When I use a map join, Hive fails with "Execution failed with exit status: 2".
The following is my test demo:
CREATE TABLE `fact`( `id` int, `name` string) ROW
FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH
SERDEPROPERTIES ( 'field.delim'='\t', 'serialization.format'='\t') ;
vi fact.txt
1 lee
2 lee2
load data local inpath 'fact.txt' into table fact;
create table dim_table( id int, name string)ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH SERDEPROPERTIES (
'field.delim'='\t', 'serialization.format'='\t');
vi dim.txt
1 lee
2 test
3 test3
4 test4
5 test5
6 test6
7 test
8 test
9 test
10 test
load data local inpath 'dim.txt' into table dim_table;
Then I create a lower-case UDF:
add jar udfdemo-1.0.jar;
create temporary function mylower as 'com.test.hive.Lower';
select *
from(
select id,name,mylower00001(name) user_id
from fact
)t1
join dim_table t2
on t2.name='test';
The error is shown below:
2020-09-18 14:39:16 Starting to launch local task to process map join; maximum memory = 14913372160
2020-09-18 14:39:16 Starting to launch local task to process map join; maximum memory = 14913372160
Execution failed with exit status: 2
Obtaining error information
Task failed!
Task ID: Stage-4
The log4j log shows the following exception:
java.lang.RuntimeException: java.lang.ClassNotFoundException:
com.test.hive.Lower
> hive mapjoin throws udf class not found
> ---------------------------------------
>
> Key: HIVE-24177
> URL: https://issues.apache.org/jira/browse/HIVE-24177
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Affects Versions: 2.2.0, 2.3.6
> Reporter: xiaohang.li
> Priority: Major
>
> When I use a map join, Hive fails with "Execution failed with exit status: 2".
> Below is my test case:
> CREATE TABLE `fact`( `id` int, `name` string) ROW
> FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH
> SERDEPROPERTIES ( 'field.delim'='\t', 'serialization.format'='\t') ;
> vi fact.txt
> 1 lee
> 2 lee2
> load data local inpath 'fact.txt' into table fact;
>
> create table dim_table( id int, name string)ROW FORMAT SERDE
> 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH SERDEPROPERTIES (
> 'field.delim'='\t', 'serialization.format'='\t');
> vi dim.txt
> 1 lee
> 2 test
> 3 test3
> 4 test4
> 5 test5
> 6 test6
> 7 test
> 8 test
> 9 test
> 10 test
> load data local inpath 'dim.txt' into table dim_table;
> Then I create a lower-case UDF:
> add jar udfdemo-1.0.jar;
> create temporary function mylower as 'com.test.hive.Lower';
> select *
> from(
> select id,name,mylower00001(name) user_id
> from fact
> )t1
> join dim_table t2
> on t2.name='test';
>
> The error is shown below:
> 2020-09-18 14:39:16 Starting to launch local task to process map join; maximum memory = 14913372160
> 2020-09-18 14:39:16 Starting to launch local task to process map join; maximum memory = 14913372160
> Execution failed with exit status: 2
> Obtaining error information
> Task failed!
> Task ID: Stage-4
> The log4j log shows the following exception:
> java.lang.RuntimeException: java.lang.ClassNotFoundException:
> com.test.hive.Lower
--
This message was sent by Atlassian Jira
(v8.3.4#803005)