[jira] [Comment Edited] (FLINK-23449) YarnTaskExecutorRunner does not contains MapReduce classes

Kai Chen (Jira) Sun, 01 Aug 2021 23:48:08 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-23449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17391381#comment-17391381
 ]


Kai Chen edited comment on FLINK-23449 at 8/2/21, 6:47 AM:
-----------------------------------------------------------

[~arvid] I checked https://issues.apache.org/jira/browse/FLINK-21319 and I 
found these solutions which is also mentioned in 
http://apache-flink.147419.n8.nabble.com/Flink-td7866.html:
1. Manually add 
MRJobConfig.DEFAULT_MAPREDUCE_APPLICATION_CLASSPATH("$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/\*,$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/\*")
 to Yarn configuration file under yarn.application.classpath
2.Manually add hadoop-mapreduce-client-core.jar to user code jar.

Flink Hive connector can work with above solutions but it is not very 
convenient to use (as spark hive). I think we should at least add these 2 
solutions to the document.


was (Author: yuchuanchen):
[~arvid] I checked https://issues.apache.org/jira/browse/FLINK-21319 and I 
found these solutions which is also mentioned in 
http://apache-flink.147419.n8.nabble.com/Flink-td7866.html:
1. Manually add 
MRJobConfig.DEFAULT_MAPREDUCE_APPLICATION_CLASSPATH("$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*,$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*")
 to Yarn configuration file under yarn.application.classpath
2.Manually add hadoop-mapreduce-client-core.jar to user code jar.

Flink Hive connector can work with above solutions but it is not very 
convenient to use (as spark hive). I think we should at least add these 2 
solutions to the document.

> YarnTaskExecutorRunner does  not contains MapReduce classes
> -----------------------------------------------------------
>
>                 Key: FLINK-23449
>                 URL: https://issues.apache.org/jira/browse/FLINK-23449
>             Project: Flink
>          Issue Type: Bug
>          Components: Connectors / Hive, Deployment / YARN
>    Affects Versions: 1.11.3
>         Environment: flink-1.11
> flink on yarn cluster
> jdk1.8
> hive1.2.1
> hadoop2.7
> hadoop classes is provided with  {{export HADOOP_CLASSPATH=`hadoop classpath` 
> when submitting test APP. （described in 
> [https://ci.apache.org/projects/flink/flink-docs-release-1.11/ops/deployment/hadoop.html]
>  ）}}
> {{}}
>            Reporter: Kai Chen
>            Priority: Major
>              Labels: pull-request-available
>
> I followed instructions described in 
> [https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/table/hive] 
> and tested hive streaming sink, met this exception  
>       Caused by: java.lang.ClassNotFoundException: 
> org.apache.hadoop.mapred.JobConf
> [http://apache-flink.147419.n8.nabble.com/Flink-td7866.html] met the same 
> problem.
>  
> I checked TM jvm envs and the code and found that flink only set up 
> YARN_APPLICATION_CLASSPATH, but without MAPREDUCE_APPLICATION_CLASSPATH.
> See: 
> [https://github.com/apache/flink/blob/ed39fb2efc790af038c1babd4a48847b7b39f91e/flink-yarn/src/main/java/org/apache/flink/yarn/Utils.java#L119]
>  
> I think we should add MAPREDUCE_APPLICATION_CLASSPATH as well, as the same as 
> spark does.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Comment Edited] (FLINK-23449) YarnTaskExecutorRunner does not contains MapReduce classes

Reply via email to