Hello, Glad to see that you have resolved the issues. BTW, Are you interested in contribute the ideas (described the solution) in our JIRA? We have a JIRA about testing the compatibility of drill on CDH. Also welcome to join our Slack channel:
https://bit.ly/3t4rozO > 在 2021年3月26日,16:16,Mehmet - <[email protected]> 写道: > > Hi, > We have just solved this problem. I would like to talk about our solution > to contribute to the community. Maybe other friends may face the same > problem. > We noticed the reason for that problem is trying to connect CDH's HDFS > without client deployment. So, firstly we deployed our Hadoop client files > into /opt/apache-drill/conf folder. > Then we were able to query data without block error and connect hdfs > namenode with name service name (as high available). > > F.y.i. > Have a nice day. > > Apache <[email protected]>, 4 Mar 2021 Per, 18:34 tarihinde şunu yazdı: > >> Hi, >> I think the solution to this problem depends on your initiative. Because I >> planned to download CDH 6.3.3, but it seems that Cloudera have requested a >> license from February 1. Orz... >> So, Is it possible that you are using mini cluster (installed HDFS roles >> service only) to confirm that drill can running based on 6.3.3. thanks >>>> 在 2021年3月3日,15:30,Mehmet - <[email protected]> 写道: >>> Hi, >>> I don't have any plan yet but I was trying it on our Dev cluster, I'm >>> thinking of trying it on the prod cluster as well. >>> Btw, you're right, as far as I see in drillbit.log and >>> "jars/3rdparty/hadoop-client-3.2.1.jar" folder, I have been using >>> hadoop-client-3.2.1.jar (My Hadoop version is 3.0-cdh6.3.3). Actually >>> Drill's client is higher than mine. I think there shouldn't be a problem. >>> What do you think? >>> Thank you, >>> BR. >>> luoc <[email protected]>, 1 Mar 2021 Pzt, 13:18 tarihinde şunu yazdı: >>>> Hi, >>>> I am so sorry to hear that. Do you have any plans? And, have you used >>>> hadoop-client-3.2.1.jar to connect the cluster directly? because of the >>>> drill include the hadoop version is 3.2.1. >>>>> 2021年2月28日 上午4:28,Mehmet - <[email protected]> 写道: >>>>> Hi, >>>>> Yes, you're right and I agree with you. Regarding your last comment; I >>>> can get and read any file from HDFS via terminal. I have attached the >>>> ScreenShot. >>>>> Thank you. >>>>> 27 Şub 2021 Cmt, saat 05:58 tarihinde luoc <[email protected] <mailto: >>>> [email protected]>> şunu yazdı: >>>>> Hi, >>>>> Drill 1.18 works well on CDH 5.13. the difference is that I did not >>>> enable krb, then drill is supported for hadoop 3.x. (CDH 5.x based on >>>> hadoop 2.6, 6.x based on hadoop 3.x) >>>>> May the `list` commands only need to connect NameNode for get metadata >>>> (without DataNode), So recommend that you using DFS client to connect >> the >>>> cluster and test whether the file can be read. Because of the log point >> out >>>> the dfs client have a problem on read file. >>>>> Don’t paste the image for apache email, this is not supported. as an >>>> attachment file is a simple way. >>>>>> 2021年2月27日 上午12:10,Mehmet - <[email protected] <mailto: >>>> [email protected]>> 写道: >>>>>> Hi, >>>>>> Stack Trace: https://paste.ubuntu.com/p/nFvygSpcjy/ < >>>> https://paste.ubuntu.com/p/nFvygSpcjy/> < >>>> https://paste.ubuntu.com/p/nFvygSpcjy/ < >>>> https://paste.ubuntu.com/p/nFvygSpcjy/>> >>>>>> Yes DataNode port is opening and accessible and any firewall problem >>>> is impossible because the Drill was established on the same nodes with >>>> Cloudera cluster. >>>>>> I consider there is no problem with authentication. Because I can list >>>> the hdfs folder clearly as below: >>>>>> Via Terminal (Note: I have used the same kerberos user that I set >>>> within Drill's jaas.conf folder. ); >>>>>> Thank you. >>>>>> BR. >>>>>> luoc <[email protected] <mailto:[email protected]> <mailto: >> [email protected] >>>> <mailto:[email protected]>>>, 26 Şub 2021 Cum, 17:58 tarihinde şunu >> yazdı: >>>>>> Hi, >>>>>> The storage config is correct. then enabled the Kerberos security. >>>> So, please check the java stack trace to ensure not the authentication >>>> problem. >>>>>> Is it possible to use dfs client connect the HDFS for reading the >>>> csv file? >>>>>> The DataNode port is opening and accessible? >>>>>>> 2021年2月26日 下午9:52,Mehmet - <[email protected] <mailto: >>>> [email protected]> <mailto:[email protected] <mailto: >>>> [email protected]>>> 写道: >>>>>>> Hi, >>>>>>> 1. Drill version: 1.18.0 >>>>>>> 2. HDFS Version: Hadoop 3.0-cdh6.3.3 >>>>>>> 3. Storage config: https://paste.ubuntu.com/p/5Dk9jVCxYr/ < >>>> https://paste.ubuntu.com/p/5Dk9jVCxYr/> < >>>> https://paste.ubuntu.com/p/5Dk9jVCxYr/ < >>>> https://paste.ubuntu.com/p/5Dk9jVCxYr/>> >>>>>>> 4. drill-env.sh file: https://paste.ubuntu.com/p/MGNG4zhbrk/ < >>>> https://paste.ubuntu.com/p/MGNG4zhbrk/> < >>>> https://paste.ubuntu.com/p/MGNG4zhbrk/ < >>>> https://paste.ubuntu.com/p/MGNG4zhbrk/>> >>>>>>> Thank you. >>>>>>> BR. >>>>>>> luoc <[email protected] <mailto:[email protected]> <mailto: >>>> [email protected] <mailto:[email protected]>>>, 26 Şub 2021 Cum, 16:14 >>>> tarihinde şunu yazdı: >>>>>>>> Hi, >>>>>>>> That does not seem like an issues with Drill. >>>>>>>> Would you please provides more helpful information : >>>>>>>> 1. Drill version >>>>>>>> 2. HDFS version >>>>>>>> 3. Storage config >>>>>>>>> 2021年2月26日 下午3:32,Mehmet - <[email protected] <mailto: >>>> [email protected]> <mailto:[email protected] <mailto: >>>> [email protected]>>> 写道: >>>>>>>>> Hi Team, >>>>>>>>> I have a problem with Hdfs query on Drill. When I run a "SHOW >>>> FILES in >>>>>>>>> root.`tmp/` ", I can list the files correctly. >>>>>>>>> Bu when I run a select query like "Select * from root.`tmp/` it >>>> throws >>>>>>>>> below error. >>>>>>>>> Notes: >>>>>>>>> - I have already checked the state of hdfs health(via dfsadmin and >>>> hdfs >>>>>>>> ui) >>>>>>>>> and there is no any corruption or block error. >>>>>>>>> - Drillbits are on the same cluster with Hadoop. So I think any >>>> network >>>>>>>>> problem is impossible. >>>>>>>>> - I have also set dfs.client.use.datanode.hostname as true ( >>>>>>>>> https://stackoverflow.com/a/55290406/7894534 < >>>> https://stackoverflow.com/a/55290406/7894534> < >>>> https://stackoverflow.com/a/55290406/7894534 < >>>> https://stackoverflow.com/a/55290406/7894534>> ) >>>>>>>>> org.apache.drill.common.exceptions.UserRemoteException: DATA_READ >>>> ERROR: >>>>>>>>> Could not obtain block: BP-2026912985-<namenode_ip>- >>>>>>>>> 1569935018133:blk_1073842201_101390 file=/tmp/2015-summary.csv >>>>>>>>> File Path: hdfs://<drillbit_ip>:8020/tmp/2015-summary.csv >>>>>>>>> Fragment: 0:0 [Error Id: 466835bd-6512-4854-b231-eaa439eba6f2 on >>>>>>>>> <drillbit_ip>:31010] >>>>>>>>> Thank you. >>>>>>>>> -- >>>>>>>>> Mehmet ERSOY >>>>>>> -- >>>>>>> Mehmet ERSOY >>>>>> -- >>>>>> Mehmet ERSOY >>> -- >>> Mehmet ERSOY > > -- > Mehmet ERSOY
