[
https://issues.apache.org/jira/browse/DRILL-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14509550#comment-14509550
]
Alexander Zarei commented on DRILL-2767:
----------------------------------------
Update:
When I convert the data to optimized ORC format, I can query all tables back to
back without letting the cluster rest for a bit. That probably suggests there
is different between reading ORC and Text from Hive storage to Drill which we
can focus on to find the root cause.
> Fragment error on TPCH Scale Factor 30 on a query that completed successfully
> previously
> ----------------------------------------------------------------------------------------
>
> Key: DRILL-2767
> URL: https://issues.apache.org/jira/browse/DRILL-2767
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Hive
> Affects Versions: 0.8.0
> Environment: AWS EMR cluster of three m1.xlarge nodes
> Reporter: Alexander Zarei
> Assignee: Venki Korukanti
> Attachments: drillbitcore1.log, drillbitcore1.out, drillbitcore2.log,
> drillbitcore2.out, drillbitmaster.out, lineitem table schema .png,
> second-set-core-1-drillbit.log, second-set-core-2-drillbit.log
>
>
> The following sequence led to the error:
> Executed the query
> bq. SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`
> and it took about 43 minutes to execute successfully.
> After ward I ran the query
> bq. SELECT * FROM `realhive`.`tpch_text_2`.`lineitem`
> for 6 times to find an optimization value for the ODBC driver.
> Afterward, I submitted the first query again
> bq. SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`
>
> and the Drill Cluster returned a fragment error.
> bq. ***[HY000]: [MapR][Drill] (1040) Drill failed to execute the query:
> SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`[30024]Query execution
> error. Details:[RemoteRpcException: Failure while running fragment.[
> fb97e7be-d09e-46fe-8728-9577fd0d8795 on ip-10-12-62-65
> Log files with debug level for the Drillbits on the master node as well as
> the core nodes of the cluster are attached.
> Also the connection through the ODBC driver on Linux 32 bit was "Direct" to
> the drillbit on the master node of the Hadoop cluster.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)