[
https://issues.apache.org/jira/browse/DRILL-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16382316#comment-16382316
]
ASF GitHub Bot commented on DRILL-4120:
---------------------------------------
Github user paul-rogers commented on the issue:
https://github.com/apache/drill/pull/1138
Another thought. The removed code is at plan time. Did the original code
have to open each file to retrieve schema? If so, does removing the code remove
that load? If so, then this change could be a huge performance improvement if
avoids the need to open every file in the Foreman.
Then, the the next question is: do we actually do anything with the
plan-time type information? Few files have that information. Given that, does
the planner actually use the information? Is this something we get for free
from Calcite? If we are not using the type information at plan time, then
clearly there is no harm in removing the code that retrieves the type
information.
> dir0 does not work when the directory structure contains Avro files
> -------------------------------------------------------------------
>
> Key: DRILL-4120
> URL: https://issues.apache.org/jira/browse/DRILL-4120
> Project: Apache Drill
> Issue Type: Bug
> Components: Query Planning & Optimization
> Affects Versions: 1.3.0
> Reporter: Stefán Baxter
> Assignee: Volodymyr Vysotskyi
> Priority: Major
> Fix For: 1.13.0
>
> Attachments:
> 0001-DRILL-4120-Support-reading-directories-having-avro-f.patch
>
>
> Any select statment containing dirN fails if the target directory structure
> contains Avro files.
> Steps to test:
> 1. create a simple directory structure
> 2. copy an avro file into each directory
> 3. execute a query containing dir0
> outcome:
> Error: VALIDATION ERROR: From line 1, column 117 to line 1, column 120:
> Column 'dir0' not found in any table
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)