Diksha Chaturvedi created DRILL-8439:
----------------------------------------

             Summary: Getting col__ prefix for columns that are not special 
when extractHeader is enabled
                 Key: DRILL-8439
                 URL: https://issues.apache.org/jira/browse/DRILL-8439
             Project: Apache Drill
          Issue Type: Bug
          Components: Metadata, SQL Parser
    Affects Versions: 1.21.0
         Environment: I've enabled extractHeader in the csv config of dfs 
plugin.
Apache Drill version: apache-drill-1.21.0
No. of drillbits: Single
OS: Windows
            Reporter: Diksha Chaturvedi


As per the documentation, Drill prepends col_ to column names that start 
with a number or a special character.

 

/**
 * Prefix used to replace non-alphabetic characters at the start of
 * a column name. For example, $foo becomes col_foo. Used
 * because SQL does not allow _foo.
 */
public static final String COLUMN_PREFIX = "col_";

 

But in my case the prefix is added even to a column name that is entirely 
alphabetic.
----
I have the following data in the CSV file,
||PRODUCTID||PRODUCTNAME||SUPPLIERID||CATEGORYID||UNIT||PRICE||
|1|Chais|1|1|10 boxes x 20 bags|18|
|2|Chang|1|1|24 - 12 oz bottles|19|
|3|Aniseed Syrup|1|2|12 - 550 ml bottles|10|
|4|Chef Anton's Cajun Seasoning|2|2|48 - 6 oz jars|22|
|5|Chef Anton's Gumbo Mix|2|2|36 boxes|21.35|

 

When I query the CSV file with the following query:

{{SELECT * FROM dfs.`/var/lib/PRODUCT.csv`}}

The output is 
[!https://i.stack.imgur.com/FBNmn.png|width=770,height=164!|https://i.stack.imgur.com/FBNmn.png]

 

I know about the other renaming rules, such as:

{{#UNITS}} is changed to {{col_UNITS}}

{{FINANCIAL$RECORD}} is changed to {{FINANCIAL_RECORD}}


But what about {{PRODUCTID}}? In this case an extra underscore has also 
been appended at the end.
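
For reference, the renaming rules described above can be sketched in a few lines of Java. This is only a minimal illustration of the documented behaviour as I understand it, not Drill's actual code: the class name ColumnNameSketch and the method normalize are hypothetical, and the handling of a leading character is a literal reading of the Javadoc quoted earlier (the leading non-alphabetic character is replaced by the prefix).

```java
public class ColumnNameSketch {
    // Same value as the COLUMN_PREFIX constant quoted in this report.
    static final String COLUMN_PREFIX = "col_";

    // Hypothetical helper illustrating the rules described in this report;
    // it is NOT taken from the Drill source.
    static String normalize(String header) {
        if (header.isEmpty()) {
            return header;
        }
        StringBuilder out = new StringBuilder();
        char first = header.charAt(0);
        if (Character.isLetter(first)) {
            // An alphabetic first character is kept as-is.
            out.append(first);
        } else {
            // Assumption: a non-alphabetic first character is replaced by the prefix.
            out.append(COLUMN_PREFIX);
        }
        for (int i = 1; i < header.length(); i++) {
            char c = header.charAt(i);
            // Special characters inside the name become underscores.
            out.append(Character.isLetterOrDigit(c) ? c : '_');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String[] headers = {"#UNITS", "FINANCIAL$RECORD", "$foo", "PRODUCTID"};
        for (String h : headers) {
            System.out.println(h + " -> " + normalize(h));
        }
    }
}
```

Under these assumed rules, {{PRODUCTID}} passes through unchanged, which is the result I expected from Drill.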



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
