Hi Arina, 
It doesn't seem to make a difference unfortunately. :-(
--C 

> On Jun 24, 2019, at 3:09 PM, Arina Yelchiyeva <[email protected]> 
> wrote:
> 
> Hi Charles,
> 
> Please try with v3 reader enabled: set `exec.storage.enable_v3_text_reader` = 
> true.
> Does it behave the same?
> 
> Kind regards,
> Arina
> 
>> On Jun 24, 2019, at 9:38 PM, Charles Givre <[email protected]> wrote:
>> 
>> Hello Drill Devs,
>> I'm noticing some strange behavior with the newest version of Drill.  If you 
>> query a CSV file, you get the following metadata:
>> 
>> SELECT * FROM dfs.test.`domains.csvh` LIMIT 1
>> 
>> {
>> "queryId": "22eee85f-c02c-5878-9735-091d18788061",
>> "columns": [
>>   "domain"
>> ],
>> "rows": [
>>   {
>>     "domain": "thedataist.com"
>>   }
>> ],
>> "metadata": [
>>   "VARCHAR(0, 0)",
>>   "VARCHAR(0, 0)"
>> ],
>> "queryState": "COMPLETED",
>> "attemptedAutoLimit": 0
>> }
>> 
>> 
>> There are two issues here:
>> 1.  VARCHAR now has precision 
>> 2.  There are twice as many columns as there should be.
>> 
>> Additionally, if you query a regular CSV, without the columns extracted, you 
>> get the following:
>> 
>> "rows": [
>>   {
>>     "columns": "[\"ACCT_NUM\",\"PRODUCT\",\"MONTH\",\"REVENUE\"]"
>>   }
>> ],
>> "metadata": [
>>   "VARCHAR(0, 0)",
>>   "VARCHAR(0, 0)"
>> ],
>> 
>> This is bizarre in that the data type is not being reported correctly, it 
>> should be LIST or something like that, AND we're getting too many columns in 
>> the metadata.  I'll submit a JIRA as well, but could someone please take a 
>> look?
>> Thanks,
>> -- C
>> 
>> 
>> 
> 

Reply via email to