[ 
https://issues.apache.org/jira/browse/DRILL-5928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Paul Rogers updated DRILL-5928:
-------------------------------
    Description: 
Consider the following test framework query:

{noformat}
/root/drillAutomation/mapr/framework/resources/Functional/data-shapes/wide-columns/general/q2.q
{noformat}

Defined as follows:

{noformat}
select count(*) from `data-shapes/wide-columns/general/100000.tbl`
{noformat}

The data file ({{100000.tbl}}) contains a single row, single field of 100K 
width. The query suceeds and returns 1.

However, if we did the following:

{noformat}
select * from `data-shapes/wide-columns/general/100000.tbl`
{noformat}

then the query would fail with an oversize field exception. (Before Drill 1.13 
the error is "Tried to write something large in a field", while 1.13 and after 
the error is "Text column is too large."

The question is, should the query succeed in the {{COUNT\(*)}} case if it would 
fail in the {{SELECT *}} case?

Similarly, one can invent a table with more than 64K columns. A {{SELECT *}} 
query would fail with an index out of bounds exception (Drill 1.12 or earlier) 
or a "Too many fields" error (Drill 1.13 and later.) But, a {{COUNT(*)}} query 
would succeed. Is this desired?

  was:
Consider the following test framework query:

{noformat}
/root/drillAutomation/mapr/framework/resources/Functional/data-shapes/wide-columns/general/q2.q
{noformat}

Defined as follows:

{noformat}
select count(*) from `data-shapes/wide-columns/general/100000.tbl`
{noformat}

The data file ({{100000.tbl}}) contains a single row, single field of 100K 
width. The query suceeds and returns 1.

However, if we did the following:

{noformat}
select * from `data-shapes/wide-columns/general/100000.tbl`
{noformat}

then the query would fail with an oversize field exception. (Before Drill 1.13 
the error is "Tried to write something large in a field", while 1.13 and after 
the error is "Text column is too large."

The question is, should the query succeed in the {{COUNT\(*)}} case if it would 
fail in the {{SELECT *}} case?


> Drill allows a COUNT(*) over records that are too wide to load
> --------------------------------------------------------------
>
>                 Key: DRILL-5928
>                 URL: https://issues.apache.org/jira/browse/DRILL-5928
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 1.11.0
>            Reporter: Paul Rogers
>            Priority: Minor
>
> Consider the following test framework query:
> {noformat}
> /root/drillAutomation/mapr/framework/resources/Functional/data-shapes/wide-columns/general/q2.q
> {noformat}
> Defined as follows:
> {noformat}
> select count(*) from `data-shapes/wide-columns/general/100000.tbl`
> {noformat}
> The data file ({{100000.tbl}}) contains a single row, single field of 100K 
> width. The query suceeds and returns 1.
> However, if we did the following:
> {noformat}
> select * from `data-shapes/wide-columns/general/100000.tbl`
> {noformat}
> then the query would fail with an oversize field exception. (Before Drill 
> 1.13 the error is "Tried to write something large in a field", while 1.13 and 
> after the error is "Text column is too large."
> The question is, should the query succeed in the {{COUNT\(*)}} case if it 
> would fail in the {{SELECT *}} case?
> Similarly, one can invent a table with more than 64K columns. A {{SELECT *}} 
> query would fail with an index out of bounds exception (Drill 1.12 or 
> earlier) or a "Too many fields" error (Drill 1.13 and later.) But, a 
> {{COUNT(*)}} query would succeed. Is this desired?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to