[ https://issues.apache.org/jira/browse/DRILL-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rakesh updated DRILL-5454:
--------------------------
Description:
I have a Parquet file with an array column. When I use the function
'REPEATED_COUNT' in the SELECT list it works fine, but when I use it in the
WHERE clause I get an error:
org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
SchemaChangeException: Failure while trying to materialize incoming schema.
Errors: Error in expression at index 2. Error: Missing function implementation:
[repeated_count(INT-OPTIONAL)]. Full expression: null.. Fragment 1:4 [Error Id:
1a9b034c-949a-4faa-9185-55d40e4851e7 on uswxapcsd043.ussdnve.baml.com:31010]
(org.apache.drill.exec.exception.SchemaChangeException) Failure while trying to
materialize incoming schema. Errors: Error in expression at index 2. Error:
Missing function implementation: [repeated_count(INT-OPTIONAL)]. Full
expression: null..
Note: The query reads data from a Parquet file. This file was generated
using a CTAS statement from a CSV file, and the array column was generated
using CONVERT_FROM(colName, 'JSON'). The column holds strings in the correct
format (["x1", "x2"]). Also, the count comes out correct when I use
REPEATED_COUNT(arrayCol) in the SELECT list; it only fails in the WHERE
clause.
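For illustration, the setup described in the note might look roughly like the
sketch below. The workspace dfs.tmp, the table name array_table, the CSV path,
and the use of columns[0] for the source column are hypothetical placeholders;
only CONVERT_FROM(..., 'JSON'), REPEATED_COUNT and arrayCol come from the
description above.

```sql
-- Hypothetical CTAS: build a Parquet table whose array column is parsed from
-- a JSON string stored in a CSV file (path and workspace are assumptions).
CREATE TABLE dfs.tmp.`array_table` AS
SELECT CONVERT_FROM(columns[0], 'JSON') AS arrayCol
FROM dfs.`/path/to/input.csv`;

-- Reported as working: REPEATED_COUNT in the SELECT list.
SELECT REPEATED_COUNT(arrayCol) AS cnt
FROM dfs.tmp.`array_table`;

-- Reported as failing: the same call in the WHERE clause.
SELECT arrayCol
FROM dfs.tmp.`array_table`
WHERE REPEATED_COUNT(arrayCol) > 1;
```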
A similar exception (not exactly the same one) can be reproduced with:
SELECT Data, REPEATED_COUNT(Data)
FROM (
  SELECT CONVERT_FROM('["x1", "x2"]', 'JSON') AS Data FROM (VALUES(1))
)
WHERE REPEATED_COUNT(Data) > 1
> REPEATED_COUNT throws error when used in where clause
> -----------------------------------------------------
>
> Key: DRILL-5454
> URL: https://issues.apache.org/jira/browse/DRILL-5454
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Rakesh