[ 
https://issues.apache.org/jira/browse/IMPALA-5961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-5961:
----------------------------------
    Component/s:     (was: Backend)
                 Infrastructure

> Test data for TPC-DS schema contains a non-Unicode character
> ------------------------------------------------------------
>
>                 Key: IMPALA-5961
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5961
>             Project: IMPALA
>          Issue Type: Task
>          Components: Infrastructure
>    Affects Versions: Impala 2.10.0
>            Reporter: Tim Wood
>            Assignee: Tim Wood
>            Priority: Major
>              Labels: TPCDS
>         Attachments: ttq-50.out
>
>
> The customer table contains rows whose c_birth_country values contain 
> character 0xd4 (o-circumflex) in an illegal position for Unicode. This causes 
> tpcds-q30 to fail. Either tests need to change to accommodate the different 
> character set, or the test data should change to contain the proper Unicode 
> character.
> To reproduce, build a mini-cluster and load with test data (./buildall.sh 
> -testdata ...), then run the query from the attached file. Find the affected 
> rows with:
> SELECT * FROM customer WHERE c_birth_country LIKE '%IVOIRE%';



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to