[ 
https://issues.apache.org/jira/browse/IMPALA-11306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Riza Suminto reassigned IMPALA-11306:
-------------------------------------

    Assignee: Riza Suminto

> single_node_perf_run.py fail to load dataset if scale factor is 1
> -----------------------------------------------------------------
>
>                 Key: IMPALA-11306
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11306
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Infrastructure
>    Affects Versions: Impala 4.0.0
>            Reporter: Riza Suminto
>            Assignee: Riza Suminto
>            Priority: Minor
>              Labels: ramp-up
>
> single_node_perf_run.py has a required argument "scale". If scale > 1, the 
> script runs fine. But if scale = 1 and load is true, the data loading script 
> will fail due to missing dataset. This is becasue the preload script omit the 
> scale number padding when creating dataset directory.
> [https://github.com/apache/impala/blob/6ea15409b879a1286e72848defdda8d5d8568c19/testdata/datasets/tpch/preload#L27]
> ie., tpch scale 1 will create dataset dir "testdata/impala-data/tpch".
> On the other hand, generate-schema-statements.py will create template sql 
> referring to "testdata/impala-data/tpch1".
> [https://github.com/apache/impala/blob/6ea15409b879a1286e72848defdda8d5d8568c19/testdata/bin/generate-schema-statements.py#L599]
>  
> Consider creating symlink if scale factor = 1 in the preload script.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to