Hello Pranav Lodha, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/23588

to look at the new patch set (#7).

Change subject: IMPALA-9715: Add devdata dataload option
......................................................................

IMPALA-9715: Add devdata dataload option

Adds a lightweight dataload for development that only loads text and
parquet files. These are the most common formats used in testing, and
provide a good baseline for development while using much less storage.

Enable with `./buildall.sh -devdata`. Also skips loading tables that
require Hive as they're less commonly used in testing.

Data load with this option (first 3 run in parallel)

    Loading TPC-H data OK (Took: 0 min 15 sec)
    Loading TPC-DS data OK (Took: 2 min 18 sec)
    Loading functional-query data OK (Took: 3 min 43 sec)
    Loading Hive UDFs OK (Took: 0 min 12 sec)
    Running custom post-load steps OK (Took: 0 min 7 sec)
    Caching test tables OK (Took: 0 min 3 sec)
    Creating internal HBase table OK (Took: 0 min 12 sec)
    Checking HDFS health OK (Took: 0 min 2 sec)
    Logging created files OK (Took: 0 min 2 sec)
    Loading external data sources OK (Took: 0 min 19 sec)
    Computing table stats OK (Took: 0 min 14 sec)
    Creating tpcds testcase data OK (Took: 0 min 47 sec)

Reduces dataload from ~30 minutes to ~5 minutes with 16 cores.

Change-Id: Ia942c2df5f238c570a2fc5e547786ac62b9f3af4
---
M bin/impala-config.sh
M bin/load-data.py
M buildall.sh
M testdata/bin/create-load-data.sh
M testdata/bin/generate-schema-statements.py
5 files changed, 72 insertions(+), 15 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/88/23588/7
--
To view, visit http://gerrit.cloudera.org:8080/23588
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ia942c2df5f238c570a2fc5e547786ac62b9f3af4
Gerrit-Change-Number: 23588
Gerrit-PatchSet: 7
Gerrit-Owner: Michael Smith <[email protected]>
Gerrit-Reviewer: Impala Public Jenkins <[email protected]>
Gerrit-Reviewer: Michael Smith <[email protected]>
Gerrit-Reviewer: Pranav Lodha <[email protected]>

Reply via email to