[arrow-datafusion] branch master updated: remove reference to docker image that is no longer available (#2795)

agrove Sat, 25 Jun 2022 19:19:45 -0700

This is an automated email from the ASF dual-hosted git repository.

agrove pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/arrow-datafusion.git



The following commit(s) were added to refs/heads/master by this push:
     new fed204573 remove reference to docker image that is no longer available 
(#2795)
fed204573 is described below

commit fed20457306f5327eb3d772651d070054ee786fb
Author: Andy Grove <[email protected]>
AuthorDate: Sat Jun 25 20:19:16 2022 -0600

    remove reference to docker image that is no longer available (#2795)
---
 benchmarks/README.md | 31 -------------------------------
 1 file changed, 31 deletions(-)

diff --git a/benchmarks/README.md b/benchmarks/README.md
index ba68b507d..3d4aedd4d 100644
--- a/benchmarks/README.md
+++ b/benchmarks/README.md
@@ -61,37 +61,6 @@ The benchmark program also supports CSV and Parquet input 
file formats and a uti
 cargo run --release --bin tpch -- convert --input ./data --output 
/mnt/tpch-parquet --format parquet
 ```
 
-This utility does not yet provide support for changing the number of 
partitions when performing the conversion. Another
-option is to use the following Docker image to perform the conversion from 
`tbl` files to CSV or Parquet.
-
-```bash
-docker run -it ballistacompute/spark-benchmarks:0.4.0-SNAPSHOT
-  -h, --help   Show help message
-
-Subcommand: convert-tpch
-  -i, --input  <arg>
-      --input-format  <arg>
-  -o, --output  <arg>
-      --output-format  <arg>
-  -p, --partitions  <arg>
-  -h, --help                   Show help message
-```
-
-Note that it is necessary to mount volumes into the Docker container as 
appropriate so that the file conversion process
-can access files on the host system.
-
-Here is a full example that assumes that data is stored in the `/mnt` path on 
the host system.
-
-```bash
-docker run -v /mnt:/mnt -it ballistacompute/spark-benchmarks:0.4.0-SNAPSHOT \
-  convert-tpch \
-  --input /mnt/tpch/csv \
-  --input-format tbl \
-  --output /mnt/tpch/parquet \
-  --output-format parquet \
-  --partitions 64
-```
-
 ## Expected output
 
 The result of query 1 should produce the following output when executed 
against the SF=1 dataset.

[arrow-datafusion] branch master updated: remove reference to docker image that is no longer available (#2795)

Reply via email to