This is an automated email from the ASF dual-hosted git repository.
agrove pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/arrow-datafusion.git
The following commit(s) were added to refs/heads/master by this push:
new fed204573 remove reference to docker image that is no longer available
(#2795)
fed204573 is described below
commit fed20457306f5327eb3d772651d070054ee786fb
Author: Andy Grove <[email protected]>
AuthorDate: Sat Jun 25 20:19:16 2022 -0600
remove reference to docker image that is no longer available (#2795)
---
benchmarks/README.md | 31 -------------------------------
1 file changed, 31 deletions(-)
diff --git a/benchmarks/README.md b/benchmarks/README.md
index ba68b507d..3d4aedd4d 100644
--- a/benchmarks/README.md
+++ b/benchmarks/README.md
@@ -61,37 +61,6 @@ The benchmark program also supports CSV and Parquet input
file formats and a uti
cargo run --release --bin tpch -- convert --input ./data --output
/mnt/tpch-parquet --format parquet
```
-This utility does not yet provide support for changing the number of
partitions when performing the conversion. Another
-option is to use the following Docker image to perform the conversion from
`tbl` files to CSV or Parquet.
-
-```bash
-docker run -it ballistacompute/spark-benchmarks:0.4.0-SNAPSHOT
- -h, --help Show help message
-
-Subcommand: convert-tpch
- -i, --input <arg>
- --input-format <arg>
- -o, --output <arg>
- --output-format <arg>
- -p, --partitions <arg>
- -h, --help Show help message
-```
-
-Note that it is necessary to mount volumes into the Docker container as
appropriate so that the file conversion process
-can access files on the host system.
-
-Here is a full example that assumes that data is stored in the `/mnt` path on
the host system.
-
-```bash
-docker run -v /mnt:/mnt -it ballistacompute/spark-benchmarks:0.4.0-SNAPSHOT \
- convert-tpch \
- --input /mnt/tpch/csv \
- --input-format tbl \
- --output /mnt/tpch/parquet \
- --output-format parquet \
- --partitions 64
-```
-
## Expected output
The result of query 1 should produce the following output when executed
against the SF=1 dataset.