je-ik commented on issue #28256:
URL: https://github.com/apache/beam/issues/28256#issuecomment-1785196710

   I'm obviously doing something wrong.
   
   Running the command like this
   ```
   $ ./gradlew :sdks:java:testing:tpcds:run -Ptpcds.runner=":runners:spark:3" 
-Ptpcds.args=" \
     --runner=SparkRunner \
     --queries=3 \
     --tpcParallel=1 \
     --dataDirectory=gs://beam-tpcds/datasets/parquet/partitioned/ \
     --dataSize=1GB \
     --sourceType=PARQUET \
     --resultsDirectory=/tmp/tpc-ds-results/"
   ```
   
   I get a success
   ```
   
+--------------+------------------------------+--------------+------------+--------------+--------------------------------+--------------------------------+----------------------+
   |  Query Name  |           Job Name           |  Data Size   |  Dialect   |  
  Status    |           Start Time           |            End Time            | 
 Elapsed Time(sec)   |
   
+--------------+------------------------------+--------------+------------+--------------+--------------------------------+--------------------------------+----------------------+
   |    query3    |  query3result1698672244841   |     1GB      |  Calcite   |  
Successful  |  Mon Oct 30 14:24:08 CET 2023  |  Mon Oct 30 14:24:15 CET 2023  | 
       6.483         |
   
+--------------+------------------------------+--------------+------------+--------------+--------------------------------+--------------------------------+----------------------+
   ```
   
   but the outputs are empty
   ```
   $ ls -l /tmp/tpc-ds-results/1GB/
   total 0
   -rw-rw-r-- 1 honza honza 0 Oct 30 14:24 
query3result1698672244841-00000-of-00001.txt
   ```
   
   Accessing the bucket seems to be working fine, e.g.:
   ```
   $ gsutil ls -l gs://beam-tpcds/datasets/parquet/partitioned/1GB/catalog_page
            8  2021-03-24T06:03:33Z  
gs://beam-tpcds/datasets/parquet/partitioned/1GB/catalog_page/._SUCCESS.crc
         5456  2021-03-24T06:03:33Z  
gs://beam-tpcds/datasets/parquet/partitioned/1GB/catalog_page/.part-00000-43e37567-6034-4fae-bda9-db2a85216f3f-c000.snappy.parquet.crc
            0  2021-03-24T06:03:34Z  
gs://beam-tpcds/datasets/parquet/partitioned/1GB/catalog_page/_SUCCESS
       697339  2021-03-24T06:03:34Z  
gs://beam-tpcds/datasets/parquet/partitioned/1GB/catalog_page/part-00000-43e37567-6034-4fae-bda9-db2a85216f3f-c000.snappy.parquet
   TOTAL: 4 objects, 702803 bytes (686.33 KiB)
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to