xunxunmimi5577 commented on issue #1671:
URL:
https://github.com/apache/datafusion-ballista/issues/1671#issuecomment-4431448303
Hi @milenkovicm , Is the output_rows metric also incorrect?
The result of rest api is as below,
```text
{
"stage_id": "5",
"stage_status": "Successful",
"input_rows": 27787616,
"output_rows": 388416125,
"elapsed_compute": "31.84s"
},
{
"stage_id": "6",
"stage_status": "Successful",
"input_rows": 275041999,
"output_rows": 550083998,
"elapsed_compute": "179.56s"
},
and the log of scheduler shows,
=== [8TocWe6/6] Stage finished, physical plan with metrics ===
SortShuffleWriterExec: partitioning=Hash([ss_ticket_number@3, ss_item_sk@0],
16), metrics=[output_rows=275.0 M, input_rows=275.0 M, spill_bytes=17.96 B,
spill_count=514, repart_time=13.63s, spill_time=152.41s, write_time=484.16s]
DataSourceExec: file_groups={16 groups:
[[cloud/data1/rss/tpcds-data-merged/store_sales/ss_sold_date_sk=2450816/part-00156-96529834-4182-4aef-b514-adc8d36ccbf0.c000.snappy.parquet:0..5968266,
cloud/data1/rss/tpcds-data-merged/store_sales`
and
=== [8TocWe6/5] Stage finished, physical plan with metrics ===
SortShuffleWriterExec: partitioning=Hash([sr_ticket_number@1, sr_item_sk@0],
16), metrics=[output_rows=27.79 M, input_rows=27.79 M, spill_bytes=0,
spill_count=0, repart_time=935.23ms, spill_time=16ns, write_time=2.11s]
DataSourceExec: file_groups={16 groups:
[[cloud/data1/rss/tpcds-data-merged/store_returns/sr_returned_date_sk=2450820/part-00125-2043643b-fe9a-4a4d-bc32-39c71ceaa19a.c000.snappy.parquet:0..9628,
cloud/data1/rss/tpcds-data-merged/store_returns/sr_returned_date_sk=2450821/part-00153-2043643b-fe9a-4a4d-bc32-39c71ceaa19a.c000.snappy.parquet:0..15221,
cloud/data1/rss/tpcds-data-merg`
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]