Martijn Visser created FLINK-28263:
--------------------------------------
Summary: TPC-DS Bash e2e tests don't clean-up after completing
Key: FLINK-28263
URL: https://issues.apache.org/jira/browse/FLINK-28263
Project: Flink
Issue Type: Bug
Components: Tests
Affects Versions: 1.16.0
Reporter: Martijn Visser
When debugging the disk space usage for the e2e tests, the top 20 folders with
the largest file size are:
{code:java}
2022-06-27T09:32:59.8000587Z Jun 27 09:32:59 List top 20 directories with largest file size
2022-06-27T09:33:00.9811803Z Jun 27 09:33:00 4088524 .
2022-06-27T09:33:00.9813428Z Jun 27 09:33:00 1277080 ./flink-end-to-end-tests
2022-06-27T09:33:00.9814324Z Jun 27 09:33:00 624512 ./flink-dist
2022-06-27T09:33:00.9815152Z Jun 27 09:33:00 624124 ./flink-dist/target
2022-06-27T09:33:00.9816093Z Jun 27 09:33:00 500032 ./flink-dist/target/flink-1.16-SNAPSHOT-bin
2022-06-27T09:33:00.9817429Z Jun 27 09:33:00 500028 ./flink-dist/target/flink-1.16-SNAPSHOT-bin/flink-1.16-SNAPSHOT
2022-06-27T09:33:00.9818167Z Jun 27 09:33:00 486412 ./.git
2022-06-27T09:33:00.9819096Z Jun 27 09:33:00 479416 ./.git/objects
2022-06-27T09:33:00.9819512Z Jun 27 09:33:00 479408 ./.git/objects/pack
2022-06-27T09:33:00.9820584Z Jun 27 09:33:00 461456 ./flink-connectors
2022-06-27T09:33:00.9821403Z Jun 27 09:33:00 449832 ./.git/objects/pack/pack-0bdd9e3186d0cb404910c5843d19b5cb80b84fe0.pack
2022-06-27T09:33:00.9821992Z Jun 27 09:33:00 349236 ./flink-table
2022-06-27T09:33:00.9822631Z Jun 27 09:33:00 293008 ./flink-dist/target/flink-1.16-SNAPSHOT-bin/flink-1.16-SNAPSHOT/opt
2022-06-27T09:33:00.9823233Z Jun 27 09:33:00 251272 ./flink-filesystems
2022-06-27T09:33:00.9823818Z Jun 27 09:33:00 246588 ./flink-end-to-end-tests/flink-streaming-kinesis-test
2022-06-27T09:33:00.9824502Z Jun 27 09:33:00 246464 ./flink-end-to-end-tests/flink-streaming-kinesis-test/target
2022-06-27T09:33:00.9825210Z Jun 27 09:33:00 196656 ./flink-dist/target/flink-1.16-SNAPSHOT-bin/flink-1.16-SNAPSHOT/lib
2022-06-27T09:33:00.9825966Z Jun 27 09:33:00 184364 ./flink-end-to-end-tests/flink-streaming-kinesis-test/target/KinesisExample.jar
2022-06-27T09:33:00.9826652Z Jun 27 09:33:00 156136 ./flink-end-to-end-tests/flink-tpcds-test
2022-06-27T09:33:00.9827284Z Jun 27 09:33:00 151180 ./flink-end-to-end-tests/flink-tpcds-test/target
{code}
See
https://dev.azure.com/martijn0323/Flink/_build/results?buildId=2732&view=logs&j=0e31ee24-31a6-528c-a4bf-45cde9b2a14e&t=ff03a8fa-e84e-5199-efb2-5433077ce8e2&l=5093
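For anyone who wants to reproduce a listing like the one above locally, here is a minimal sketch. The function name {{list_largest_dirs}} and its arguments are hypothetical, and the exact command used by the CI scripts may differ. It assumes that {{du -a}} piped through a numeric sort is close enough, since the log also lists individual files such as {{KinesisExample.jar}}.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not taken from the Flink scripts): print the N largest
# entries (directories and files) under a root directory.
# Sizes are in whatever block unit `du` uses by default on the platform,
# so they are only meaningful relative to each other.
list_largest_dirs() {
    local root="${1:-.}"   # directory to inspect, defaults to the current one
    local count="${2:-20}" # number of entries to print, defaults to 20

    # `du -a` prints "<size><TAB><path>" for every file and directory.
    # `sort -rn` orders by the leading number, largest first.
    du -a "$root" 2>/dev/null | sort -rn | head -n "$count"
}
```

Running {{list_largest_dirs . 20}} from the repository root before and after a test makes it easy to diff the two listings and spot which directories grew.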
After running {{TPC-DS end-to-end test}} and after the clean-up,
{{flink-tpcds-test/target}} has grown from 151180 to 1378656, with 1223944 of
that under {{target/table}}. The top 20 now looks like this:
{code:java}
2022-06-27T09:49:51.7694429Z Jun 27 09:49:51 List top 20 directories with largest file size AFTER cleaning temorary folders and files
2022-06-27T09:49:52.9617221Z Jun 27 09:49:52 5315996 .
2022-06-27T09:49:52.9618830Z Jun 27 09:49:52 2504556 ./flink-end-to-end-tests
2022-06-27T09:49:52.9619848Z Jun 27 09:49:52 1383612 ./flink-end-to-end-tests/flink-tpcds-test
2022-06-27T09:49:52.9620796Z Jun 27 09:49:52 1378656 ./flink-end-to-end-tests/flink-tpcds-test/target
2022-06-27T09:49:52.9621730Z Jun 27 09:49:52 1223944 ./flink-end-to-end-tests/flink-tpcds-test/target/table
2022-06-27T09:49:52.9622844Z Jun 27 09:49:52 624508 ./flink-dist
2022-06-27T09:49:52.9623585Z Jun 27 09:49:52 624120 ./flink-dist/target
2022-06-27T09:49:52.9624398Z Jun 27 09:49:52 500028 ./flink-dist/target/flink-1.16-SNAPSHOT-bin
2022-06-27T09:49:52.9625366Z Jun 27 09:49:52 500024 ./flink-dist/target/flink-1.16-SNAPSHOT-bin/flink-1.16-SNAPSHOT
2022-06-27T09:49:52.9625994Z Jun 27 09:49:52 486412 ./.git
2022-06-27T09:49:52.9626514Z Jun 27 09:49:52 479416 ./.git/objects
2022-06-27T09:49:52.9631740Z Jun 27 09:49:52 479408 ./.git/objects/pack
2022-06-27T09:49:52.9632755Z Jun 27 09:49:52 461456 ./flink-connectors
2022-06-27T09:49:52.9633717Z Jun 27 09:49:52 449832 ./.git/objects/pack/pack-0bdd9e3186d0cb404910c5843d19b5cb80b84fe0.pack
2022-06-27T09:49:52.9634769Z Jun 27 09:49:52 379348 ./flink-end-to-end-tests/flink-tpcds-test/target/table/store_sales.dat
2022-06-27T09:49:52.9635596Z Jun 27 09:49:52 349236 ./flink-table
2022-06-27T09:49:52.9636489Z Jun 27 09:49:52 293008 ./flink-dist/target/flink-1.16-SNAPSHOT-bin/flink-1.16-SNAPSHOT/opt
2022-06-27T09:49:52.9637526Z Jun 27 09:49:52 288980 ./flink-end-to-end-tests/flink-tpcds-test/target/table/catalog_sales.dat
2022-06-27T09:49:52.9638378Z Jun 27 09:49:52 251272 ./flink-filesystems
2022-06-27T09:49:52.9639238Z Jun 27 09:49:52 246588 ./flink-end-to-end-tests/flink-streaming-kinesis-test
{code}
See
https://dev.azure.com/martijn0323/Flink/_build/results?buildId=2732&view=logs&j=0e31ee24-31a6-528c-a4bf-45cde9b2a14e&t=ff03a8fa-e84e-5199-efb2-5433077ce8e2&l=5708
This results in "not enough disk space" errors during various runs further
downstream. This test should also properly clean up its files.
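One possible shape for such a clean-up is sketched below. It is only an illustration, not the actual fix: the function names {{cleanup_generated_data}} and {{run_with_cleanup}} are hypothetical, and the sketch assumes the generated data lives in a single directory (in the log above that would be {{flink-tpcds-test/target/table}}). It uses a Bash {{EXIT}} trap inside a subshell so the directory is removed whether the test passes or fails.

```shell
#!/usr/bin/env bash
# Hypothetical sketch (names are illustrative, not from the Flink scripts):
# remove a directory of generated test data once the test command finishes.

cleanup_generated_data() {
    local dir="$1"
    # Guard against removing the wrong thing if the variable is empty or "/".
    if [ -z "$dir" ] || [ "$dir" = "/" ]; then
        echo "refusing to remove '$dir'" >&2
        return 1
    fi
    rm -rf -- "$dir"
}

run_with_cleanup() {
    local dir="$1"
    shift
    # The subshell gets its own EXIT trap, so the clean-up runs when the
    # wrapped command ends, on success and on failure alike. The subshell's
    # exit status is that of the wrapped command.
    (
        trap 'cleanup_generated_data "$dir"' EXIT
        "$@"
    )
}
```

A test script could then call {{run_with_cleanup "$data_dir" run_queries}}, where {{data_dir}} and {{run_queries}} stand in for whatever the real script uses. Keeping the trap in a subshell avoids overwriting any {{EXIT}} trap the surrounding test framework may already have installed.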