[jira] [Resolved] (IMPALA-11114) calculate_tval fails with ZeroDivisionError if the standard deviations are 0

Jira Wed, 03 Aug 2022 02:43:07 -0700


     [ https://issues.apache.org/jira/browse/IMPALA-11114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]


Gergely Fürnstáhl resolved IMPALA-11114.
----------------------------------------
    Fix Version/s: Impala 4.1.0
       Resolution: Fixed

> calculate_tval fails with ZeroDivisionError if the standard deviations are 0
> ----------------------------------------------------------------------------
>
>                 Key: IMPALA-11114
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11114
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Gergely Fürnstáhl
>            Assignee: Gergely Fürnstáhl
>            Priority: Minor
>             Fix For: Impala 4.1.0
>
>
> Possible cause:
> _Rounding of the data or other forms of truncation could give zero standard 
> deviation when in fact you have some. And if the difference that you are 
> trying to measure is within your measurement error that is a problem not 
> addressed by the t-test._
> [https://stats.stackexchange.com/questions/78570/t-test-with-sample-standard-deviation-of-zero-possible/275879]
> Full log:
> {code:java}
> Traceback (most recent call last):
>   File "/home/gfurnstahl/Impala/tests/benchmark/report_benchmark_results.py", line 1131, in <module>
>     report = Report(grouped, ref_grouped)
>   File "/home/gfurnstahl/Impala/tests/benchmark/report_benchmark_results.py", line 494, in __init__
>     self.__analyze()
>   File "/home/gfurnstahl/Impala/tests/benchmark/report_benchmark_results.py", line 514, in __analyze
>     query_comparison_row = Report.QueryComparisonRow(results, ref_results)
>   File "/home/gfurnstahl/Impala/tests/benchmark/report_benchmark_results.py", line 370, in __init__
>     self.__check_perf_change_significance(results, ref_results))
>   File "/home/gfurnstahl/Impala/tests/benchmark/report_benchmark_results.py", line 390, in __check_perf_change_significance
>     ref_stat[AVG], ref_stat[STDDEV], ref_stat[ITERATIONS])
>   File "/home/gfurnstahl/Impala/tests/util/calculation_util.py", line 65, in calculate_tval
>     return (avg - ref_avg) / sem
> ZeroDivisionError: float division by zero
> Traceback (most recent call last):
>   File "bin/single_node_perf_run.py", line 359, in <module>
>     main()
>   File "bin/single_node_perf_run.py", line 349, in main
>     perf_ab_test(options, args)
>   File "bin/single_node_perf_run.py", line 267, in perf_ab_test
>     compare(temp_dir, hash_a, hash_b)
>   File "bin/single_node_perf_run.py", line 175, in compare
>     report_benchmark_results(file_a, file_b, description)
>   File "bin/single_node_perf_run.py", line 166, in report_benchmark_results
>     stdout=f)
>   File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/subprocess.py", line 190, in check_call
>     raise CalledProcessError(retcode, cmd)
> subprocess.CalledProcessError: Command '['/home/gfurnstahl/Impala/tests/benchmark/report_benchmark_results.py', '--reference_result_file=/home/gfurnstahl/Impala/perf_results/perf_run_0SdUw7/a87f8c5df9f6fbf8d468921642d7ec3d37c5f4de.json', '--input_result_file=/home/gfurnstahl/Impala/perf_results/perf_run_0SdUw7/b4d04112559c3f04ebf42b36deb1cd537dea78c4.json', '--report_description="a87f8c5df9f6fbf8d468921642d7ec3d37c5f4de vs b4d04112559c3f04ebf42b36deb1cd537dea78c4"']' returned non-zero exit status 1
> {code}
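
For illustration only, the failure in the traceback can be avoided by guarding the division. The sketch below is not the patch that was committed for this issue. It assumes a Welch-style standard error, a six-argument signature inferred from the call site in the traceback, and a hypothetical function name (calculate_tval_safe). How a zero standard error is handled (0.0 for identical means, signed infinity otherwise) is likewise an assumption, not Impala's behavior.

```python
import math


def calculate_tval_safe(avg, stddev, iters, ref_avg, ref_stddev, ref_iters):
    """Hypothetical guarded t-value; not the code in calculation_util.py.

    Assumes the standard error of the mean (sem) is computed Welch-style
    from the two standard deviations and iteration counts.
    """
    # float() keeps the division exact under Python 2 as well.
    sem = math.sqrt(stddev ** 2 / float(iters) +
                    ref_stddev ** 2 / float(ref_iters))
    diff = avg - ref_avg
    if sem == 0.0:
        # Both samples show no variance, so the t-statistic is undefined.
        # Assumed policy: identical means give 0.0, otherwise signed infinity.
        if diff == 0:
            return 0.0
        return math.copysign(float("inf"), diff)
    return diff / sem
```

With this policy a caller comparing the result against a critical value still sees an unambiguous "significant" or "not significant" outcome, but as the quoted answer notes, zero variance often points to rounding in the measurements rather than to a real effect.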



--
This message was sent by Atlassian Jira
(v8.20.10#820010)