Valentyn Tymofieiev created BEAM-6542:
-----------------------------------------
Summary: Python ValidatesContainer test suite should verify that
installed dependencies match dependencies in requirements file.
Key: BEAM-6542
URL: https://issues.apache.org/jira/browse/BEAM-6542
Project: Beam
Issue Type: Improvement
Components: sdk-py-core
Reporter: Valentyn Tymofieiev
Assignee: Ahmet Altay
Python [ValidatesContainer test
suites|https://github.com/apache/beam/blob/master/.test-infra/jenkins/job_PostCommit_Python_ValidatesContainer_Dataflow.groovy]
build Docker containers and run some integration tests using those containers,
to make sure that containers can be built and used. A python container that we
build [includes Beam
SDK|https://github.com/apache/beam/blob/1a6490d3fd9245fc59838bd4bd531755304a855a/sdks/python/container/Dockerfile#L47].
During container build we install several pip packages, which is influenced by
[requirements.txt|https://github.com/apache/beam/blob/master/sdks/python/container/base_image_requirements.txt],
[SDK
dependencies|https://github.com/apache/beam/blob/d9a1bac19c52b92804a204b7ca881b3e8617b42c/sdks/python/setup.py#L193],
and downstream dependencies of packages we install.
The purpose of specifying dependencies in setup.py is to define minimal viable
requirements for SDK to be installed.
The purpose of requirements.txt is to configure the runtime environment for SDK
harness as and requires more precision to make sure there are no version
dependency conflicts and to make sure that versions of dependencies installed
in the container match across two different container builds. therefore we
should specify exact versions in requirements.txt and also include all
transitive dependencies of Beam.
Unfortunately, requirements.txt can easily go out of sync even with Beam SDK
requirements ([example|https://github.com/apache/beam/pull/7657])
We should strengthen ValidatesContainer test suite to verify that version of
dependencies installed in the container matches versions specified in
requirements.txt.
One possible way to do it is to run `pip freeze` and compare the output with
requirements.txt.
cc [~markflyhigh], [~altay].
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)