[ 
https://issues.apache.org/jira/browse/BEAM-4430?focusedWorklogId=119374&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-119374
 ]

ASF GitHub Bot logged work on BEAM-4430:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 05/Jul/18 14:12
            Start Date: 05/Jul/18 14:12
    Worklog Time Spent: 10m 
      Work Description: szewi commented on issue #465: [BEAM-4430] Improve 
Performance Testing Documentation
URL: https://github.com/apache/beam-site/pull/465#issuecomment-402735484
 
 
   Ok, so I went through this and it looks good to me. One thing that I'm 
considering is using local kubernetes clusters for development purposes. Some 
users may want to recreate infra on locally available clusters via minikube and 
of course, there will be the different port used as minikube uses ports  >30000 
and we need to override default ports when running pipelines. Simple services 
that use a single port (like Postgres 5432)could handle that(we just override 
Postgres port 5432 with some 300xx port), but when we run complex multi-port 
services like hdfs that simply won't work. What I mean is the most suitable 
infra to develop is having GKE on GCP, rather than using minikube or local 
kubernetes clusters. For simple datastores minikube is ok, but for complex it's 
painful. The advantage of having kubernetes on GCP is also the fact that infra 
would be the same as the one created by Jenkins. TLdr; we should suggest using 
GKE rather than local kubernetes clusters. 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 119374)
    Time Spent: 2h  (was: 1h 50m)

> Improve Performance Testing Documentation
> -----------------------------------------
>
>                 Key: BEAM-4430
>                 URL: https://issues.apache.org/jira/browse/BEAM-4430
>             Project: Beam
>          Issue Type: Wish
>          Components: testing
>            Reporter: Lukasz Gajowy
>            Assignee: Lukasz Gajowy
>            Priority: Critical
>          Time Spent: 2h
>  Remaining Estimate: 0h
>
> Currently, the only documentation regarding IO Performance Testing can be 
> found here: 
> [https://beam.apache.org/documentation/io/testing/#i-o-transform-integration-tests].
>  This is certainly not enough given that the performance testing framework 
> currently allows to run tests:
>  - on local or hdfs filesystems
>  - on direct or dataflow runners
>  - manually using integrationTest task
>  - automatically using performanceTest task
>  - using pkb.py tool directly (PerfKitBenchmarker)
>  - on demand from pending Pull Requests 
>  - detecting anomalies
>  - gathering results in dashboards
> All the above bullets (and maybe others - to be investigated) need more 
> explanation in the docs to make the Performance Testing Framework usable by 
> the broader community.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to