A survey would be a good place to start.  This came up in the
python2-sunsetting thread as well: we don't know what versions of python
people are using with Beam, which makes it difficult to answer the question
of support.

-chad


On Mon, Sep 23, 2019 at 2:57 PM Ankur Goenka <goe...@google.com> wrote:

> I agree, these are the questions that need to be answered.
> The data can be anonymize and stored as public data in BigQuery or some
> other place.
>
> The intent is to get the usage statistics so that we can get to know what
> people are using Flink or Spark etc and not intended for discussion or a
> help channel.
> I also think that we don't need to monitor this actively as it's more like
> a survey rather than active channel to get issues resolved.
>
> If we think its useful for the community then we come up with the solution
> as to how can we do this (similar to how we released the container images).
>
>
>
> On Fri, Sep 20, 2019 at 4:38 PM Kyle Weaver <kcwea...@google.com> wrote:
>
>> There are some logistics that would need worked out. For example, Where
>> would the data go? Who would own it?
>>
>> Also, I'm not convinced we need yet another place to discuss Beam when we
>> already have discussed the challenge of simultaneously monitoring mailing
>> lists, Stack Overflow, Slack, etc. While "how do you use Beam" is certainly
>> an interesting question, and I'd be curious to know that >= X many people
>> use a certain runner, I'm not sure answers to these questions are as useful
>> for guiding the future of Beam as discussions on the dev/users lists, etc.
>> as the latter likely result in more depth/specific feedback.
>>
>> However, I do think it could be useful in general to include links
>> directly in the console output. For example, maybe something along the
>> lines of "Oh no, your Flink pipeline crashed! Check Jira/file a bug/ask the
>> mailing list."
>>
>> Kyle Weaver | Software Engineer | github.com/ibzib | kcwea...@google.com
>>
>>
>> On Fri, Sep 20, 2019 at 4:14 PM Ankur Goenka <goe...@google.com> wrote:
>>
>>> Hi,
>>>
>>> At the moment we don't really have a good way to collect any usage
>>> statistics for Apache Beam. Like runner used etc. As many of the users
>>> don't really have a way to report their usecase.
>>> How about if we create a feedback page where users can add their
>>> pipeline details and usecase.
>>> Also, we can start printing the link to this page when user launch the
>>> pipeline in the command line.
>>> Example:
>>> $ python my_pipeline.py --runner DirectRunner --input /tmp/abc
>>>
>>> Starting pipeline
>>> Please use
>>> http://feedback.beam.org?args=runner=DirectRunner,input=/tmp/abc
>>> Pipeline started
>>> ......
>>>
>>> Using a link and not publishing the data automatically will give user
>>> control over what they publish and what they don't. We can enhance the text
>>> and usage further but the basic idea is to ask for user feeback at each run
>>> of the pipeline.
>>> Let me know what you think.
>>>
>>>
>>> Thanks,
>>> Ankur
>>>
>>

Reply via email to