[jira] [Created] (BEAM-4206) Python: WordCount runs against manually started Flink at master

2018-04-30 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4206: -- Summary: Python: WordCount runs against manually started Flink at master Key: BEAM-4206 URL: https://issues.apache.org/jira/browse/BEAM-4206 Project: Beam

[jira] [Updated] (BEAM-4067) Java: FlinkPortableTestRunner: runs portably via self-started local Flink

2018-04-30 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-4067: --- Summary: Java: FlinkPortableTestRunner: runs portably via self-started local Flink (was: Add

[jira] [Commented] (BEAM-4067) Add portable Flink test runner

2018-04-30 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16459299#comment-16459299 ] Eugene Kirpichov commented on BEAM-4067: Clarification: this should probably go through

[jira] [Created] (BEAM-4214) Python ValidatesRunner test coverage is very poor

2018-04-30 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4214: -- Summary: Python ValidatesRunner test coverage is very poor Key: BEAM-4214 URL: https://issues.apache.org/jira/browse/BEAM-4214 Project: Beam Issue Type:

[jira] [Updated] (BEAM-4213) Python: Portable batch Flink runner passes all ValidatesRunner tests

2018-04-30 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-4213: --- Description: Like BEAM-4176, but 1) for Python; 2) there is no baseline; 3) the set of

[jira] [Commented] (BEAM-3714) JdbcIO.read() should create a forward-only, read-only result set

2018-05-01 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16459818#comment-16459818 ] Eugene Kirpichov commented on BEAM-3714: JB, what's the reason for reassigning this issue to

[jira] [Created] (BEAM-3083) BigQueryIO.write() with DynamicDestinations should not call getSchema() on every element

2017-10-20 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3083: -- Summary: BigQueryIO.write() with DynamicDestinations should not call getSchema() on every element Key: BEAM-3083 URL: https://issues.apache.org/jira/browse/BEAM-3083

[jira] [Closed] (BEAM-2994) Refactor TikaIO

2017-10-26 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-2994. -- Resolution: Fixed > Refactor TikaIO > --- > > Key: BEAM-2994 >

[jira] [Closed] (BEAM-2870) BQ Partitioned Table Write Fails When Destination has Partition Decorator

2018-01-05 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-2870. -- Resolution: Fixed Assignee: Eugene Kirpichov (was: Reuven Lax) This was fixed for batch

[jira] [Created] (BEAM-3424) CassandraIO uses 1 split if can't estimate size

2018-01-06 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3424: -- Summary: CassandraIO uses 1 split if can't estimate size Key: BEAM-3424 URL: https://issues.apache.org/jira/browse/BEAM-3424 Project: Beam Issue Type:

[jira] [Created] (BEAM-3425) CassandraIO fails to estimate size: Codec not found for requested operation: [varchar <-> java.lang.Long]

2018-01-06 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3425: -- Summary: CassandraIO fails to estimate size: Codec not found for requested operation: [varchar <-> java.lang.Long] Key: BEAM-3425 URL:

[jira] [Updated] (BEAM-3426) Java 8 support

2018-01-08 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-3426: --- Fix Version/s: 2.3.0 > Java 8 support > -- > > Key: BEAM-3426 >

[jira] [Commented] (BEAM-71) Watermark library

2018-01-16 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-71?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327724#comment-16327724 ] Eugene Kirpichov commented on BEAM-71: -- As far as SDF is concerned, the only relevant method is

[jira] [Commented] (BEAM-3506) JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows

2018-01-22 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334773#comment-16334773 ] Eugene Kirpichov commented on BEAM-3506: Knut - any reason why you're not using SpannerIO for

[jira] [Commented] (BEAM-3501) BigQuery Partitioned table creation/write fails when destination has partition decorator

2018-01-24 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338269#comment-16338269 ] Eugene Kirpichov commented on BEAM-3501: Please include the complete error stacktrace rather than

[jira] [Commented] (BEAM-2530) Make Beam compatible with Java 9

2018-01-16 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328268#comment-16328268 ] Eugene Kirpichov commented on BEAM-2530: Another issue:

[jira] [Updated] (BEAM-2680) Improve scalability of the Watch transform

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-2680: --- Description: [https://github.com/apache/beam/pull/3565] introduces the Watch transform

[jira] [Commented] (BEAM-2680) Improve scalability of the Watch transform

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16339797#comment-16339797 ] Eugene Kirpichov commented on BEAM-2680: Note: as a workaround, normally a user should be able to 

[jira] [Closed] (BEAM-2844) Support implicit side inputs

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-2844. -- Resolution: Won't Fix Fix Version/s: 2.2.0 This was superseded by

[jira] [Closed] (BEAM-2734) Dataflow ValidatesRunner broken at HEAD

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-2734. -- Resolution: Cannot Reproduce Fix Version/s: Not applicable This was fixed a while ago. >

[jira] [Closed] (BEAM-3267) Return file names from TFRecordIO write

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3267. -- Resolution: Fixed Fix Version/s: 2.3.0 FileIO.write() is in, and support for it in

[jira] [Closed] (BEAM-730) Remove Reshuffle transform in favor of Redistribute.byKey()

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-730. - Resolution: Won't Fix Fix Version/s: Not applicable We have Reshuffle.viaRandomKey() for one

[jira] [Commented] (BEAM-3152) AfterProcessingTime trigger doesn't create any file panes

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16339835#comment-16339835 ] Eugene Kirpichov commented on BEAM-3152: Does this issue affect Beam 2.2 or at HEAD? The WriteFiles

[jira] [Commented] (BEAM-3501) BigQuery Partitioned table creation/write fails when destination has partition decorator

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16340173#comment-16340173 ] Eugene Kirpichov commented on BEAM-3501: I tried doing a similar pipeline myself: * Writing to

[jira] [Commented] (BEAM-2840) BigQueryIO write is slow/fail with a bounded source

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16340197#comment-16340197 ] Eugene Kirpichov commented on BEAM-2840: Doing a cleanup pass over BigQuery bugs. Seems this one

[jira] [Closed] (BEAM-2768) Fix bigquery.WriteTables generating non-unique job identifiers

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-2768. -- Resolution: Cannot Reproduce Fix Version/s: Not applicable Closing due to lack of

[jira] [Commented] (BEAM-2776) TextIO should support reading header lines

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16340207#comment-16340207 ] Eugene Kirpichov commented on BEAM-2776: Reducing priority: This is easy to do manually using

[jira] [Updated] (BEAM-2776) TextIO should support reading header lines

2018-01-25 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-2776: --- Priority: Minor (was: Major) > TextIO should support reading header lines >

[jira] [Assigned] (BEAM-3225) Non deterministic behaviour of AfterProcessingTime trigger with multiple group by transformations

2018-02-01 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov reassigned BEAM-3225: -- Assignee: Aljoscha Krettek (was: Eugene Kirpichov) > Non deterministic behaviour of

[jira] [Commented] (BEAM-3225) Non deterministic behaviour of AfterProcessingTime trigger with multiple group by transformations

2018-02-01 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16349669#comment-16349669 ] Eugene Kirpichov commented on BEAM-3225: Thanks for the thorough investigation! Another comment

[jira] [Closed] (BEAM-3506) JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows

2018-01-29 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3506. -- Resolution: Won't Fix Fix Version/s: Not applicable > JdbcIO: Support writing iterables

[jira] [Commented] (BEAM-3615) Dynamic/Default Coder For Data

2018-02-03 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16351519#comment-16351519 ] Eugene Kirpichov commented on BEAM-3615: Thanks for the report. Reading Avro files whose schema is

[jira] [Closed] (BEAM-3615) Dynamic/Default Coder For Data

2018-02-03 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3615. -- Resolution: Not A Problem Assignee: Eugene Kirpichov (was: Xu Mingmin) Fix

[jira] [Closed] (BEAM-3499) Watch can make no progress if a single poll takes more than checkpoint interval

2018-02-06 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3499. -- Resolution: Fixed Fix Version/s: 2.3.0 > Watch can make no progress if a single poll

[jira] [Created] (BEAM-3683) Support BigQuery column-based time partitioning

2018-02-09 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3683: -- Summary: Support BigQuery column-based time partitioning Key: BEAM-3683 URL: https://issues.apache.org/jira/browse/BEAM-3683 Project: Beam Issue Type:

[jira] [Created] (BEAM-3684) Update well-known coder URNs in Go SDK

2018-02-09 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3684: -- Summary: Update well-known coder URNs in Go SDK Key: BEAM-3684 URL: https://issues.apache.org/jira/browse/BEAM-3684 Project: Beam Issue Type: Bug

[jira] [Closed] (BEAM-65) SplittableDoFn

2018-02-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-65?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-65. Resolution: Fixed Fix Version/s: 2.2.0 SDF has been available in the Beam model and implemented

[jira] [Assigned] (BEAM-3696) MQTT IO should compute watermark and ack messages outside of finalizeCheckpoint method

2018-02-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov reassigned BEAM-3696: -- Assignee: Jean-Baptiste Onofré (was: Reuven Lax) > MQTT IO should compute watermark

[jira] [Commented] (BEAM-3698) Support SDF over Fn API

2018-02-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16362952#comment-16362952 ] Eugene Kirpichov commented on BEAM-3698: CC: [~herohde] > Support SDF over Fn API >

[jira] [Closed] (BEAM-3647) Default Coder/Reading Coder From File

2018-02-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3647. -- Resolution: Duplicate Fix Version/s: 2.1.0 I believe this is a duplicate of BEAM-3615

[jira] [Created] (BEAM-3697) Add errorprone to maven and gradle builds

2018-02-13 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3697: -- Summary: Add errorprone to maven and gradle builds Key: BEAM-3697 URL: https://issues.apache.org/jira/browse/BEAM-3697 Project: Beam Issue Type: Bug

[jira] [Created] (BEAM-3698) Support SDF over Fn API

2018-02-13 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3698: -- Summary: Support SDF over Fn API Key: BEAM-3698 URL: https://issues.apache.org/jira/browse/BEAM-3698 Project: Beam Issue Type: Bug Components:

[jira] [Closed] (BEAM-2607) Enforce that SDF must return stop() after a failed tryClaim() call

2018-02-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-2607. -- Resolution: Fixed Fix Version/s: 2.3.0 This was recently fixed. > Enforce that SDF must

[jira] [Created] (BEAM-3714) JdbcIO.read() should create a forward-only, read-only result set

2018-02-15 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3714: -- Summary: JdbcIO.read() should create a forward-only, read-only result set Key: BEAM-3714 URL: https://issues.apache.org/jira/browse/BEAM-3714 Project: Beam

[jira] [Created] (BEAM-3743) Support for SDF splitting protocol in ULR

2018-02-23 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3743: -- Summary: Support for SDF splitting protocol in ULR Key: BEAM-3743 URL: https://issues.apache.org/jira/browse/BEAM-3743 Project: Beam Issue Type:

[jira] [Closed] (BEAM-3698) Support SDF over Fn API

2018-02-23 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3698. -- Resolution: Duplicate Fix Version/s: Not applicable > Support SDF over Fn API >

[jira] [Assigned] (BEAM-2939) Fn API streaming SDF support

2018-02-23 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov reassigned BEAM-2939: -- Assignee: Eugene Kirpichov > Fn API streaming SDF support >

[jira] [Created] (BEAM-3741) Proto changes for splitting over Fn API

2018-02-23 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3741: -- Summary: Proto changes for splitting over Fn API Key: BEAM-3741 URL: https://issues.apache.org/jira/browse/BEAM-3741 Project: Beam Issue Type: Sub-task

[jira] [Commented] (BEAM-2939) Fn API streaming SDF support

2018-02-23 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375125#comment-16375125 ] Eugene Kirpichov commented on BEAM-2939: Proposed design: https://s.apache.org/beam-breaking-fusion

[jira] [Created] (BEAM-3742) Support for running a streaming SDF in Python SDK

2018-02-23 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3742: -- Summary: Support for running a streaming SDF in Python SDK Key: BEAM-3742 URL: https://issues.apache.org/jira/browse/BEAM-3742 Project: Beam Issue Type:

[jira] [Closed] (BEAM-3683) Support BigQuery column-based time partitioning

2018-02-20 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3683. -- Resolution: Fixed Fix Version/s: 2.4.0 > Support BigQuery column-based time partitioning

[jira] [Updated] (BEAM-3647) Default Coder/Reading Coder From File

2018-02-15 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-3647: --- Fix Version/s: (was: 2.1.0) > Default Coder/Reading Coder From File >

[jira] [Reopened] (BEAM-3647) Default Coder/Reading Coder From File

2018-02-15 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov reopened BEAM-3647: Assignee: Anton Kedin I see. My apologies, I misunderstood the question. Assigning to

[jira] [Closed] (BEAM-1542) Need Source/Sink for Spanner

2018-02-26 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-1542. -- Resolution: Fixed Fix Version/s: 2.1.0 This has been in usable shape since 2.1.0. > Need

[jira] [Closed] (BEAM-1581) JSON source and sink

2018-02-26 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-1581. -- Resolution: Won't Fix Fix Version/s: Not applicable AFAICT this has been superseded with

[jira] [Closed] (BEAM-4145) Java SDK Harness populates control request headers with worker id

2018-06-21 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-4145. -- Resolution: Fixed Fix Version/s: 2.6.0 > Java SDK Harness populates control request

[jira] [Created] (BEAM-4792) Add support for bounded SDF to all runners

2018-07-14 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4792: -- Summary: Add support for bounded SDF to all runners Key: BEAM-4792 URL: https://issues.apache.org/jira/browse/BEAM-4792 Project: Beam Issue Type: Bug

[jira] [Commented] (BEAM-4792) Add support for bounded SDF to all runners

2018-07-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16544249#comment-16544249 ] Eugene Kirpichov commented on BEAM-4792: https://github.com/apache/beam/pull/5940 > Add support

[jira] [Assigned] (BEAM-4758) Avro-Protobuf support

2018-07-11 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov reassigned BEAM-4758: -- Assignee: Chamikara Jayalath (was: Eugene Kirpichov) > Avro-Protobuf support >

[jira] [Created] (BEAM-4737) SplittableDoFn dynamic rebalancing in Dataflow

2018-07-06 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4737: -- Summary: SplittableDoFn dynamic rebalancing in Dataflow Key: BEAM-4737 URL: https://issues.apache.org/jira/browse/BEAM-4737 Project: Beam Issue Type:

[jira] [Closed] (BEAM-4206) Python: WordCount runs against manually started Flink at master

2018-07-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-4206. -- Resolution: Fixed Fix Version/s: 2.6.0 > Python: WordCount runs against manually

[jira] [Created] (BEAM-4776) Java PortableRunner should support metrics

2018-07-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4776: -- Summary: Java PortableRunner should support metrics Key: BEAM-4776 URL: https://issues.apache.org/jira/browse/BEAM-4776 Project: Beam Issue Type: Bug

[jira] [Created] (BEAM-4778) Less wasteful ArtifactStagingService

2018-07-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4778: -- Summary: Less wasteful ArtifactStagingService Key: BEAM-4778 URL: https://issues.apache.org/jira/browse/BEAM-4778 Project: Beam Issue Type: Bug

[jira] [Created] (BEAM-4779) Python PortableTestRunner that runs VR tests against a given portable runner

2018-07-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4779: -- Summary: Python PortableTestRunner that runs VR tests against a given portable runner Key: BEAM-4779 URL: https://issues.apache.org/jira/browse/BEAM-4779

[jira] [Created] (BEAM-4780) Entry point for ULR JobService compatible with TestPortableRunner

2018-07-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4780: -- Summary: Entry point for ULR JobService compatible with TestPortableRunner Key: BEAM-4780 URL: https://issues.apache.org/jira/browse/BEAM-4780 Project: Beam

[jira] [Closed] (BEAM-4205) Java: WordCount runs against manually started Flink at master

2018-07-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-4205. -- Resolution: Fixed Fix Version/s: 2.6.0 > Java: WordCount runs against manually started

[jira] [Created] (BEAM-4775) JobService should support returning metrics

2018-07-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4775: -- Summary: JobService should support returning metrics Key: BEAM-4775 URL: https://issues.apache.org/jira/browse/BEAM-4775 Project: Beam Issue Type: Bug

[jira] [Updated] (BEAM-4777) Python PortableRunner should support metrics

2018-07-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-4777: --- Component/s: (was: runner-core) sdk-py-core > Python PortableRunner

[jira] [Updated] (BEAM-4777) Python PortableRunner should support metrics

2018-07-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-4777: --- Description: BEAM-4775 concerns adding metrics to the JobService API; the current issue is

[jira] [Created] (BEAM-4777) Python PortableRunner should support metrics

2018-07-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4777: -- Summary: Python PortableRunner should support metrics Key: BEAM-4777 URL: https://issues.apache.org/jira/browse/BEAM-4777 Project: Beam Issue Type: Bug

[jira] [Commented] (BEAM-4758) Avro-Protobuf support

2018-07-11 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16540614#comment-16540614 ] Eugene Kirpichov commented on BEAM-4758: Hi - thanks for the feedback, this seems like something

[jira] [Created] (BEAM-4745) SDF tests broken by innocent change due to Dataflow worker dependencies

2018-07-09 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-4745: -- Summary: SDF tests broken by innocent change due to Dataflow worker dependencies Key: BEAM-4745 URL: https://issues.apache.org/jira/browse/BEAM-4745 Project:

[jira] [Created] (BEAM-3485) CassandraIO.read() splitting produces invalid queries

2018-01-16 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3485: -- Summary: CassandraIO.read() splitting produces invalid queries Key: BEAM-3485 URL: https://issues.apache.org/jira/browse/BEAM-3485 Project: Beam Issue

[jira] [Created] (BEAM-3499) Watch can make no progress if a single poll takes more than checkpoint interval

2018-01-18 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3499: -- Summary: Watch can make no progress if a single poll takes more than checkpoint interval Key: BEAM-3499 URL: https://issues.apache.org/jira/browse/BEAM-3499

[jira] [Updated] (BEAM-3499) Watch can make no progress if a single poll takes more than checkpoint interval

2018-01-18 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-3499: --- Description: E.g. when using it to poll a filepattern with hundreds of thousands of files, a

[jira] [Commented] (BEAM-3796) Implement TypedWrite extending WriteFilesResult for XmlIO

2018-03-07 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390712#comment-16390712 ] Eugene Kirpichov commented on BEAM-3796: I don't think this is worth doing. FileIO.write is quite

[jira] [Commented] (BEAM-3714) JdbcIO.read() should create a forward-only, read-only result set

2018-02-28 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16381382#comment-16381382 ] Eugene Kirpichov commented on BEAM-3714: Hey Innocent, thanks for taking this! I'll be happy to

[jira] [Closed] (BEAM-1187) GCP Transport not performing timed backoff after connection failure

2018-03-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-1187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-1187. -- Resolution: Fixed Fix Version/s: 2.3.0 This was fixed by 

[jira] [Commented] (BEAM-3849) SolrIO: Expose connection timeout tuning for writes

2018-03-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399036#comment-16399036 ] Eugene Kirpichov commented on BEAM-3849: Could you provide an example where this needs to be tuned?

[jira] [Commented] (BEAM-3820) SolrIO: Allow changing batchSize for writes

2018-03-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399033#comment-16399033 ] Eugene Kirpichov commented on BEAM-3820: I am strongly against this, for all the usual reasons why

[jira] [Updated] (BEAM-2817) Bigquery queries should allow options to run in batch mode or not

2018-03-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov updated BEAM-2817: --- Fix Version/s: (was: 2.4.0) > Bigquery queries should allow options to run in batch mode

[jira] [Reopened] (BEAM-2817) Bigquery queries should allow options to run in batch mode or not

2018-03-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov reopened BEAM-2817: Unfortunately the PR is not correct. I left some comments on

[jira] [Closed] (BEAM-3796) Implement TypedWrite extending WriteFilesResult for XmlIO

2018-03-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3796. -- Resolution: Won't Fix Fix Version/s: Not applicable > Implement TypedWrite extending

[jira] [Commented] (BEAM-3795) Introduce XmlIO.readAll()

2018-03-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16395868#comment-16395868 ] Eugene Kirpichov commented on BEAM-3795: This is also not necessary as XmlIO already provides

[jira] [Closed] (BEAM-3795) Introduce XmlIO.readAll()

2018-03-12 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3795. -- Resolution: Won't Fix Fix Version/s: Not applicable > Introduce XmlIO.readAll() >

[jira] [Created] (BEAM-3832) Streaming Dataflow runner harness should understand BundleSplit returned from ProcessBundle

2018-03-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3832: -- Summary: Streaming Dataflow runner harness should understand BundleSplit returned from ProcessBundle Key: BEAM-3832 URL: https://issues.apache.org/jira/browse/BEAM-3832

[jira] [Created] (BEAM-3834) Python SDK harness should detect SDF ProcessFn and proactively checkpoint it

2018-03-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3834: -- Summary: Python SDK harness should detect SDF ProcessFn and proactively checkpoint it Key: BEAM-3834 URL: https://issues.apache.org/jira/browse/BEAM-3834

[jira] [Created] (BEAM-3835) Streaming Dataflow runner harness should understand a BundleSplit returned during execution of a bundle

2018-03-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3835: -- Summary: Streaming Dataflow runner harness should understand a BundleSplit returned during execution of a bundle Key: BEAM-3835 URL:

[jira] [Created] (BEAM-3833) Java SDK harness should detect SDF ProcessFn and proactively checkpoint it

2018-03-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3833: -- Summary: Java SDK harness should detect SDF ProcessFn and proactively checkpoint it Key: BEAM-3833 URL: https://issues.apache.org/jira/browse/BEAM-3833 Project:

[jira] [Created] (BEAM-3836) Java SDK harness should understand a BundleSplitRequest and respond with a BundleSplit before bundle finishes

2018-03-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3836: -- Summary: Java SDK harness should understand a BundleSplitRequest and respond with a BundleSplit before bundle finishes Key: BEAM-3836 URL:

[jira] [Created] (BEAM-3837) Python SDK harness should understand a BundleSplitRequest and respond with a BundleSplit before bundle finishes

2018-03-12 Thread Eugene Kirpichov (JIRA)
Eugene Kirpichov created BEAM-3837: -- Summary: Python SDK harness should understand a BundleSplitRequest and respond with a BundleSplit before bundle finishes Key: BEAM-3837 URL:

[jira] [Commented] (BEAM-3849) SolrIO: Expose connection timeout tuning for writes

2018-03-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399512#comment-16399512 ] Eugene Kirpichov commented on BEAM-3849: It sounds like we need a retrying wrapper over the Solr

[jira] [Commented] (BEAM-3820) SolrIO: Allow changing batchSize for writes

2018-03-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399530#comment-16399530 ] Eugene Kirpichov commented on BEAM-3820: The lack of control is one of the biggest reasons why we

[jira] [Commented] (BEAM-4016) Direct runner incorrect lifecycle, @SplitRestriction should execute after @Setup on SplittableDoFn

2018-04-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437961#comment-16437961 ] Eugene Kirpichov commented on BEAM-4016: Yeah it's the desired order. The fix is to add a call to

[jira] [Comment Edited] (BEAM-4016) Direct runner incorrect lifecycle, @SplitRestriction should execute after @Setup on SplittableDoFn

2018-04-13 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437961#comment-16437961 ] Eugene Kirpichov edited comment on BEAM-4016 at 4/13/18 9:29 PM: - Yeah it's

[jira] [Commented] (BEAM-4075) Verify that the Build works on MacOS

2018-04-14 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438550#comment-16438550 ] Eugene Kirpichov commented on BEAM-4075: I'm running the build on MacOS. It worked with one small

[jira] [Commented] (BEAM-3268) getPerDestinationOutputFilenames() is getting processed before write is finished on dataflow runner

2018-04-18 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16442731#comment-16442731 ] Eugene Kirpichov commented on BEAM-3268: They are, if it's fused with another ParDo. c.output(x)

[jira] [Commented] (BEAM-4016) Direct runner incorrect lifecycle, @SplitRestriction should execute after @Setup on SplittableDoFn

2018-04-18 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-4016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16443139#comment-16443139 ] Eugene Kirpichov commented on BEAM-4016: Can you clarify about calling it more times than expected?

[jira] [Closed] (BEAM-3714) JdbcIO.read() should create a forward-only, read-only result set

2018-04-20 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Kirpichov closed BEAM-3714. -- Resolution: Fixed Fix Version/s: 2.5.0 > JdbcIO.read() should create a forward-only,

[jira] [Commented] (BEAM-3456) Enable large scale JdbcIOIT Performance Test

2018-04-24 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16451246#comment-16451246 ] Eugene Kirpichov commented on BEAM-3456: Nice! Did we observe an improvement here after

<    1   2   3   4   5   6   7   >