[
https://issues.apache.org/jira/browse/BEAM-6730?focusedWorklogId=211027&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-211027
]
ASF GitHub Bot logged work on BEAM-6730:
----------------------------------------
Author: ASF GitHub Bot
Created on: 11/Mar/19 14:44
Start Date: 11/Mar/19 14:44
Worklog Time Spent: 10m
Work Description: mxm commented on issue #7875: [BEAM-6730] Expose Java
SDK's GenerateSequence in Python
URL: https://github.com/apache/beam/pull/7875#issuecomment-471568266
Hey Cham, I was out of office the past week. Here's what I proposed on the
mailing list:
Changes to the PR:
1) Remove GenerateSequencePayload
2) Add generic ExternalTransformPayload, for example:
```
message ExternalTransformPayload {
  enum Enum {
    PROPERTIES = 0
        [(beam_urn) = "beam:external:transform:external_transform:v1"];
  }
  // The format may change to map<string, bytes> if types are supported.
  map<string, string> parameters = 1;
}
```
I think we haven't fully figured out the wire format, but we have agreed
that it is a list of parameters which use a predefined set of data types. For
the scope of the PR, I'll stick with strings.
3) Define a POJO in GenerateSequence that holds the parameters
4) Add a schema-based POJO <=> ExternalTransformPayload conversion
5) Let the Python wrappers use ExternalTransformPayload for the request to
the ExpansionService
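As a rough illustration of steps 3) to 5) with the string-only format, here is a
minimal Python sketch of the round trip between a config object and the
`parameters` map. Everything in it is an assumption for illustration only:
`GenerateSequenceConfig`, its `start`/`stop` fields, and the two helper
functions are hypothetical names, not part of the PR or of any Beam API, and a
plain dict stands in for the `map<string, string>` field.

```python
from dataclasses import dataclass, fields
from typing import Dict, Optional


@dataclass
class GenerateSequenceConfig:
    """Hypothetical stand-in for the parameter POJO from step 3."""
    start: int
    stop: Optional[int] = None


def to_parameters(config) -> Dict[str, str]:
    """Flatten a config object into a string map, skipping unset fields."""
    return {
        f.name: str(getattr(config, f.name))
        for f in fields(config)
        if getattr(config, f.name) is not None
    }


def from_parameters(parameters: Dict[str, str]) -> GenerateSequenceConfig:
    """Rebuild the config from the string map; a missing 'stop' stays None."""
    return GenerateSequenceConfig(
        start=int(parameters["start"]),
        stop=int(parameters["stop"]) if "stop" in parameters else None,
    )


# Round trip: config -> string map (the payload's parameters) -> config.
params = to_parameters(GenerateSequenceConfig(start=0, stop=100))
assert params == {"start": "0", "stop": "100"}
assert from_parameters(params) == GenerateSequenceConfig(start=0, stop=100)
```

With strings only, the receiving side has to know each field's type in order to
parse it, which is what the schema-based conversion in step 4 would provide.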
Issue Time Tracking
-------------------
Worklog Id: (was: 211027)
Time Spent: 4h 20m (was: 4h 10m)
> Expose Java transforms (specifically IO) in other SDKs
> ------------------------------------------------------
>
> Key: BEAM-6730
> URL: https://issues.apache.org/jira/browse/BEAM-6730
> Project: Beam
> Issue Type: New Feature
> Components: runner-flink, sdk-java-core, sdk-py-core
> Reporter: Maximilian Michels
> Assignee: Maximilian Michels
> Priority: Major
> Time Spent: 4h 20m
> Remaining Estimate: 0h
>
> Since https://github.com/apache/beam/pull/7316 we can reference external
> transforms, which are transforms only available in a "foreign" SDK. This
> allows us to fill the gap in terms of missing transforms in the Python and Go
> SDKs, specifically IO transforms.
> We can start collecting/exposing transforms that Beam users need. The
> following transforms could be interesting:
> - KafkaIO / KinesisIO
> - CassandraIO / ElasticsearchIO / HBase / Redis
> - JDBC
> - S3 file system
> - GenerateSequence
> See also https://s.apache.org/beam-cross-language-io and BEAM-6485.
> CC [~robertwb] [~chamikara] [~thw]