[
https://issues.apache.org/jira/browse/BEAM-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16878502#comment-16878502
]
Ryan Skraba commented on BEAM-881:
----------------------------------
[~jbonofre] – Two years later, it seems obvious that
org.apache.beam.sdk.values.Row ("immutable tuple-like schema to represent one
element in a PCollection") is the gold standard that meets all of the
requirements in this feature! What do you think?
> Provide a PTransform in IOs providing a "standard" Avro IndexedRecord
> ---------------------------------------------------------------------
>
> Key: BEAM-881
> URL: https://issues.apache.org/jira/browse/BEAM-881
> Project: Beam
> Issue Type: New Feature
> Components: io-ideas
> Reporter: Jean-Baptiste Onofré
> Assignee: Jean-Baptiste Onofré
> Priority: Major
>
> Now, each IO is using a different data format. For instance, the
> {{JmsIO.Read}} provides a {{PCollection}} of {{JmsRecord}} (and
> {{JmsIO.Write}} expects also a {{JmsRecord}}), {{KafkaIO.Read}} provides a
> {{PCollection}} of {{KafkaRecord}}.
> It could appear a bit "complex" for users to manipulate such kind of data
> format: some users may expect kind of standard format.
> Without modifying the existing IO, we could add a {{PTransform}} (as part of
> the IO) that an user can optionally use. This transform will convert the IO
> data format (let say {{JmsRecord}} for instance) to a standard Avro
> {{IndexedRecord}}.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)