[ 
https://issues.apache.org/jira/browse/BEAM-9189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-9189:
--------------------------------
    Priority: P3  (was: P2)

> Add Daffodil IO for Apache Beam
> -------------------------------
>
>                 Key: BEAM-9189
>                 URL: https://issues.apache.org/jira/browse/BEAM-9189
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-java-core
>            Reporter: Brian Hulette
>            Priority: P3
>              Labels: gsoc, stale-P2
>
> From https://daffodil.apache.org/:
> {quote}Daffodil is an open source implementation of the DFDL specification 
> that uses these DFDL schemas to parse fixed format data into an infoset, 
> which is most commonly represented as either XML or JSON. This allows the use 
> of well-established XML or JSON technologies and libraries to consume, 
> inspect, and manipulate fixed format data in existing solutions. Daffodil is 
> also capable of the reverse by serializing or “unparsing” an XML or JSON 
> infoset back to the original data format.
> {quote}
> We should create a Beam IO that accepts a DFDL schema as an argument and can 
> then produce and consume data in the specified format. I think it would be 
> most natural for Beam users if this IO could produce Beam Rows, but an 
> initial version that just operates with Infosets could be useful as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to