Traditionally the way to do this is with a two-pass approach. Have a schema that
parses just the headers and treats the payloads as hexBinary/blobs. After
parsing, extract the hexBinary/blob payloads, concatenate them together, and
then parse that result with a different schema.
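As a rough illustration (the packet/header element names, the ex:headerType, and
the payloadLength field are all made up here), the first-pass schema might look
something like:

<element name="file">
  <complexType>
    <sequence>
      <element name="packet" maxOccurs="unbounded">
        <complexType>
          <sequence>
            <element name="header" type="ex:headerType" />
            <element name="payloadFragment" type="xs:hexBinary"
              dfdl:lengthKind="explicit" dfdl:lengthUnits="bytes"
              dfdl:length="{ ../header/payloadLength }" />
          </sequence>
        </complexType>
      </element>
    </sequence>
  </complexType>
</element>

Application code then pulls the payloadFragment values out of the infoset,
concatenates the bytes, and hands the result to the second schema.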
A possible alternative (though I don't think this has been done before) that
would do it in one pass might be to use a custom Layer. The new Layer would read
the Headers and Extended Headers and pass through only the payloads, so that
Daffodil only ever sees the reassembled payload. Your schema then just becomes
something like this:
<element name="file">
<complexType>
<sequence>
<sequence dfdl:layer="PayloadRessembler">
<element ref="ex:payload" />
</sequence>
</sequence>
</complexType>
</element>
And your "payload" element can assume the it's just parsing the assembled
payload.
This would mean that parsing of the Header and Extended Header is done in code
in the Layer and wouldn't even be part of the infoset, which isn't necessarily
ideal (often the point of Daffodil is to avoid code specific to one format), but
with small enough headers it's maybe not a big deal.
And on unparsing, the layer would have to recreate the Headers/Extended Headers
and split the payload back into packets.
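For the parse direction, the reassembly logic inside such a layer could be a
stream wrapper along these lines. This is only a sketch: it leaves out the
Daffodil Layer API itself (whose hooks differ between releases) and assumes a
made-up packet layout of a 2-byte big-endian payload length followed by the
payload bytes, so the class name and header format are purely illustrative.

import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;

/* Hypothetical parse-side wrapper: strips packet headers and passes
 * through only the payload bytes, so the consumer sees one contiguous
 * reassembled payload. */
public class PayloadReassemblerStream extends InputStream {
    private final DataInputStream in;
    private int remainingInPacket = 0; // payload bytes left in the current packet
    private boolean done = false;

    public PayloadReassemblerStream(InputStream underlying) {
        this.in = new DataInputStream(underlying);
    }

    @Override
    public int read() throws IOException {
        while (remainingInPacket == 0 && !done) {
            try {
                // Decode the next header; here that is just a payload length,
                // but real Header/Extended Header parsing would happen here.
                remainingInPacket = in.readUnsignedShort();
            } catch (EOFException e) {
                done = true; // no more packets
            }
        }
        if (done) return -1;
        remainingInPacket--;
        return in.read(); // hand one payload byte through to Daffodil
    }
}

The unparse side would be the mirror image: an output wrapper that buffers the
payload bytes Daffodil writes, chops them into packet-sized chunks, and emits a
freshly built Header/Extended Header in front of each chunk.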
On 2024-04-10 02:07 PM, Larry Barber wrote:
Does anyone know of a way to handle data that is split into separate pieces?
I can parse the payload normally, but due to variable length fields, etc., it
can span multiple packets – as shown in the diagram below.
I can’t think of a way to allow parsing of the first packet to complete without
all of the data being present, and then continuing the parse in the second (or
third, fourth, etc.) packet(s).