Comments on draft-michel-quic-fec-01

Christian Huitema Thu, 18 Jan 2024 22:30:18 -0800

François, Olivier,

I just spent some time studying your draft on QUIC FEC. I like the ideaof having an FEC framework independent from the algorithm used toactually compute the FEC data and repair packets. Your draft solves anumber of practical problems, such as how to notify peers when FEC helpsreceive a frame from an otherwise lost packet, or how to identify"symblos" independently of packet numbers using the symbol identifierframe (SID).


The draft is obviously a work in progress.

You propose two alternatives for linking frames to a SID. I wish youpicked just one, and I prefer your first alternative, in which your SIDframes brackets a list of protected frames. However, I an not quite surehow this should be parsed. You give an example as:


  | Pkt(6)[STREAM(2, "xyz"),                                    |
  |        SOURCE_SYMBOL(1, { STREAM(8, "def"),                 |
  |                           DATAGRAM("msg") }]                |

In that example, the frame  STREAM(8..) and DATAGRAM() are protected,
while the "STREAM(2)" is not. Fine, but the syntax is described as:

SOURCE_SYMBOL {
  SID (i),
  FEC Protected Payload (..)
}

... and I don't know how to parse that. There is no indication of thelength of the "FEC Protected Payload". Do you mean to indicate that theSOURCE_SYMBOL frame extends to the end of the packet, and that allframes following the SID are protected?

You define a framework in which client and server negotiate to use FEC,and also to select a FEC scheme. The syntax of your transport parameterseems a bit restrictive: the client proposes to use FEC and a specificscheme, and the server accepts or refuse. Given the experimental natureof FEC, I expect that we will try several algorithms. It would be nicefor the client to propose a list, and for the server to pick one -- orzero, if it does not support any of the proposed values. In fact, Ithink that you could merge the "enable FEC" parameter that negotiatesuse of FEC with the "decoder FEC scheme" negotiation.

Your draft does not assign identifiers to existing FEC schemes. Tofacilitate interop tests, I suggest that you define at least one. Infact, I would suggest a very simple one, in which the REPAIR frameidentifies a range of SID, and then carries the XOR of all packets inthat range.

The suggestion above brings a discussion of the relative size of the"FEC Protected Payload" and the REPAIR frames. As in the example above,I would expect REPAIR frames to include a small header followed by acombination of the content of several FEC Protected Payload, with thatcombination being at least as long as the longest FEC Protected Payloadin the set. That longest size, by default, can be a full packet payload(per PMTU), minus the length of the SID prefix. But that leave verylittle room for encoding the prefix of the REPAIR frame, which is likelyto require at list the REPAIR frame type (arguably same length as theSOURCE_SYMBOL frame type), and SID identifying the range (same length asthe SID parameter of the SOURCE_SYMBOL frame), and an additionalparameter indicating the variant of he repair frame according to theselected scheme (arguably the same length as the coding window). Is thatthe problem that you are discussing in section 4.2.3? Should there besome property associated to the FEC scheme, such as the maximum overheadof a REPAIR frame? (Also, why pad the FEC-protected data at thebeginning rather than at the end? Or leave that as a property of the FECscheme?)

I am not sure that I fully understand how to use the FEC WINDOW frame.You allow it to change, but what if the packet containing that frame islost? How can the peer know when exactly the use of the new windowstarts, and which window is associated with a particular SOURCE_SYMBOLor REPAIR frame?

Reed-Solomon codes are often characterized by two numbers, the length ofthe coding window and the number of redundant copies -- in our case, thenumber of REPAIR frames for a given coding window. It seems that in yourproposal these two numbers are set arbitrarily by the sender. Shouldthere me some negotiation of maximum values? Or would those maximumvalues be deduced from the scheme identifier, something like "reedsolomon 32 + 8"? Or should the "repair" frame indicate the length of thecoding widow over which it operates?

I am also not sure how the update of the coding window works for aconvolutional code.

One way to understand the coding window is "the number of frames overwhich a given REPAIR may operate," but we are concerned with correlatedlosses happening in trains. To protect against that, it is nice to sendthe repair frames some time after the protected frames, in which casethe window would an indication of how long a copy of a given frame hasto be kept. This could be expressed as a number of packets, butif multipath is supported we may want to send the repairs on a differentpath, and then using number of packets is not natural.

OK, that's a lot of text. Some of that may be because I did not fullyunderstand your intent. I expect things to get clearer with your nextdraft, or when we start interop testing of different implementations...


Waiting to work on that!

-- Christian Huitema

Comments on draft-michel-quic-fec-01

Reply via email to