Hi!

> -----Original Message-----
> From: iesg [mailto:[email protected]] On Behalf Of Filippov Alexey
> Sent: Thursday, June 13, 2019 1:15 PM
> To: Roman Danyliw <[email protected]>; Adam Roach <[email protected]>;
> The IESG <[email protected]>
> Cc: [email protected]; [email protected]; draft-ietf-netvc-
> [email protected]; Mo Zanaty <[email protected]>
> Subject: RE: Roman Danyliw's Discuss on draft-ietf-netvc-requirements-09:
> (with DISCUSS and COMMENT)
> 
> Dear Roman,
> 
> Thank you a lot for your comments and raising the important questions.
> 
> > How should a set of QPs be specified?
> As you probably know, a set of available QP values can be codec specific.
> Usually, a QP set is selected to cover medium bit-rate range that is
> considered to be the most complex for compressing. We proposed to set up
> the assessment process not only for this range but also for low and high QP
> ranges to cover a wider range of applications. According to my
> understanding, concrete QP values should be specified for candidate codecs
> in draft-ietf-netvc-testing

See below.  Can you help with a specific pointer into draft-ietf-netvc-testing 
which lists candidate codecs+QPs.

> 
> > How should the quality values be calculated?
> It is explained in Section 4.1, namely: “To assess the quality of output
> (decoded) sequences, two indexes, PSNR [3] and MS-SSIM [3,11] are
> separately computed. In the case of the YCbCr color format, PSNR should be
> calculated for each color plane whereas MS-SSIM is calculated for luma
> channel only. In the case of the RGB color format, both metrics are
> computed for R, G and B channels. Thus, for each sequence, 30 RD-points for
> PSNR (i.e. three RD-curves, one for each channel) and 10 RD-points for MS-
> SSIM (i.e. one RD-curve, for luma channel only) should be calculated in the
> case of YCbCr. If content is encoded as RGB, 60 RD-points (30 for PSNR and
> 30 for MS-SSIM) should be calculated, i.e. three RD-curves (one for each
> channel) are computed for PSNR as well as three RD-curves (one for each
> channel) for MS-SSIM.” In references [3, 11], these 2 quality assessment
> metrics and the ways of how to calculate them are described in detail.

Got it.  Thanks.

> >-- What does the text “(in a separate document on Internet video codec
> testing)” mean?
> > Per “A list of video sequences that should be used for testing as well as 
> > the
> 10 QP values for the reference codec are defined in a separate document ",
> what document is that?  Is it draft-ietf-netvc-testing?
> According to my understanding, it is draft-ietf-netvc-testing.

Can you please help me with the specific section references in 
draft-ietf-netvc-testing as I'm not seeing it -- where is the explicit 
reference codec being named and its associate 10 QP values?  For what it's 
worth I see Section 4.1 of this draft saying that:

   As
   the reference for evaluation, state-of-the-art video codecs such as
   HEVC/H.265 [4,5] or VP9  must be used. The reference source code of
   the HEVC/H.265 codec can be found at [6]. The HEVC/H.265 codec must
   be configured according to [13] and Table 9.

I'm looking for something that either unambiguously points to the reference 
codec+QPs; or clearer language that says that all of this is out of scope in 
this and/or the -testing draft, but is clear on what input into the processes 
of this draft is required.

> >What does “codec implementation (for both an encoder and a decoder)
> should cover the worst case of computational complexity, memory
> bandwidth, and physical memory size” mean?
> I’d like to thank Adam for his comment. I also think that his formulation
> ("...should take into consideration the worst-case...") is clearer.
> A more detailed explanation on this Section can be found in my response to
> the secdir reviewer: https://mailarchive.ietf.org/arch/msg/secdir/QYx-
> hl07aec5XhgqkldqgWFG3Is

I'd like to offer a further refinement of what Adam proposed:

Original:
However, it is worth noting that a codec implementation (for both an encoder 
and a decoder) should cover the worst case of computational complexity, memory 
bandwidth, and physical memory size (e.g., for  decoded pictures used as 
references).

Adam:
However, it is worth noting that a codec implementation (for both an encoder 
and a decoder) should take into consideration the worst-case computational 
complexity, memory bandwidth, and physical memory size (e.g., for decoded 
pictures used as references).

Roman+Adam:
However, it is worth noting that a codec implementation (for both an encoder 
and a decoder) should take into consideration the worst-case computational 
complexity, memory bandwidth, and physical memory size needed to processes the 
input (e.g., the decoded pictures used as references).

> >Please add additional language that codec should be written in a defensive
> style as they will be processing untrusted input.
> Good point. Thank you. I’ll add it.

Regards,
Roman

> 
> --
> Best regards,
> Alexey Filippov
> 
> -----Original Message-----
> From: Roman Danyliw via Datatracker [mailto:[email protected]]
> Sent: Wednesday, June 12, 2019 4:25 AM
> To: The IESG <[email protected]>
> Cc: [email protected]; Mo Zanaty
> <[email protected]>; [email protected]; [email protected]; video-
> [email protected]
> Subject: Roman Danyliw's Discuss on draft-ietf-netvc-requirements-09: (with
> DISCUSS and COMMENT)
> 
> Roman Danyliw has entered the following ballot position for
> draft-ietf-netvc-requirements-09: Discuss
> 
> When responding, please keep the subject line intact and reply to all email
> addresses included in the To and CC lines. (Feel free to cut this introductory
> paragraph, however.)
> 
> 
> Please refer to https://www.ietf.org/iesg/statement/discuss-criteria.html
> for more information about IESG DISCUSS and COMMENT positions.
> 
> 
> The document, along with other ballot positions, can be found here:
> https://datatracker.ietf.org/doc/draft-ietf-netvc-requirements/
> 
> 
> 
> ----------------------------------------------------------------------
> DISCUSS:
> ----------------------------------------------------------------------
> 
> (1) I worry that the level of detail in the in the Compression Performance
> Evaluation (Section 4.1) is insufficient for implementation.  Specifically:
> 
> (a) Per “Initially, for the codec selected as a reference one (e.g., HEVC or
> VP9), a set of 10 QP quantization parameter) values should be specified (in a
> separate document on Internet video codec testing) and corresponding
> quality values should be calculated.”
> 
> -- How should a set of QPs be specified?
> 
> --How should the quality values be calculated?
> 
> -- What does the text “(in a separate document on Internet video codec
> testing)” mean?
> 
> (b) Per “A list of video sequences that should be used for testing as well as
> the 10 QP values for the reference codec are defined in a separate document
> ", what document is that?  Is it draft-ietf-netvc-testing?
> 
> (2) Per the Security Considerations Section (Section 5)
> 
> -- What does “codec implementation (for both an encoder and a decoder)
> should cover the worst case of computational complexity, memory
> bandwidth, and physical memory size” mean?
> 
> -- Please add additional language that codec should be written in a
> defensive style as they will be processing untrusted input.
> 
> 
> ----------------------------------------------------------------------
> COMMENT:
> ----------------------------------------------------------------------
> 
> (1) There is something odd about the document formatting – the title and
> the first author last name in the footer is wrapped in “< … >”
> 
> (2) It would be helpful to forward reference that acronyms are explained in
> Appendix A.
> 
> (3) This draft uses the words should and must to prescribe action.  Why
> wasn’t
> RFC2119 cited to explain these words?
> 
> (4) Section 2.0.  A reference to explain “YCbCr 4:2:0” would be helpful
> 
> (5) Section 2.1. Per “high encoder complexity” and “decoding complexity”, I
> initiate read that as a qualitative measure.  However, the text says “up to
> 10x and more” so that implies some quantitative measure.  What is that?
> 
> (6) Section 2.1.  Expand QP values on first use
> 
> (7) Section 2.x.  The language around content doesn’t appear to be consist.
> For example:
> 
> -- Section 2.1, Internet Video Streaming says “movies, TV-series and shows,
> and animation.”
> 
> -- Section 2.2, IPTV says “television content”
> 
> -- Section 2.5, Screen casting says “business presentations …, animation
> (cartoons), gaming content, data visualization, …, virtual desktop
> infrastructure (VDI), screen/desktop sharing and collaboration, supervisory
> control and data acquisition (SCADA) display, automotive/navigation
> display, cloud gaming, factory automation display, wireless display, display
> wall, digital operating room (DiOR), etc.
> 
> What the difference between Section 2.1’s animation and Section 2.5’s
> cartoons?
> 
> What’s the different between Section 2.1’s “movies, TV series …” and Section
> 2.2’s “television content”?
> 
> (8) Section 2.5.  The sentence “Currently, …” is very challenging to parse as 
> it
> includes inline “i.e.,” and “etc”.
> 
> (9) Section 2.5.  Per “powerpoint, word documents”, these are specific
> Microsoft products.  I recommend using more generic names.
> 
> (10) Section 4.  I found it confusing that an evaluation methodology was in a
> requirements document.  I would have expected it in the draft-ietf-netvc-
> testing
> 
> (11) Section 4.1.  VP9 needs a reference.
> 
> (12) Section 6.  I don’t think this entire section is necessary.
> 
> (13) Editorial Nits:
> -- Section 3.  Style nit.  s/chapter/section/
> 
> -- Section 4.1. Typo.  s/computged/computed
> 

_______________________________________________
video-codec mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/video-codec

Reply via email to