Re: [libav-devel] [PATCH] movenc: factor initial_padding into edit lists

Martin Storsjö Mon, 20 Feb 2017 14:27:53 -0800

On Mon, 20 Feb 2017, John Stebbins wrote:

initial_padding was getting added to the edit list indirectly due to
initial negative dts.  But in cases where the audio is delayed,
all or part of initial_padding would be unaccounted for.  This patch
makes initial_padding explicit.
---
libavformat/movenc.c | 53 +++++++++++++++++++++++++++++++++++++---------------
1 file changed, 38 insertions(+), 15 deletions(-)


diff --git a/libavformat/movenc.c b/libavformat/movenc.c
index 689291d..b2c0c92 100644
--- a/libavformat/movenc.c
+++ b/libavformat/movenc.c
@@ -1698,9 +1698,28 @@ static int mov_write_edts_tag(AVIOContext *pb, 
MOVMuxContext *mov,
                                      track->timescale, AV_ROUND_UP);
    int version = duration < INT32_MAX ? 0 : 1;
    int entry_size, entry_count, size;
-    int64_t delay, start_ct = track->start_cts;
-    delay = av_rescale_rnd(track->start_dts + start_ct, MOV_TIMESCALE,
+    int64_t delay;
+    int64_t mediatime;
+    int64_t skip = 0;
+
+    delay = av_rescale_rnd(track->start_dts + track->start_cts, MOV_TIMESCALE,
                           track->timescale, AV_ROUND_DOWN);
+
+    if (track->par->codec_type == AVMEDIA_TYPE_AUDIO &&
+        track->par->initial_padding > 0) {
+        /* Adjust delay so that initial_padding gets recorded in the
+         * MediaTime of an edit list entry even in the case that
+         * delay is positive. I.e. we don't want initial_padding to be
+         * absorbed and hidden in the delay. MediaTime must contain
+         * initial_padding in order to know where the actual media
+         * timeline begins. A player should drop samples until MediaTime
+         * is reached */
+        delay += av_rescale_rnd(track->par->initial_padding, MOV_TIMESCALE,
+                                track->par->sample_rate, AV_ROUND_DOWN);

+        skip = av_rescale_rnd(track->par->initial_padding,
+                              track->timescale,
+                              track->par->sample_rate, AV_ROUND_DOWN);
+    }
    version |= delay < INT32_MAX ? 0 : 1;

    entry_size = (version == 1) ? 20 : 12;
@@ -1731,33 +1750,37 @@ static int mov_write_edts_tag(AVIOContext *pb, 
MOVMuxContext *mov,
        }
        avio_wb32(pb, 0x00010000);
    } else {
-        /* Avoid accidentally ending up with start_ct = -1 which has got a
-         * special meaning. Normally start_ct should end up positive or zero
-         * here, but use FFMIN in case dts is a a small positive integer
-         * rounded to 0 when represented in MOV_TIMESCALE units. */
-        start_ct  = -FFMIN(track->start_dts, 0);
-        /* Note, this delay is calculated from the pts of the first sample,
-         * ensuring that we don't reduce the duration for cases with
-         * dts<0 pts=0. */
-        duration += delay;
+        /* Avoid accidentally ending up with mediatime = -1 which has got a
+         * special meaning. skip and -track->start_dts are guaranteed to be
+         * positive here, so it is not possible mediatime to be -1 */
+        skip = FFMAX(skip, -track->start_dts - track->start_cts);

I don't see how it's guaranteed that those numbers arepositive/nonnegative. Example:


track->timebase = 44100
track->start_dts = 10

This should end up with delay = 0, which hits this branch (delay > 0doesn't trigger), ending up with -track->start_dts as a negative value.

With the FFMAX here though, it does seem to guarantee the same, so itprobably is fine technically, but the comment has me confused here.

With the logic changed, I think this comment actually adds more confusionthan what it helps. It could perhaps instead explain what this achievesinstead, i.e. if start_pts < 0, we should also skip some data, just likefor initial padding, and skipping according to whichever of them islarger.

Overall I like where you're going with this. I see that this passesfate-movenc, so it doesn't change the actual output, at last for the casestested there.

I'd appreciate if you could add a few cases in that test(libavformat/tests/movenc.c) to validate all the different corner casesthat this code touches. This test writes synthetic test data in differentsetups and hashes the results. By running it as ./libavformat/tests/movenc-w, it will write the actual output files to disk. Since it's syntheticdata (i.e. broken/fake packet payloads), it's mostly interesting to lookat the produced edit lists with boxdumper or so.

E.g. first test something like video with b-frames starting from bothpts=0 and pts > 0 (it does test the former but not the latter iirc). Ifthat test is added before doing this change, we can be sure that we don'tbreak/change those parts that aren't intended to be changed.

For audio, this test currently doesn't set the initial_padding field atall. The test could be extended to test audio both with and without thatset

And later we can add cases for audio with initial padding starting frompts<0 but where the padding makes it start with a delay > 0, etc.

This function is really hard to grasp fully in general. I played with thethought of trying to achieve the same as you're doing, but by refactoringthrough a few smaller steps, to keep track of it better. On the otherhand, I'm not sure if the main change can be done much smaller than whatyou do here - the more I think about it, the closer I end up with exactlywhat you have here.

After reading all of the code a few more times while typing this reviewcomment, I'm pretty convinced that your patch is good. But then my mainrequest would be to extend the movenc unit test file, to have moreextensive tests of the cases that shouldn't be changed, and new tests forcases with/without initial_padding set.


// Martin
_______________________________________________
libav-devel mailing list
[email protected]
https://lists.libav.org/mailman/listinfo/libav-devel

Re: [libav-devel] [PATCH] movenc: factor initial_padding into edit lists

Reply via email to