[FFmpeg-trac] #9675(avfilter:new): -vf overlay is not accurate with RGB input using embedded RGBA alpha channel

FFmpeg Wed, 02 Mar 2022 12:00:27 -0800

#9675: -vf overlay is not accurate with RGB input using embedded RGBA alpha
channel
----------------------------------+---------------------------------------
             Reporter:  pdr0      |                     Type:  defect
               Status:  new       |                 Priority:  normal
            Component:  avfilter  |                  Version:  unspecified
             Keywords:            |               Blocked By:
             Blocking:            |  Reproduced by developer:  0
Analyzed by developer:  0         |
----------------------------------+---------------------------------------
 Summary of the bug:
 -vf overlay is not accurate with RGB input using embedded RGBA alpha
 channel


 How to reproduce:
 {{{
 "D:\\_DOWNLOADS\\ffmpeg-master-latest-win64-gpl_20220302\\bin\\ffmpeg"
 -report -i black.png -i white_withalpharamp.png -filter_complex
 "[0:0][1:0]overlay" ffmpegoverlay_20220302.png -y
 ffmpeg version N-105822-g4b72bca6ca-20220302 Copyright (c) 2000-2022 the
 FFmpeg developers
   built with gcc 11.2.0 (crosstool-NG 1.24.0.533_681aaef)
   configuration: --prefix=/ffbuild/prefix --pkg-config-flags=--static
 --pkg-config=pkg-config --cross-prefix=x86_64-w64-mingw32- --arch=x86_64
 --target-os=mingw32 --enable-gpl --enable-version3 --disable-debug
 --disable-w32threads --enable-pthreads --enable-iconv --enable-libxml2
 --enable-zlib --enable-libfreetype --enable-libfribidi --enable-gmp
 --enable-lzma --enable-fontconfig --enable-libvorbis --enable-opencl
 --disable-libpulse --enable-libvmaf --disable-libxcb --disable-xlib
 --enable-amf --enable-libaom --enable-avisynth --enable-libdav1d --enable-
 libdavs2 --disable-libfdk-aac --enable-ffnvcodec --enable-cuda-llvm
 --enable-frei0r --enable-libgme --enable-libass --enable-libbluray
 --enable-libmp3lame --enable-libopus --enable-librist --enable-libtheora
 --enable-libvpx --enable-libwebp --enable-lv2 --enable-libmfx --enable-
 libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264
 --enable-libopenjpeg --enable-libopenmpt --enable-librav1e --enable-
 librubberband --enable-schannel --enable-sdl2 --en  libavutil      57.
 22.100 / 57. 22.100
   libavcodec     59. 21.103 / 59. 21.103
   libavformat    59. 17.102 / 59. 17.102
   libavdevice    59.  5.100 / 59.  5.100
   libavfilter     8. 27.100 /  8. 27.100
   libswscale      6.  5.100 /  6.  5.100
   libswresample   4.  4.100 /  4.  4.100
   libpostproc    56.  4.100 / 56.  4.100
 Splitting the commandline.
 Reading option '-report' ... matched as option 'report' (generate a
 report) with argument '1'.
 Reading option '-i' ... matched as input url with argument 'black.png'.
 Reading option '-i' ... matched as input url with argument
 'white_withalpharamp.png'.
 Reading option '-filter_complex' ... matched as option 'filter_complex'
 (create a complex filtergraph) with argument '[0:0][1:0]overlay'.
 Reading option 'ffmpegoverlay_20220302.png' ... matched as output url.
 Reading option '-y' ... matched as option 'y' (overwrite output files)
 with argument '1'.
 Finished splitting the commandline.
 Parsing a group of options: global .
 Applying option report (generate a report) with argument 1.
 Applying option filter_complex (create a complex filtergraph) with
 argument [0:0][1:0]overlay.
 Applying option y (overwrite output files) with argument 1.
 Successfully parsed a group of options.
 Parsing a group of options: input url black.png.
 Successfully parsed a group of options.
 Opening an input file: black.png.
 [NULL @ 000000c64e433b40] Opening 'black.png' for reading
 [file @ 000000c64c90a040] Setting default whitelist 'file,crypto,data'
 [png_pipe @ 000000c64e433b40] Format png_pipe probed with size=2048 and
 score=99
 [png_pipe @ 000000c64e433b40] Before avformat_find_stream_info() pos: 0
 bytes read:285 seeks:0 nb_streams:1
 [png_pipe @ 000000c64e433b40] After avformat_find_stream_info() pos: 285
 bytes read:285 seeks:0 frames:1
 Input #0, png_pipe, from 'black.png':
   Duration: N/A, bitrate: N/A
   Stream #0:0, 1, 1/25: Video: png, rgb24(pc), 256x256, 25 fps, 25 tbr, 25
 tbn
 Successfully opened the file.
 Parsing a group of options: input url white_withalpharamp.png.
 Successfully parsed a group of options.
 Opening an input file: white_withalpharamp.png.
 [NULL @ 000000c64c90b0c0] Opening 'white_withalpharamp.png' for reading
 [file @ 000000c64c90c740] Setting default whitelist 'file,crypto,data'
 [png_pipe @ 000000c64c90b0c0] Format png_pipe probed with size=2048 and
 score=99
 [png_pipe @ 000000c64c90b0c0] Before avformat_find_stream_info() pos: 0
 bytes read:972 seeks:0 nb_streams:1
 [png_pipe @ 000000c64c90b0c0] After avformat_find_stream_info() pos: 972
 bytes read:972 seeks:0 frames:1
 Input #1, png_pipe, from 'white_withalpharamp.png':
   Duration: N/A, bitrate: N/A
   Stream #1:0, 1, 1/25: Video: png, rgba(pc), 256x256, 25 fps, 25 tbr, 25
 tbn
 Successfully opened the file.
 Parsing a group of options: output url ffmpegoverlay_20220302.png.
 Successfully parsed a group of options.
 Opening an output file: ffmpegoverlay_20220302.png.
 Successfully opened the file.
 detected 8 logical cores
 Stream mapping:
   Stream #0:0 (png) -> overlay
   Stream #1:0 (png) -> overlay
   overlay:default -> Stream #0:0 (png)
 Press [q] to stop, [?] for help
 cur_dts is invalid st:0 (0) [init:0 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 cur_dts is invalid st:0 (0) [init:0 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 cur_dts is invalid st:0 (0) [init:0 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 cur_dts is invalid st:0 (0) [init:0 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 [graph 0 input from stream 0:0 @ 000000c64e46c600] Setting 'video_size' to
 value '256x256'
 [graph 0 input from stream 0:0 @ 000000c64e46c600] Setting 'pix_fmt' to
 value '2'
 [graph 0 input from stream 0:0 @ 000000c64e46c600] Setting 'time_base' to
 value '1/25'
 [graph 0 input from stream 0:0 @ 000000c64e46c600] Setting 'pixel_aspect'
 to value '0/1'
 [graph 0 input from stream 0:0 @ 000000c64e46c600] Setting 'frame_rate' to
 value '25/1'
 [graph 0 input from stream 0:0 @ 000000c64e46c600] w:256 h:256
 pixfmt:rgb24 tb:1/25 fr:25/1 sar:0/1
 [graph 0 input from stream 1:0 @ 000000c64e46ca00] Setting 'video_size' to
 value '256x256'
 [graph 0 input from stream 1:0 @ 000000c64e46ca00] Setting 'pix_fmt' to
 value '26'
 [graph 0 input from stream 1:0 @ 000000c64e46ca00] Setting 'time_base' to
 value '1/25'
 [graph 0 input from stream 1:0 @ 000000c64e46ca00] Setting 'pixel_aspect'
 to value '0/1'
 [graph 0 input from stream 1:0 @ 000000c64e46ca00] Setting 'frame_rate' to
 value '25/1'
 [graph 0 input from stream 1:0 @ 000000c64e46ca00] w:256 h:256 pixfmt:rgba
 tb:1/25 fr:25/1 sar:0/1
 [format @ 000000c64e46c700] Setting 'pix_fmts' to value
 'rgb24|rgba|rgb48be|rgba64be|pal8|gray|ya8|gray16be|ya16be|monob'
 [auto_scale_0 @ 000000c64e46d700] w:iw h:ih flags:'' interl:0
 [Parsed_overlay_0 @ 000000c64e46d900] auto-inserting filter 'auto_scale_0'
 between the filter 'graph 0 input from stream 0:0' and the filter
 'Parsed_overlay_0'
 [auto_scale_1 @ 000000c64e46c800] w:iw h:ih flags:'' interl:0
 [Parsed_overlay_0 @ 000000c64e46d900] auto-inserting filter 'auto_scale_1'
 between the filter 'graph 0 input from stream 1:0' and the filter
 'Parsed_overlay_0'
 [auto_scale_2 @ 000000c64e46c900] w:iw h:ih flags:'' interl:0
 [format @ 000000c64e46c700] auto-inserting filter 'auto_scale_2' between
 the filter 'Parsed_overlay_0' and the filter 'format'
 [AVFilterGraph @ 000000c64e475380] query_formats: 5 queried, 4 merged, 3
 already done, 0 delayed
 [auto_scale_2 @ 000000c64e46c900] picking rgba out of 10 ref:yuva420p
 alpha:1
 [auto_scale_0 @ 000000c64e46d700] w:256 h:256 fmt:rgb24 sar:0/1 -> w:256
 h:256 fmt:yuva420p sar:0/1 flags:0x0
 [auto_scale_1 @ 000000c64e46c800] w:256 h:256 fmt:rgba sar:0/1 -> w:256
 h:256 fmt:yuva420p sar:0/1 flags:0x0
 [Parsed_overlay_0 @ 000000c64e46d900] main w:256 h:256 fmt:yuva420p
 overlay w:256 h:256 fmt:yuva420p
 [Parsed_overlay_0 @ 000000c64e46d900] [framesync @ 000000c64e4767e8]
 Selected 1/25 time base
 [Parsed_overlay_0 @ 000000c64e46d900] [framesync @ 000000c64e4767e8] Sync
 level 2
 [auto_scale_2 @ 000000c64e46c900] w:256 h:256 fmt:yuva420p sar:0/1 ->
 w:256 h:256 fmt:rgba sar:0/1 flags:0x0
 [Parsed_overlay_0 @ 000000c64e46d900] n:1.000000 t:0.000000 pos:0.000000
 x:0.000000 xi:0 y:0.000000 yi:0
 Output #0, image2, to 'ffmpegoverlay_20220302.png':
   Metadata:
     encoder         : Lavf59.17.102
   Stream #0:0, 0, 1/25: Video: png, rgba(pc, gbr/unknown/unknown,
 progressive), 256x256, q=2-31, 200 kb/s, 25 fps, 25 tbn
     Metadata:
       encoder         : Lavc59.21.103 png
 Clipping frame in rate conversion by 0.000008
 frame=    1 fps=0.0 q=0.0 size=N/A time=00:00:00.00 bitrate=N/A speed=
 0x
 cur_dts is invalid st:0 (0) [init:1 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 [Parsed_overlay_0 @ 000000c64e46d900] [framesync @ 000000c64e4767e8] Sync
 level 1
 cur_dts is invalid st:0 (0) [init:1 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 [Parsed_overlay_0 @ 000000c64e46d900] [framesync @ 000000c64e4767e8] Sync
 level 0
 cur_dts is invalid st:0 (0) [init:1 i_done:0 finish:0] (this is harmless
 if it occurs once at the start per stream)
 [out_0_0 @ 000000c64e46c100] EOF on sink link out_0_0:default.
 No more output streams to write to, finishing.
 [image2 @ 000000c64e434740] Opening 'ffmpegoverlay_20220302.png' for
 writing
 [file @ 000000c64e450440] Setting default whitelist 'file,crypto,data'
 [AVIOContext @ 000000c652d47ac0] Statistics: 2411 bytes written, 0 seeks,
 1 writeouts
 frame=    1 fps=0.0 q=-0.0 Lsize=N/A time=00:00:00.04 bitrate=N/A
 speed=0.251x
 video:2kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB
 muxing overhead: unknown
 Input file #0 (black.png):
   Input stream #0:0 (video): 1 packets read (285 bytes); 1 frames decoded;
   Total: 1 packets (285 bytes) demuxed
 Input file #1 (white_withalpharamp.png):
   Input stream #1:0 (video): 1 packets read (972 bytes); 1 frames decoded;
   Total: 1 packets (972 bytes) demuxed
 Output file #0 (ffmpegoverlay_20220302.png):
   Output stream #0:0 (video): 1 frames encoded; 1 packets muxed (2411
 bytes);
   Total: 1 packets (2411 bytes) muxed
 2 frames successfully decoded, 0 decoding errors
 [AVIOContext @ 000000c64e43c040] Statistics: 285 bytes read, 0 seeks
 [AVIOContext @ 000000c64c90c980] Statistics: 972 bytes read, 0 seeks
 }}}

 black.png is RGB 0,0,0

 white_withalpharamp.png is RGB 255,255,255 with a perfect 0-255 ramp for
 the alpha channel, 256x256

 The expected output is a perfect gradient 0-255, with each xpos having the
 same R=G=B value. "greyscaleramp.png" is the expected output, and achieved
 with other programs

 The ffmpeg observed output has banding , some values are dropped, some are
 repeated (but at least 0 and 255 are preserved for a binarized mask). This
 has implications for compositing, greenscreen, non binarized masks

 This test used RGB inputs, but the log says auto scaling to "fmt:yuva420p"
 . My guess is there is some swscale issue - maybe RGB is being converted
 to YUV and back with limited range at 8bit causing the quantization errors
 , instead of operating as an RGB overlay. Or perhaps the intermediate YUV
 operation could be performed at a higher bit depth to avoid those issues
-- 
Ticket URL: <https://trac.ffmpeg.org/ticket/9675>
FFmpeg <https://ffmpeg.org>
FFmpeg issue tracker

_______________________________________________
FFmpeg-trac mailing list
FFmpeg-trac@avcodec.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-trac

To unsubscribe, visit link above, or email
ffmpeg-trac-requ...@ffmpeg.org with subject "unsubscribe".

[FFmpeg-trac] #9675(avfilter:new): -vf overlay is not accurate with RGB input using embedded RGBA alpha channel

Reply via email to