2018-11-21 16:27 GMT+01:00, Harvey Pikelberger <[email protected]>:
>
>> On Nov 20, 2018, at 3:05 AM, Carl Eugen Hoyos <[email protected]> wrote:
>>
>> 2018-11-20 4:28 GMT+01:00, Harvey Pikelberger <[email protected]>:
>>> Testing the creation of 8-audio-channel mp4 proxies from ProRes422
>>> sources using ffmpeg. For some reason audio channel 4 of the mp4
>>> is coming back muffled, like it got drowned in a pool.
>>
>> Every aac encoder will assume that one of eight input channels is the
>> lfe channel and will only encode low frequencies for this channel.
>> (Maybe not "every" but in any case, this is the issue you see.)
>
> That sounds correct. I would think that would account for the muffled
> audio 4.
> Indeed, in the stdout you can see that ffmpeg "guessed" mono for the DNx
> source and 7.1 for the ProRes source.
> How to force ffmpeg to treat the ProRes as mono sources?

You have to use the channelsplit filter (as you cannot treat 7.1 as mono).

>> In case I misunderstood the issue and this is really 7.1 input
>
> It's not 7.1 input, at least not intentionally.
>
> The goal is to test sources with 8 tracks of mono sound, logic being that
> these are files that are delivered to post for editing purposes, and for
> which we need to generate proxies.
>
> We're working in Premiere Pro, where proxy audio channel count / assigns
> have to match that of the original files.
> Our goal here is to convert full res sources to h264 such that the sound
> matches the original so that Premiere accepts them as legitimate proxies.
>
> So far ffmpeg is the only tool that has successfully processed video sources
> with 4 audio channels into 4-audio-channel h264/mp4 files.
>
> But we also get sources with 5-8 audio channels, hence these rounds of
> testing.
>
>
>> but the
>> encoding is broken: Our aac encoder provides acceptable output for
>> mono and stereo, maybe not for other layouts and there is an
>> unavoidable issue for 7.1 aac because of a flaw in the specification,
>> feel free to test with the libfdk aac encoder.
>
> To test with libfdk aac encoder, where would you specify that in the CLI
> here?

-acodec libfdk_aac (in place of -c:a aac)

> ffmpeg -i SoundMappingTest-H8Discreet.mov -map 0 -g 48 -c:v libx264
> -profile:v baseline -crf 16 -c:a aac -b:a 256k -vf scale=1280:720 -pix_fmt
> yuv420p SoundMappingTest-H8Discreet.mp4
>
>>
>>
>>> Recently tested using DNxHD mxf files -- same audio configuration
>>> and everything worked perfectly.

The audio configuration of your two inputs is very different...

>>
>> That sounds very unlikely / command line and complete, uncut console
>> output missing.
>
> RE "complete, uncut console output missing": Not sure why. Tried to be
> careful, but who knows.
> Will attach the stdouts for both the DNx and ProRes tests below...

... which the console output very clearly shows.

>
> Also, I posted the source and output files to the web.
> Apologies for the audio content. I randomly grabbed sound of someone
> reading numbers 1 to 8 out loud from a Spanish language children's show -- a
> little abrasive to say the least.
>
>
> DNx
> Source: https://drive.google.com/open?id=1kR6aYKv90cxl-HJe04_wssoxsLfJK78z
> Output: https://drive.google.com/open?id=1HgiV0Dc42iBwLurI857zc3RO8Oig9-M1
>
> ProRes
> Source: https://drive.google.com/open?id=1xEG6FsxdQ53j-TE2qaegw-8QjB4il0IF
> Output: https://drive.google.com/open?id=1Itua1QXhUyozhssVnvvNULdkNe3X28MA
>
> stdout for DNx
> ffmpeg version 4.0.2 Copyright (c) 2000-2018 the FFmpeg developers

This version is old and unsupported, although this may make no difference
in your case.

> built with Apple LLVM version 10.0.0 (clang-1000.10.44.2)
> configuration: --prefix=/usr/local/Cellar/ffmpeg/4.0.2_1 --enable-shared
> --enable-pthreads --enable-version3 --enable-hardcoded-tables
> --enable-avresample --cc=clang --host-cflags= --host-ldflags=
> --enable-ffplay --enable-gpl --enable-libmp3lame --enable-libopus
> --enable-libsnappy --enable-libtheora --enable-libvorbis --enable-libvpx
> --enable-libx264 --enable-libx265 --enable-libxvid --enable-lzma
> --enable-chromaprint --enable-libass --enable-libfdk-aac
> --enable-libfreetype --enable-libopencore-amrnb --enable-libopencore-amrwb
> --enable-libopenh264 --enable-librsvg --enable-librtmp
> --enable-librubberband --enable-libsoxr --enable-libssh
> --enable-libtesseract --enable-libvidstab --enable-libwebp --enable-libzimg
> --enable-libzmq --enable-opencl --enable-openssl --enable-videotoolbox
> --enable-libopenjpeg --disable-decoder=jpeg2000
> --extra-cflags=-I/usr/local/Cellar/openjpeg/2.3.0/include/openjpeg-2.3
> --enable-nonfree
> libavutil 56.
14.100 / 56. 14.100 > libavcodec 58. 18.100 / 58. 18.100 > libavformat 58. 12.100 / 58. 12.100 > libavdevice 58. 3.100 / 58. 3.100 > libavfilter 7. 16.100 / 7. 16.100 > libavresample 4. 0. 0 / 4. 0. 0 > libswscale 5. 1.100 / 5. 1.100 > libswresample 3. 1.100 / 3. 1.100 > libpostproc 55. 1.100 / 55. 1.100 > Guessed Channel Layout for Input Stream #0.1 : mono > Guessed Channel Layout for Input Stream #0.2 : mono > Guessed Channel Layout for Input Stream #0.3 : mono > Guessed Channel Layout for Input Stream #0.4 : mono > Guessed Channel Layout for Input Stream #0.5 : mono > Guessed Channel Layout for Input Stream #0.6 : mono > Guessed Channel Layout for Input Stream #0.7 : mono > Guessed Channel Layout for Input Stream #0.8 : mono > Input #0, mxf, from 'SoundMappingTest-C.mxf': > Metadata: > uid : a6adbf01-ec0b-11e8-9a0e-784f43779667 > generation_uid : a6adbf02-ec0b-11e8-92a9-784f43779667 > company_name : Adobe Systems Incorporated > product_name : Adobe Media Encoder > product_version : 12.1.2 > application_platform: Mac OS X > product_uid : 0c3919fe-46e8-11e5-a151-feff819cdc9f > modification_date: 2018-11-19T14:59:02.000000Z > material_package_umid: > 0x060A2B340101010501010D1113000000D81A37034184058816D8784F43779667 > timecode : 00:00:00:00 > Duration: 00:00:36.70, start: 0.000000, bitrate: 191705 kb/s > Stream #0:0: Video: dnxhd (DNXHD), yuv422p(bt709/unknown/unknown, > progressive), 1920x1080, SAR 1:1 DAR 16:9, 23.98 fps, 23.98 tbr, 23.98 tbn, > 23.98 tbc > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 1 > Stream #0:1: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 2 > Stream #0:2: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 
0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 3 > Stream #0:3: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 4 > Stream #0:4: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 5 > Stream #0:5: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 6 > Stream #0:6: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 7 > Stream #0:7: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 8 > Stream #0:8: Audio: pcm_s16le, 48000 Hz, mono, s16, 768 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 9 > Stream mapping: > Stream #0:0 -> #0:0 (dnxhd (native) -> h264 (libx264)) > Stream #0:1 -> #0:1 (pcm_s16le (native) -> aac (native)) > Stream #0:2 -> #0:2 (pcm_s16le (native) -> aac (native)) > Stream #0:3 -> #0:3 (pcm_s16le (native) -> aac (native)) > Stream #0:4 -> #0:4 (pcm_s16le (native) -> aac (native)) > Stream #0:5 -> #0:5 (pcm_s16le (native) -> aac (native)) > Stream #0:6 -> #0:6 (pcm_s16le (native) -> aac (native)) > Stream #0:7 -> #0:7 (pcm_s16le (native) -> aac (native)) > Stream #0:8 -> #0:8 (pcm_s16le (native) 
-> aac (native)) > Press [q] to stop, [?] for help > [libx264 @ 0x7f98a681c800] using SAR=1/1 > [libx264 @ 0x7f98a681c800] using cpu capabilities: MMX2 SSE2Fast SSSE3 > SSE4.2 AVX FMA3 BMI2 AVX2 > [libx264 @ 0x7f98a681c800] profile Constrained Baseline, level 3.1 > [libx264 @ 0x7f98a681c800] 264 - core 152 r2854 e9a5903 - H.264/MPEG-4 AVC > codec - Copyleft 2003-2017 - http://www.videolan.org/x264.html - options: > cabac=0 ref=3 deblock=1:0:0 analyse=0x1:0x111 me=hex subme=7 psy=1 > psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=0 > cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 > lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 > bluray_compat=0 constrained_intra=0 bframes=0 weightp=0 keyint=48 > keyint_min=4 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 > crf=16.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 > Output #0, mp4, to 'SoundMappingTest-C2.mp4': > Metadata: > uid : a6adbf01-ec0b-11e8-9a0e-784f43779667 > generation_uid : a6adbf02-ec0b-11e8-92a9-784f43779667 > company_name : Adobe Systems Incorporated > product_name : Adobe Media Encoder > product_version : 12.1.2 > application_platform: Mac OS X > product_uid : 0c3919fe-46e8-11e5-a151-feff819cdc9f > modification_date: 2018-11-19T14:59:02.000000Z > material_package_umid: > 0x060A2B340101010501010D1113000000D81A37034184058816D8784F43779667 > timecode : 00:00:00:00 > encoder : Lavf58.12.100 > Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, > 1280x720 [SAR 1:1 DAR 16:9], q=-1--1, 0.04 fps, 24k tbn, 23.98 tbc > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 1 > encoder : Lavc58.18.100 libx264 > Side data: > cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: -1 > Stream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 
0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 2 > encoder : Lavc58.18.100 aac > Stream #0:2: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 3 > encoder : Lavc58.18.100 aac > Stream #0:3: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 4 > encoder : Lavc58.18.100 aac > Stream #0:4: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 5 > encoder : Lavc58.18.100 aac > Stream #0:5: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 6 > encoder : Lavc58.18.100 aac > Stream #0:6: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 7 > encoder : Lavc58.18.100 aac > Stream #0:7: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 > file_package_name: Source Package > track_name : Track 8 > encoder : Lavc58.18.100 aac > Stream #0:8: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, mono, fltp, > 256 kb/s > Metadata: > file_package_umid: > 0x060A2B340101010501010D12138515CFD81A370341840588D4EA784F43779667 
> file_package_name: Source Package > track_name : Track 9 > encoder : Lavc58.18.100 aac > frame= 44 fps=0.0 q=0.0 size= 0kB time=00:00:02.11 bitrate= > 0.2kbits/sframe= 78 fps= 78 q=21.0 size= 512kB time=00:00:03.54 > bitrate=1184.5kbits/frame= 109 fps= 73 q=21.0 size= 1536kB > time=00:00:04.82 bitrate=2609.9kbits/frame= 140 fps= 70 q=21.0 size= > 2560kB time=00:00:06.12 bitrate=3425.3kbits/frame= 172 fps= 69 q=21.0 size= > 3584kB time=00:00:07.40 bitrate=3966.2kbits/frame= 201 fps= 67 q=21.0 > size= 4608kB time=00:00:08.66 bitrate=4358.4kbits/frame= 231 fps= 66 > q=21.0 size= 5632kB time=00:00:09.92 bitrate=4651.0kbits/frame= 260 fps= > 65 q=21.0 size= 6656kB time=00:00:11.11 bitrate=4905.8kbits/frame= 288 > fps= 64 q=21.0 size= 7680kB time=00:00:12.28 bitrate=5120.0kbits/frame= > 319 fps= 64 q=21.0 size= 8960kB time=00:00:13.58 > bitrate=5401.3kbits/frame= 351 fps= 64 q=21.0 size= 9984kB > time=00:00:14.91 bitrate=5484.8kbits/frame= 380 fps= 63 q=21.0 size= > 11008kB time=00:00:16.12 bitrate=5591.4kbits/frame= 412 fps= 63 q=21.0 > size= 12032kB time=00:00:17.47 bitrate=5641.4kbits/frame= 443 fps= 63 > q=21.0 size= 13056kB time=00:00:18.75 bitrate=5703.7kbits/frame= 473 fps= > 63 q=21.0 size= 13824kB time=00:00:20.01 bitrate=5659.3kbits/frame= 504 > fps= 63 q=21.0 size= 14848kB time=00:00:21.31 bitrate=5707.4kbits/frame= > 534 fps= 62 q=18.0 size= 15616kB time=00:00:22.54 > bitrate=5673.2kbits/frame= 565 fps= 62 q=21.0 size= 16640kB > time=00:00:23.85 bitrate=5715.4kbits/frame= 596 fps= 62 q=21.0 size= > 17664kB time=00:00:25.13 bitrate=5758.1kbits/frame= 623 fps= 62 q=21.0 > size= 18432kB time=00:00:26.26 bitrate=5749.7kbits/frame= 650 fps= 62 > q=21.0 size= 19456kB time=00:00:27.39 bitrate=5818.6kbits/frame= 678 fps= > 61 q=21.0 size= 20480kB time=00:00:28.56 bitrate=5873.3kbits/frame= 706 > fps= 61 q=21.0 size= 21248kB time=00:00:29.71 bitrate=5857.3kbits/frame= > 736 fps= 61 q=21.0 size= 22272kB time=00:00:30.97 > bitrate=5890.1kbits/frame= 767 fps= 61 q=21.0 
size= 23296kB > time=00:00:32.27 bitrate=5912.5kbits/frame= 796 fps= 61 q=21.0 size= > 24320kB time=00:00:33.47 bitrate=5952.1kbits/frame= 827 fps= 61 q=21.0 > size= 25344kB time=00:00:34.77 bitrate=5970.6kbits/frame= 857 fps= 61 > q=21.0 size= 26112kB time=00:00:36.03 bitrate=5936.7kbits/frame= 880 fps= > 58 q=-1.0 Lsize= 28524kB time=00:00:36.71 bitrate=6364.5kbits/s > speed=2.43x > video:19416kB audio:8989kB subtitle:0kB other streams:0kB global headers:0kB > muxing overhead: 0.422849% > [libx264 @ 0x7f98a681c800] frame I:25 Avg QP:12.62 size: 73971 > [libx264 @ 0x7f98a681c800] frame P:855 Avg QP:15.24 size: 21090 > [libx264 @ 0x7f98a681c800] mb I I16..4: 25.6% 0.0% 74.4% > [libx264 @ 0x7f98a681c800] mb P I16..4: 11.3% 0.0% 7.8% P16..4: 43.4% > 18.9% 5.6% 0.0% 0.0% skip:13.1% > [libx264 @ 0x7f98a681c800] coded y,uvDC,uvAC intra: 40.2% 71.1% 34.1% inter: > 18.3% 41.4% 1.7% > [libx264 @ 0x7f98a681c800] i16 v,h,dc,p: 45% 25% 13% 17% > [libx264 @ 0x7f98a681c800] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 30% 24% 17% 5% > 6% 5% 5% 5% 4% > [libx264 @ 0x7f98a681c800] i8c dc,h,v,p: 49% 21% 23% 6% > [libx264 @ 0x7f98a681c800] ref P L0: 68.3% 20.9% 10.8% > [libx264 @ 0x7f98a681c800] kb/s:4333.32 > [aac @ 0x7f98a681e000] Qavg: 125.339 > [aac @ 0x7f98a681f200] Qavg: 602.799 > [aac @ 0x7f98a6820400] Qavg: 1345.386 > [aac @ 0x7f98a6821600] Qavg: 55054.453 > [aac @ 0x7f98a6822800] Qavg: 42813.750 > [aac @ 0x7f98a6823a00] Qavg: 503.112 > [aac @ 0x7f98a6824c00] Qavg: 51539.637 > [aac @ 0x7f98a701dc00] Qavg: 58240.789 > > stdout for ProRes > $ ffmpeg -i SoundMappingTest-H8Discreet.mov -map 0 -g 48 -c:v libx264 > -profile:v baseline -crf 16 -c:a aac -b:a 256k -vf scale=1280:720 -pix_fmt > yuv420p SoundMappingTest-H8Discreet.mp4 > ffmpeg version 4.0.2 Copyright (c) 2000-2018 the FFmpeg developers > built with Apple LLVM version 10.0.0 (clang-1000.10.44.2) > configuration: --prefix=/usr/local/Cellar/ffmpeg/4.0.2_1 --enable-shared > --enable-pthreads --enable-version3 --enable-hardcoded-tables > 
--enable-avresample --cc=clang --host-cflags= --host-ldflags= > --enable-ffplay --enable-gpl --enable-libmp3lame --enable-libopus > --enable-libsnappy --enable-libtheora --enable-libvorbis --enable-libvpx > --enable-libx264 --enable-libx265 --enable-libxvid --enable-lzma > --enable-chromaprint --enable-libass --enable-libfdk-aac > --enable-libfreetype --enable-libopencore-amrnb --enable-libopencore-amrwb > --enable-libopenh264 --enable-librsvg --enable-librtmp > --enable-librubberband --enable-libsoxr --enable-libssh > --enable-libtesseract --enable-libvidstab --enable-libwebp --enable-libzimg > --enable-libzmq --enable-opencl --enable-openssl --enable-videotoolbox > --enable-libopenjpeg --disable-decoder=jpeg2000 > --extra-cflags=-I/usr/local/Cellar/openjpeg/2.3.0/include/openjpeg-2.3 > --enable-nonfree > libavutil 56. 14.100 / 56. 14.100 > libavcodec 58. 18.100 / 58. 18.100 > libavformat 58. 12.100 / 58. 12.100 > libavdevice 58. 3.100 / 58. 3.100 > libavfilter 7. 16.100 / 7. 16.100 > libavresample 4. 0. 0 / 4. 0. 0 > libswscale 5. 1.100 / 5. 1.100 > libswresample 3. 1.100 / 3. 1.100 > libpostproc 55. 1.100 / 55. 
1.100
> Guessed Channel Layout for Input Stream #0.1 : 7.1
> Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'SoundMappingTest-H8Discreet.mov':
>   Metadata:
>     major_brand     : qt
>     minor_version   : 537199360
>     compatible_brands: qt
>     creation_time   : 2018-11-20T03:09:22.000000Z
>   Duration: 00:00:36.70, start: 0.000000, bitrate: 115812 kb/s
>     Stream #0:0(eng): Video: prores (apcn / 0x6E637061), yuv422p10le(bt709,
> progressive), 1920x1080, 109439 kb/s, SAR 1:1 DAR 16:9, 23.98 fps, 23.98
> tbr, 24k tbn, 24k tbc (default)
>     Metadata:
>       creation_time   : 2018-11-20T03:09:22.000000Z
>       handler_name    : Apple Alias Data Handler
>       encoder         : Apple ProRes 422
>       timecode        : 00:00:00:00
>     Stream #0:1(eng): Audio: pcm_s16le (lpcm / 0x6D63706C), 48000 Hz, 7.1,
> s16, 6144 kb/s (default)

If the audio is flagged as 7.1 but in fact it is not, I don't think there
is a bug that can be fixed in FFmpeg.

If you increase the width of your console window, the output gets even
more readable.

Carl Eugen
_______________________________________________
ffmpeg-user mailing list
[email protected]
http://ffmpeg.org/mailman/listinfo/ffmpeg-user

To unsubscribe, visit link above, or email
[email protected] with subject "unsubscribe".
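
For illustration, here is a rough sketch of how the single 7.1-flagged
audio stream of the ProRes file might be split into eight mono streams
before encoding. It uses the channelsplit filter (asplit would only
duplicate the whole stream). The file names and video options are the ones
from the command quoted above; the pad labels a0..a7 and the shell variable
names are arbitrary choices for this example, and the command has not been
tested against the posted files. Set aenc to libfdk_aac to try the other
encoder.

```shell
#!/bin/sh
# Sketch only: pad labels a0..a7 and variable names are made up for this
# example; file names and video options come from the thread.
in="SoundMappingTest-H8Discreet.mov"
out="SoundMappingTest-H8Discreet.mp4"
aenc="aac"   # or libfdk_aac (the quoted build has --enable-libfdk-aac)

# channelsplit produces one mono output per channel of the given layout.
filter="[0:a:0]channelsplit=channel_layout=7.1"
for i in 0 1 2 3 4 5 6 7; do
  filter="${filter}[a$i]"
done

# Build the argument list: video from the input, then one -map per channel.
set -- -i "$in" -filter_complex "$filter" -map 0:v
for i in 0 1 2 3 4 5 6 7; do
  set -- "$@" -map "[a$i]"
done
set -- "$@" -g 48 -c:v libx264 -profile:v baseline -crf 16 \
  -vf scale=1280:720 -pix_fmt yuv420p -c:a "$aenc" -b:a 256k "$out"

# Show the command for review; run it only if the source file is present.
echo "ffmpeg $*"
if [ -f "$in" ]; then ffmpeg "$@"; fi
```

If this behaves as intended, the output should contain eight mono aac
streams, like the output of the DNx test, and none of them should be
encoded as an lfe channel.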
