Thanks Michael, I will try to answer your comments. > > +ffmpeg -i input.mp4 -vn -af > > "aformat=sample_rates=16000:channel_layouts=mono,whisper= > > Is there a reason why we convert to 16khz mono here ?
It is the only format supported by the whisper.cpp library. > > +model=../whisper.cpp/models/ggml-base.en.bin\ > > It would be nice if the models would be in a standard location, so teh user > just has to specify the model name and not the path I think that this functionality should be implemented inside the whisper.cpp library, so they can manage the exact model location and the download process. I will propose a change. > I tried this: > > ./ffmpeg -i matrixbench_mpeg2.mpg -vn -af > "aformat=sample_rates=16000:channel_layouts=mono,whisper=model=/home/michael/whisper.cpp/models/ggml-base.en.bin:language=en:queue=3000:destination=output.srt:format=srt" > -f null - > > but the output.srt is empty (0 bytes) Can you enable verbose logging? > libavfilter/af_whisper.c:75:49: error: parameter name omitted > 75 | static void cb_log_disable(enum ggml_log_level, const char *, void *) > {} > | ^~~~~~~~~~~~ > libavfilter/af_whisper.c:75:63: I don't see this error using the gcc13 compiler. Do you use a different compiler or some other flags? > > + wctx->audio_buffer_fill_size = 0; > > + > > + wctx->next_pts = AV_NOPTS_VALUE; > > + > > + wctx->avio_context = NULL; > > arent things already initialized to 0 ? Yes, maybe we can keep the AV_NOPTS_VALUE assignment (it is not zero). -- /Vittorio Palmisano/ _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".