Hi,
yesterday, news made the round, that ffmpeg 8 is going to be released, soon, and it will contain whisper, an AI software that can understand spoken text and create subtitles.
Their github page https://github.com/ggml-org/whisper.cpp says they offer a handful of models.
Model Disk Mem tiny 75 MiB ~273 MB base 142 MiB ~388 MB small 466 MiB ~852 MB medium 1.5 GiB ~2.1 GB large 2.9 GiB ~3.9 GB How does this work? Will all of this be compiled into the ffmpeg binary? _______________________________________________ ffmpeg-user mailing list ffmpeg-user@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-user To unsubscribe, visit link above, or email ffmpeg-user-requ...@ffmpeg.org with subject "unsubscribe".