Wiki: https://fedoraproject.org/wiki/Changes/ibus-speech-to-text_pywhispercpp
Discussion Thread: https://discussion.fedoraproject.org/t/179397 **This is a proposed Change for Fedora Linux.** This document represents a proposed Change. As part of the Changes process, proposals are publicly announced in order to receive community feedback. This proposal will only be implemented if approved by the Fedora Engineering Steering Committee. == Summary == ibus-speech-to-text 0.7.0 introduces support for OpenAI's Whisper engine via pywhispercpp (python bindings of WhisperCpp) in addition to the existing Vosk engine. == Owner == * Name: [[User:matiwari|Manish Tiwari]] * Email: [email protected] == Detailed Description == Key ibus-speech-to-text-0.7.0 Changes: * ibus-speech-to-text provides a new backend engine option allowing users to select between Vosk and Whisper engine * It has a new GStreamer engine to integrate WhisperCpp into ibus-speech-to-text pipeline * It supports multiple Whisper models, including locally installed models and online models downloaded from Hugging Face * Automatic locale based model selection when possible * UI updates to allow backend switching and model management from setup tool == Feedback == == Benefit to Fedora == This package will bring several benefits to Fedora: * Higher accuracy speech recognition * Greater flexibility by allowing users to choose between multiple backends == Scope == * Proposal owners: ** Package pywhispercpp ([ https://bugzilla.redhat.com/show_bug.cgi?id=2420917]) [done] * Other developers: N/A * Release engineering: [https://pagure.io/releng/issues #Releng issue number] * Policies and guidelines: N/A (not needed for this Change) * Trademark approval: N/A (not needed for this Change) * Alignment with the Fedora Strategy: == Upgrade/compatibility impact == Existing ibus-speech-to-text installations will continue to use the Vosk backend by default. No existing configuration or functionality is removed. == Early Testing (Optional) == Do you require 'QA Blueprint' support? N == How To Test == === Functionality Test === 1. Install required packages:`sudo dnf install ibus-speech-to-text` 2. Restart IBus using `ibus restart` command 3. Add `Speech To Text` in input sources 4. Launch the `IBus STT Setup` tool from the preferences for a configuration and to download a language model 5. From `Setup tool` select `Whisper` as a backend then select and download Whisper model from list of available model for each locale == User Experience == Users will see a new backend option in ibus-speech-to-text settings with a variety of Whisper models. == Dependencies == * pywhispercpp == Contingency Plan == * Contingency mechanism: N/A (Not a system wide change) * Contingency deadline: N/A (Not a system wide change) * Blocks release? N/A (Not a system wide change) == Documentation == N/A (Not a system wide change) == Release Notes == \n\nibus-speech-to-text now supports the WhisperCpp speech recognition engine via pywhispercpp, providing improved accuracy and multilingual support.
-- _______________________________________________ devel-announce mailing list -- [email protected] To unsubscribe send an email to [email protected] Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/[email protected] Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue
-- _______________________________________________ devel mailing list -- [email protected] To unsubscribe send an email to [email protected] Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/[email protected] Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue
