https://bugs.kde.org/show_bug.cgi?id=515829

            Bug ID: 515829
           Summary: Integrate Voxtral (Mistral AI) for Speech-to-text
    Classification: Applications
           Product: kdenlive
Version First Reported In: unspecified
          Platform: Other
                OS: Linux
            Status: REPORTED
          Severity: wishlist
          Priority: NOR
         Component: Title Clips & Subtitles
          Assignee: [email protected]
          Reporter: [email protected]
  Target Milestone: ---

Support Mistral AI’s new open-weight Voxtral models as an alternative STT
engine for automatic subtitling.

Key Benefits:
* Accuracy and Speed: Reported to outperform Whisper large-v3 on both Word
Error Rate (WER) and processing speed.
* Native Diarization: Built-in speaker identification to automatically label
different voices in transcripts.
* Efficiency: Optimized for local hardware; the Mini-3B model provides
high-quality results with low VRAM usage.
* Privacy/License: Apache 2.0 license, allowing for fully offline, private
processing.
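
For reference, the WER figure mentioned above is the word-level edit distance
(substitutions + deletions + insertions) divided by the number of words in the
reference transcript. A minimal sketch of that calculation follows; the
function name word_error_rate is only illustrative, it is not part of Kdenlive
or Voxtral, and real benchmarks usually normalize case and punctuation first,
which this sketch does not do.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

Comparing two engines this way would require running both on the same audio
with the same reference transcript.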

Proposed Integration:
Add "Voxtral" to the STT engine list in Settings > Speech to Text, with model
selection (Mini/Small) and a toggle for speaker diarization.
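
To illustrate what the diarization toggle could mean for subtitles, here is a
rough sketch that turns diarized segments into SRT text and optionally
prefixes each line with a speaker label. The Segment structure, its field
names, and the "[Speaker] text" label format are assumptions made for this
example; they do not reflect Voxtral's actual output format or Kdenlive's
subtitle code.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical diarized segment; field names are assumptions."""
    start: float   # seconds
    end: float     # seconds
    speaker: str
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[Segment], label_speakers: bool = True) -> str:
    """Build SRT text; label_speakers mirrors the proposed diarization toggle."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        text = f"[{seg.speaker}] {seg.text}" if label_speakers else seg.text
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{text}\n"
        )
    # SRT entries are separated by a blank line.
    return "\n".join(blocks)
```

With the toggle off (label_speakers=False) the output is identical to an
ordinary single-speaker subtitle file, so existing workflows would not change.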
