Federico Mariani created CAMEL-23390:
----------------------------------------

             Summary: Add audio transcription support to camel-openai component
                 Key: CAMEL-23390
                 URL: https://issues.apache.org/jira/browse/CAMEL-23390
             Project: Camel
          Issue Type: New Feature
          Components: camel-openai
    Affects Versions: 4.20.0
            Reporter: Federico Mariani


The camel-openai component currently supports three operations: 
_chat-completion_, _embeddings_, and _tool-execution_. The underlying 
openai-java SDK already provides full support for the _POST 
/v1/audio/transcriptions_ endpoint via 
_client.audio().transcriptions().create()_, including response models 
(Transcription, TranscriptionVerbose, TranscriptionDiarized) and streaming 
events.
                                                                                
                                                                                
                                                                             
This issue proposes adding a new audio-transcription operation to camel-openai 
that:                                                                           
                                                                            
   
- Accepts an audio file as the exchange body (e.g., File, InputStream, or 
byte[])                                                                         
                                                                                
 
- Exposes key parameters as endpoint options and/or headers: language, prompt, 
responseFormat, temperature, timestampGranularities                             
                                                                            
- Returns the transcription text as the exchange body, with metadata (e.g., 
duration, segments) available via headers                                       
                                                        
                                                                                
                                                                                
                                                                             
This would allow Camel routes to transcribe audio files using the OpenAI API 
without requiring custom code or a separate HTTP call.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to