Using a Custom AI Avatar on the sample code
Hello I am trying to create a Proof of Concept by using https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/js/browser/avatar I want to show a custom video of a person in an avatar with text generated with OpenAi. In other…
Azure AI Speech

Speech Studio - Error in exporting data from editor to dataset section
Hello all, I am using Azure speech studio to train custom speech models. Since last week I am unable to export data (audio + transcript) from "Editor" section to "Training and Testing Dataset" section. I simply see the attached error.…
Azure AI Speech
Azure OpenAI Realtime API
In Azure Foundry, the model "gpt-4o-realtime-preview-2025-06-03" is marked as: "Legacy model version: This deployment is using legacy model gpt-4o-realtime-preview version 2025-06-03 which will be retired on 9/01/2025 local time. When the…
Azure AI Speech
Does live chat avatar synthesis support WordBoundary events?
I was trying to set up the WordBoundary event callback for my live chat avatar synthesis but the callback is never run (the avatar speaks in the front-end but I get no events). That brings me to the question - are these events even supported for live…
Azure AI Speech
change ai voice
how we can change ai voice -audiomodify?
Azure AI Speech

Latency Issue In Speech To Text Realtime API
We are using Azure Speech-to-Text (STT) streaming API from the Central India region and experiencing consistent latency of 1.5 to 2 seconds from audio input to transcription result. Our setup: SDK: JavaScript/Node SDK using SpeechRecognizer with…
Azure AI Speech
30 secs timeout on Azure speech to text
Hello, I'm experiencing an issue with Azure Speech-to-Text where, in continuous recognition mode, it outputs a RECOGNIZED result every 30 seconds, regardless of whether speech has stopped. Adjusting settings like Speech_SegmentationSilenceTimeoutMs has…
Azure AI Speech
How-to setup Speech SDK with MAS (AEC) in Unity
Hello everyone, I am trying to implement Acoustic Echo Cancellation (AEC) in a Unity project using the Azure Speech SDK and the Microsoft Audio Stack (MAS), but I cannot get it to work correctly. The speech recognizer continues to pick up and transcribe…
Azure AI Speech

Issues with Azure Speech Services: Incorrect transcription of "draft" as "draught" and "£" as "lbs" in UK English
I'm using Azure Speech Services with the language set to UK English, and I've noticed two recurring transcription issues: When I dictate the word "draft", it consistently transcribes as "draught", even when the context clearly favors…
Azure AI Speech

azure speech to text cannot process spelt out words
When using real-time speech to text, if the audio spells out a word or name, the result outputs the name as if it was said in whole and not spelled. (e.g. the audio says "My name is John. J-O-H-N", but the result I get is "My name is John.…
Azure AI Speech

Error while trying to train a 202240228 Whisper Large v2 baseline model
When trying to train a custom speech model using a dataset containing an audio file and its transcript, the model failed to train due to an internal error. Can anyone provide any insights on how to troubleshoot this issue?
Azure AI Speech
Azure | Azure Startups
Can you help me access the Dustin voice in Azure text to speech studio?
Hello I created a new Azure account and need to access the Dustin voice, but it is not available. Can you help? Thank you.
Azure AI Speech
Is it possible to get subtitles or a timed script with batch synthesis text to speech avatar?
Using batch text-to-speach or batch avatar API, is it possible to get subtitles on the generated video? Or even better, getting a script of the text with time stamps. I was hoping to do some front end shenanigans by creating texts highlights, as the…
Azure AI Speech
Azure speech to text appears very slow
Hi team, We have observed that the Azure speech-to-text is very slow. I am using continuousRecognitionAsync and I observe that Azure takes a total of close to 6s for just 3s audio. The parameters that I've set are: EndSilenceTimeoutMs =…
Azure AI Speech
Problem creating SpeechRecognizer with audio stream input using node.js Speech SDK
Using Speech SDK for JavaScript v1.44.0, and following the STT in-memory streaming example, but using the fromEndpoint API to create Recognizer, as recommended in the Release Notes for that SDK version. Node.js is v22 LTS, running in Azure Cloud as an…
Azure AI Speech
Unexpectedly high TTS character count in Azure Speech Service during live app test
Hi team, We are running a production-ready church translation app using Azure Translator and Azure Speech Services (STT & TTS, neural voice). During a 30 minute live test involving 8 user devices (all using Spanish), we observed the…
Azure AI Speech
Quota increase on my cognitive services - text to speech usages.
Hello, I can't create a support ticket on Azure since obviously I am on basic plan, any way for me to get help increasing the quota? Thank you!
Azure AI Speech

Azure Speech Recognition: noise or chatter is recognized as insurance related texts, rather that just not recognized.
Hello, When noise or background chatter is sent to Azure Speech Recognition it sends back texts related to insurance policies. Texts which where not spoken at all. For example repsonse like (for language nl-NL): ik wil graag een verzekering afsluiten…
Azure AI Speech
How is the Synthesized Characters count for Azure's Text to Speech service when generating from an SSML?
How is the Synthesized Characters count calculated for Azure's Text to Speech service when generating speech from an SSML document? What are the specific rules? I converted the following SSML file into speech: …
Azure AI Speech
I need a local STT and TTS model partner for my CRM product
I need a local STT and TTS model partner for my CRM product Essential features: 1, Speech to text 2, Support for websocket access 3, Support for speaker identification Key features: 1, Support for real-time translation 2, Support for downloading audio…