2,098 questions with Azure AI Speech tags

Sort by: Updated
0 answers

Using a Custom AI Avatar on the sample code

Hello I am trying to create a Proof of Concept by using https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/js/browser/avatar I want to show a custom video of a person in an avatar with text generated with OpenAi. In other…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-08-04T02:11:52.6033333+00:00
It is VMS 130 Reputation points
commented 2025-08-12T03:14:57.7966667+00:00
Manas Mohanty 8,150 Reputation points Microsoft External Staff Moderator
1 answer

Speech Studio - Error in exporting data from editor to dataset section

Hello all, I am using Azure speech studio to train custom speech models. Since last week I am unable to export data (audio + transcript) from "Editor" section to "Training and Testing Dataset" section. I simply see the attached error.…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-08-06T10:46:36.89+00:00
Shree 0 Reputation points
commented 2025-08-11T21:47:11.1833333+00:00
Ravada Shivaprasad 920 Reputation points Microsoft External Staff Moderator
2 answers

Azure OpenAI Realtime API

In Azure Foundry, the model "gpt-4o-realtime-preview-2025-06-03" is marked as: "Legacy model version: This deployment is using legacy model gpt-4o-realtime-preview version 2025-06-03 which will be retired on 9/01/2025 local time. When the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-08-07T16:51:06.7866667+00:00
Artur Dereń 0 Reputation points
commented 2025-08-11T04:55:30.94+00:00
Pavankumar Purilla 10,350 Reputation points Microsoft External Staff Moderator
0 answers

Does live chat avatar synthesis support WordBoundary events?

I was trying to set up the WordBoundary event callback for my live chat avatar synthesis but the callback is never run (the avatar speaks in the front-end but I get no events). That brings me to the question - are these events even supported for live…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-08-07T13:03:07.6866667+00:00
Mindaugas Giedraitis 0 Reputation points
commented 2025-08-08T16:36:34.04+00:00
Mindaugas Giedraitis 0 Reputation points
1 answer

change ai voice

how we can change ai voice -audiomodify?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-23T14:15:03.7566667+00:00
Charlotte Flint 0 Reputation points
answered 2025-08-07T19:10:26.0633333+00:00
Amira Bedhiafi 35,766 Reputation points Volunteer Moderator
0 answers

Latency Issue In Speech To Text Realtime API

We are using Azure Speech-to-Text (STT) streaming API from the Central India region and experiencing consistent latency of 1.5 to 2 seconds from audio input to transcription result. Our setup: SDK: JavaScript/Node SDK using SpeechRecognizer with…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-08-07T12:47:06.6733333+00:00
Nidoos Solutions 20 Reputation points
1 answer

30 secs timeout on Azure speech to text

Hello, I'm experiencing an issue with Azure Speech-to-Text where, in continuous recognition mode, it outputs a RECOGNIZED result every 30 seconds, regardless of whether speech has stopped. Adjusting settings like Speech_SegmentationSilenceTimeoutMs has…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-06-19T06:33:44.1833333+00:00
Nandhu TS 0 Reputation points
commented 2025-08-06T06:42:34.86+00:00
Nandhu TS 0 Reputation points
1 answer

How-to setup Speech SDK with MAS (AEC) in Unity

Hello everyone, I am trying to implement Acoustic Echo Cancellation (AEC) in a Unity project using the Azure Speech SDK and the Microsoft Audio Stack (MAS), but I cannot get it to work correctly. The speech recognizer continues to pick up and transcribe…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-15T08:22:15.72+00:00
nk 5 Reputation points
answered 2025-08-05T07:36:17.16+00:00
Manas Mohanty 8,150 Reputation points Microsoft External Staff Moderator
0 answers

Issues with Azure Speech Services: Incorrect transcription of "draft" as "draught" and "£" as "lbs" in UK English

I'm using Azure Speech Services with the language set to UK English, and I've noticed two recurring transcription issues: When I dictate the word "draft", it consistently transcribes as "draught", even when the context clearly favors…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-06-12T11:36:24.3566667+00:00
Niki Kariappa 0 Reputation points
commented 2025-08-05T00:59:35.7333333+00:00
Manas Mohanty 8,150 Reputation points Microsoft External Staff Moderator
1 answer

azure speech to text cannot process spelt out words

When using real-time speech to text, if the audio spells out a word or name, the result outputs the name as if it was said in whole and not spelled. (e.g. the audio says "My name is John. J-O-H-N", but the result I get is "My name is John.…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-08-03T00:33:01.7733333+00:00
Tim 0 Reputation points
answered 2025-08-03T00:54:06.0333333+00:00
Jerald Felix 4,450 Reputation points
0 answers

Error while trying to train a 202240228 Whisper Large v2 baseline model

When trying to train a custom speech model using a dataset containing an audio file and its transcript, the model failed to train due to an internal error. Can anyone provide any insights on how to troubleshoot this issue?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
Azure | Azure Startups
asked 2024-05-03T08:53:22.2033333+00:00
Engineering 0 Reputation points
edited a comment 2025-07-31T04:17:09.4366667+00:00
RNareddy 2,505 Reputation points Microsoft External Staff Moderator
1 answer

Can you help me access the Dustin voice in Azure text to speech studio?

Hello I created a new Azure account and need to access the Dustin voice, but it is not available. Can you help? Thank you.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-28T14:48:25.63+00:00
Piet Levy 0 Reputation points
answered 2025-07-30T16:07:54.4033333+00:00
Sina Salam 22,576 Reputation points Volunteer Moderator
1 answer

Is it possible to get subtitles or a timed script with batch synthesis text to speech avatar?

Using batch text-to-speach or batch avatar API, is it possible to get subtitles on the generated video? Or even better, getting a script of the text with time stamps. I was hoping to do some front end shenanigans by creating texts highlights, as the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2024-09-13T15:07:19.0233333+00:00
d m 5 Reputation points
commented 2025-07-28T09:47:43.22+00:00
Piyush Paras Tiwari 0 Reputation points
0 answers

Azure speech to text appears very slow

Hi team, We have observed that the Azure speech-to-text is very slow. I am using continuousRecognitionAsync and I observe that Azure takes a total of close to 6s for just 3s audio. The parameters that I've set are: EndSilenceTimeoutMs =…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-01-30T06:13:09.4933333+00:00
Sai Vishnu Soudri 65 Reputation points
edited a comment 2025-07-27T18:55:26.3033333+00:00
Sven 0 Reputation points
1 answer

Problem creating SpeechRecognizer with audio stream input using node.js Speech SDK

Using Speech SDK for JavaScript v1.44.0, and following the STT in-memory streaming example, but using the fromEndpoint API to create Recognizer, as recommended in the Release Notes for that SDK version. Node.js is v22 LTS, running in Azure Cloud as an…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-06-05T09:01:51.1866667+00:00
Michael Pickering 0 Reputation points
commented 2025-07-27T08:12:24.0866667+00:00
Ravada Shivaprasad 920 Reputation points Microsoft External Staff Moderator
1 answer One of the answers was accepted by the question author.

Unexpectedly high TTS character count in Azure Speech Service during live app test

Hi team, We are running a production-ready church translation app using Azure Translator and Azure Speech Services (STT & TTS, neural voice). During a 30 minute live test involving 8 user devices (all using Spanish), we observed the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-06-25T02:39:26.7733333+00:00
Lauren Van Niekerk 20 Reputation points
accepted 2025-07-30T03:20:46.9266667+00:00
Lauren Van Niekerk 20 Reputation points
1 answer

Quota increase on my cognitive services - text to speech usages.

Hello, I can't create a support ticket on Azure since obviously I am on basic plan, any way for me to get help increasing the quota? Thank you!

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-19T04:11:59.7333333+00:00
LeetGPT 100 Reputation points
commented 2025-07-25T02:50:56.41+00:00
Manas Mohanty 8,150 Reputation points Microsoft External Staff Moderator
2 answers One of the answers was accepted by the question author.

Azure Speech Recognition: noise or chatter is recognized as insurance related texts, rather that just not recognized.

Hello, When noise or background chatter is sent to Azure Speech Recognition it sends back texts related to insurance policies. Texts which where not spoken at all. For example repsonse like (for language nl-NL): ik wil graag een verzekering afsluiten…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-24T10:16:59.59+00:00
Gijsbert Huijsen 21 Reputation points
accepted 2025-07-25T10:47:42.7633333+00:00
Gijsbert Huijsen 21 Reputation points
1 answer

How is the Synthesized Characters count for Azure's Text to Speech service when generating from an SSML?

How is the Synthesized Characters count calculated for Azure's Text to Speech service when generating speech from an SSML document? What are the specific rules? I converted the following SSML file into speech: …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-25T22:42:30.5166667+00:00
ggg 0 Reputation points
commented 2025-07-27T21:51:05.3266667+00:00
ggg 0 Reputation points
0 answers

I need a local STT and TTS model partner for my CRM product

I need a local STT and TTS model partner for my CRM product Essential features: 1, Speech to text 2, Support for websocket access 3, Support for speaker identification Key features: 1, Support for real-time translation 2, Support for downloading audio…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,098 questions
asked 2025-07-29T22:07:04.68+00:00
Akansha Kapoor 0 Reputation points