Speech Services

Showing 27

Help with audio file slicing

Currently I can submit a word or phrase with an audio file and receive back the time code of the word or phrase. We need the time codes of the periods, question marks, and exclamation marks. In other words, we want to upload a transcript with the ...

over 3 years ago in Speech Services 2 Planned for future release

11 VOTE

Punctuation Needed for IBM Speech to Text Service

I have a customer that is a daily magazine on the web as well as a podcast network. They offer analysis and commentary about politics, news, business, technology, and culture. They are considering our IBM Speech to Text (STT) solution to provide a...

over 3 years ago in Speech Services 0 Planned for future release

1 VOTE

Add flag to instances to indicate if have customizations

When building a UI for Phone integration -- the reasons to select a particular STT or Voice instance over another are the plan and whether it has customizations available. At this time to, when a user selects a speech instance, you must do several...

over 3 years ago in Speech Services 0 Planned for future release

1 VOTE

Retreive sound file previously streamed to Watson Speech-to-Text

When working on improving our product, we'd like to capture field failures that could be used to train our custom speech-to-text model. We have a wake word (handled locally) and then we establish a connection to IBM Watson's Speech-to-Text service...

over 4 years ago in Speech Services 2 Not under consideration

1 VOTE

modification request for UI of the AI Minute asset

See attached document.

over 4 years ago in Speech Services 1 Not under consideration

4 VOTE

Add m4a speech to text support

This is a great format for content creation apps in iOS, why not support it?I have to get around by converting to wav or mp3 adding more waiting time

almost 5 years ago in Speech Services 1 Not under consideration

6 VOTE

Phoneme timings in Text to Speech service

We'd like to use the text to speech service to control an animatronic. The animatronic has a mouth and needs to manipulate its lips and jaws as it's speaking and Amazon had phoneme and viseme support which is what we were using. However, we're swi...

about 5 years ago in Speech Services 2 Planned for future release

2 VOTE

Utterance segmentation made sensitive to speaker identity (features)

Utterance segmentation appears to be entirely independent of (the features used for) speaker labeling. Specifically, it was noticed that even though the speaker labeling correctly identifies that a new speaker (very clear because it goes from male...

over 5 years ago in Speech Services 1 Functionality already exists

0 VOTE

Gender recognition/identification/classification of speech

Enhance feature to understand the gender & also age of person speaking for further analysis.

over 5 years ago in Speech Services 0 Not under consideration

18 VOTE

Ability to move fully trained STT models between separate instances (i.e. DEV vs. PROD or separate accounts)

Customers for best practice reasons keep environments separate (DEV vs. PROD) for security, performance, testing & availability of the model. Training STT models can take hours to train. Not utilizing fully trained models results in delays ...

over 5 years ago in Speech Services 4 Not under consideration

Please enter your email address

FILTER BY CATEGORY