Speech Services

Showing 13

More language options for speech to text (Turkish)

I just enrolled for Speec to text API. Unfortunately the languages are limited to 13. I am wondering if IBM are planning to add more languages whereas Google supports 60. Turkish is one of the languages i would like to see.

over 1 year ago in Speech Services 1 Not under consideration

24 VOTE

Add Turkish to Speech-to-Text and Text-to-Speech services

According to Wikipedia Turkish is the 13th most spoken language in the world. It'd be very helpful to add Turkish to the Speech services since it is also often required in some client engagements!

over 1 year ago in Speech Services 4 Not under consideration

1 VOTE

Watson Speech to Text should return timestamps accurate to milliseconds for transcription

Real-life scenario: Researchers from Brandeis University, Boston University, Harvard, Boston College, and Northeastern University are investigating cognitive aging and biomarkers of dementia. Currently, they have been hand-scoring cognitive interv...

over 2 years ago in Speech Services 2 Not under consideration

1 VOTE

usability of the new IBM Watson tts demo site

When we did the project "spaceships with opinions" the IBM Watson tts-demo site was a great help. So I feel I have a responsibility to tell you how I think the new site is failing. https://www.ibm.com/demos/live/tts-demo/self-service/home It works...

about 3 years ago in Speech Services 3 Not under consideration

1 VOTE

Retreive sound file previously streamed to Watson Speech-to-Text

When working on improving our product, we'd like to capture field failures that could be used to train our custom speech-to-text model. We have a wake word (handled locally) and then we establish a connection to IBM Watson's Speech-to-Text service...

over 4 years ago in Speech Services 2 Not under consideration

1 VOTE

modification request for UI of the AI Minute asset

See attached document.

over 4 years ago in Speech Services 1 Not under consideration

4 VOTE

Add m4a speech to text support

This is a great format for content creation apps in iOS, why not support it?I have to get around by converting to wav or mp3 adding more waiting time

almost 5 years ago in Speech Services 1 Not under consideration

0 VOTE

Gender recognition/identification/classification of speech

Enhance feature to understand the gender & also age of person speaking for further analysis.

over 5 years ago in Speech Services 0 Not under consideration

18 VOTE

Ability to move fully trained STT models between separate instances (i.e. DEV vs. PROD or separate accounts)

Customers for best practice reasons keep environments separate (DEV vs. PROD) for security, performance, testing & availability of the model. Training STT models can take hours to train. Not utilizing fully trained models results in delays ...

over 5 years ago in Speech Services 4 Not under consideration

1 VOTE

In the Speech-To-Text service, improve that output generated by speaker_labels option.

I have a major call center use case that requires text transcription of 2 distinct voices, a call center rep. and a customer. I need to group the transcribed text by each of these 2 people. The speaker_labels feature returns a list of time ranges,...

almost 6 years ago in Speech Services 1 Not under consideration

Please enter your email address

FILTER BY CATEGORY