We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:
Post your ideas
Post ideas and requests to enhance a product or service. Take a look at ideas others have posted and upvote them if they matter to you,
Post an idea
Upvote ideas that matter most to you
Get feedback from the IBM team to refine your idea
Help IBM prioritize your ideas and requests
The IBM team may need your help to refine the ideas so they may ask for more information or feedback. The product management team will then decide if they can begin working on your idea. If they can start during the next development cycle, they will put the idea on the priority list. Each team at IBM works on a different schedule, where some ideas can be implemented right away, others may be placed on a different schedule.
Receive notification on the decision
Some ideas can be implemented at IBM, while others may not fit within the development plans for the product. In either case, the team will let you know as soon as possible. In some cases, we may be able to find alternatives for ideas which cannot be implemented in a reasonable time.
Watson Speech to Text should return timestamps accurate to milliseconds for transcription
Researchers from Brandeis University, Boston University, Harvard, Boston College, and Northeastern University are investigating cognitive aging and biomarkers of dementia. Currently, they have been hand-scoring cognitive interviews which is arduous. To automate some of the manual work, Watson Text to Speech service is being used to run audio transcriptions through the service while getting back transcriptions with timestamps of when a word starts to be spoken and when it is ended in speech. The timestamps returned from the service return with 2 decimal places which is not enough precision that is required to compare to prior work (which uses 3 decimal places -- precision is up to milliseconds).
The issue that researchers mentioned above are running into is comparing transcription timestamps generated by Speech to Text service to hand-scoring done for interviews prior to using Watson. The precision mismatch does not allow the researchers to use Watson effectively.
No workaround is possible since there is a mismatch in precision level returned by Watson Speech-to-text service.
Having timestamps returned with 3 decimal places (with millisecond precision) would enable the researchers to automate the longitudinal research with Watson Speech-to-Text service.
Watson Speech-to-Text service returning timestamps with milliseconds precision will enable customers around the world to get more accurate transcriptions and use the service more confidently in research globally. With precision, researchers will be more likely to use the service and cite it in publications.
Every user using Watson Speech-to-Text around the world will get more precise data points for transcription without their current downstream applications breaking due to this change.
Do not place IBM confidential, company confidential, or personal information into any field.