AT&T Voice Recognition API Available in June
Written by Alex Armstrong   
Tuesday, 24 April 2012

AT&T is giving developers the tools to build apps and services that use your voice to communicate with smartphones, televisions and other digital devices.

More than a million hours of research and development has already gone into AT&T's pioneering Watson speech recognition technology that has already been powering advanced speech services in the marketplace for decades. For example it powers their mobile directory search and voice mail to text services.

attlogo

Now AT&T has announced that it plans to make available several of its Watson Speech APIs. Presumably the name relates to Edison's assistant who received the first phone call "Mr. Watson come here. I need you". Also not to be confused with IBM's Watson question answering program. 

As well as the API it plans to release a speech kit SDK to capture a user’s spoken words and send them into the network for transcription, for other developers to incorporate voice recognition and transcription capabilities in their own apps.

The first APIs, coming in June, according to the announcement will be focused on seven areas:

  • web search
  • local business search
  • question and answer
  • voice mail to text
  • SMS
  • U-verse electronic programming guide,
  • dictation

Later on we can expect APIs for gaming, social media and other areas. The speech recognition technology uses the subject area to improve its accuracy by using an a restricted set of target words.

You can see what AT&T think of the technology in the following promo video:

 

More Information

Giving Developers a Voice with the Watson Speech APIs

Related Articles

Speech recognition breakthrough

 

 

attlogo

pico book

 

Comments




or email your comment to: comments@i-programmer.info

 

To be informed about new articles on I Programmer, subscribe to the RSS feed, follow us on Google+, Twitter, Linkedin or Facebook or sign up for our weekly newsletter.

 

Banner


CISA and NSA - Use Rust Or Perhaps Java
02/07/2025

The CISA and the NSA are urging us to adopt memory-safe languages (MSLs) for the sake of cybersecurity. You probably think they mean Rust but things aren't as clear cut as you might expect.



Google Introduces Gemini CLI Open-Source Agent
08/07/2025

Google is introducing Gemini CLI, an open-source AI agent that offers lightweight access to Gemini, Google's conversational chatbot that is based on Google's multimodal large language model  [ ... ]


More News

Last Updated ( Tuesday, 24 April 2012 )