
In my conversations with clients, partners, and Interactive Voice Response (IVR) application developers, I am often struck by just how much of a bum rap speech recognition continues to undeservedly receive. Deploying an IVR application that uses speech recognition rather than one that relies exclusively on touchtone for input is still considered a risky endeavor.
The fear, understandably, is that callers will be frustrated by poor speech recognition and as a result will either hang up in disgust or angrily press the zero key to reach a human being. But almost always, that fear is based on personal, anecdotic "evidence", rather than sound, empirically-based research. What usually takes place is someone having a bad experience with a badly designed Voice User Interface (VUI) and the one thing they remember about that VUI is what made it novel: speech recognition.
In this issue of the VUI view, my aim is to make the case that the negative perception held about speech-enabled IVR applications is undeserved and unfounded. Indeed, I will argue that given two equally well designed VUIs, one relying exclusively on touch-tone and one that uses speech, the speech-enabled VUI will beat the touchtone-only VUI anytime.
How users feel about speech automated systems
First let's take a look at the bottom line: How has speech been received by IVR users?
The results below are drawn from a study commissioned by Nuance Communications and carried out by Harris Interactive, in which 326 interviews were conducted to gauge customer impressions and attitudes towards speech recognition systems. [1] [2]
Here are some of the results:
Two additional interesting observations about increased user-acceptance and preference under the right circumstance should be added to the above:
Speech vs. touchtone
So speech is relatively well received. But how does it stack up against traditional, time-tested touchtone-only systems?
Here are some results[1]:
Moreover, speech enabling a touchtone-only IVR system has been shown[3] to:
Why Speech is better
If you are interested in professional help with optimizing your voice applications, feel free to contact me at bouzid@angel.com or call 1-888-MYANGEL (1-888-692-6435) and say "Ahmed Bouzid".
References:
[1] http://www.speechtechmag.com/issues/8_5/cover/2399-1.html
[2] http://www.speechtechmag.com/whitepapers/nuance1.pdf
[3] http://www.speechtechmag.com/issues/8_4/cover/2185-1.html
[4] http://www.speechtechmag.com/whitepapers/convergys3.pdf
