Technology

The pros and cons of voice recognition software

Tuesday, December 4th, 2007

Science fiction films and TV shows have long touted the virtues of voice recognition technology, perhaps the most popular example being Star Trek: The Next Generation. Unlike Captain Kirk, who had to rely on subordinates to do his bidding, Captain Picard would simply call out, “Computer?” and the ship’s invisible intelligence would activate, jumping to obey his every command. It’s clear why this type of technology appeals to science fiction writers and fans; who wouldn’t want to interact with artificial intelligence as easily as we interact with other human beings? No more memorizing keyboard commands, no more texting on minuscule cell phone number pads, no more waiting for a computer to boot up/connect to a wireless network/open a large file. All we have to do is ask.

This voice-based interface is becoming less of a fantasy and more of a reality. Voice recognition software is improving greatly, and while we won’t likely be jumping to warp speed anytime soon, we may experience a hands-free voice activated interface within a year. One of the companies working in this vein is Nuance, who recently released Dragon NaturallySpeaking 9, a software application that allows you to speak at a normal speed while it “types” your words. Their ingenious website allows you to challenge the software to a typing race (which you will lose, trust me), showing off how accurate and fast their product is at recognizing speech, including unusual words like “aardvark”. In addition to typing applications, Nuance has a a voice control product which allows you to operate a mobile device with spoken commands. You can send short emails, set appointments in a calendar, place calls, and access the web with phrases like, “Go to website: www.cnn.com”.

Voice recognition technology could be a huge advantage for a couple of reasons. Wouldn’t it be great to access and answer emails during your morning commute, hands free? Or be able to dial a phone number by speaking the name of your friend or client? Not only would it cut down on the time we spend actually touching a computer, but it would allow more open access to technology for many people around the globe. Populations with low computer-literacy rates, like Grandma and Grandpa, would be able to stay in touch with family, access health information, and play internet bingo. All jokes aside, though, those with visual or physical impairments or in developing countries (where previous experience with computers is limited) will have an easier time operating computers and mobile phones with the further development of this technology.

Perhaps some of the more alert readers will have spotted a disconnect between the current capabilities of speech recognition and the Star Trek version. In the movies this technology seemed to easily recognize conversational language, intuitively carrying out orders given in any configuration. But what about slang? What about foreign languages, or even regional accents? Will a voice-activated dashboard device be able to “pahk the cah at Hahvahd Yahd?” What about timber and tone of voice? Obviously, these are problems that haven’t yet been fully addressed. We’ve all been frustrated by a telephone menu where we’re forced to speak commands, say our PIN number, etc., and as a result we’re unable to talk to a real person. Imagine if there was no “real” person to eventually reach by shouting, “CUSTOMER SERVICE!” into the handset. Such a possibility could become a reality, and it could cause major headaches as it did for one man. His bank’s voice recognition software froze his account because it “thought” he was a female. Not to mention security concerns about using your voice as a type of fingerprint to establish your identity. Clearly, our “intelligent” technology isn’t as fallible as we would like it to be.

Leading companies in the field of web applications are already being encouraged to make their products as accessible as possible to people who, for one reason or another, do not have physical access to a computer. Not only will it be a major step for those with physical impairments or handicaps, it will also benefit users who want to access a computer hands free (when driving, for medical procedures, for military operations, etc). However, a balance is required for voice recognition software to be effective without being overbearing. As with any new trend, it should be implemented slowly with great concern for security, and it should never be the only barrier between us and our email, internet access, or bank accounts. However, if ever there was a case for the good outweighing the bad, this might be it. Voice commands could allow a whole new set of users to experience the connectivity and benefits of a digital lifestyle.

By Haley January Eckels