Alan Cantor
Although not yet a mature technology, speech recognition has great potential as a partial (or total) replacement for the keyboard and mouse. Nevertheless, many students who switch to speech input use it inefficiently, give up on it altogether, and may even develop permanent vocal injuries. Speech recognition failures are attributable, in part, to inadequate accommodation planning and unrealistic expectations. This session describes an 11-stage planning model that increases the likelihood that students will benefit from speech recognition; and reviews common misconceptions that prevent students from achieving success with the technology.
To highlight the concerns that should be considered when planning an accommodation that includes speech recognition, I present an eleven stage planning model:
Consider a wide range of accommodation options. Few people can be fully accommodated with one or two items; some need ten, twenty or more adaptations. Generate specific accommodation ideas from these fourteen categories:
¨ Building modifications ¨ Low-tech devices
¨ Environmental adjustments ¨ Alternative formats to print
¨ Spatial reorganization ¨ Transportation services
¨ Work station modifications ¨ Personal support services
¨ Computer-based assistive technologies ¨ Human resource strategies
¨ Software customization ¨ Employment/educational policy changes
¨ Electromechanical devices ¨ Training and retraining
· Is the technology appropriate to the student’s age and level of development? Is it realistic for the tasks that he or she hopes to accomplish? Is voice input really necessary? Would macro software, abbreviation expansion software, MouseKeys, or training in keyboard-only techniques be more effective than or a worthwhile adjunct to speech recognition?
· Individuals whose primary access means is the voice should be evaluated by a Speech Language Pathologist, and receive voice training.
· Does the software have commands for revising documents by voice? For students who write essays and other complex documents, the ability to revise may be more important than speed and compatibility.
· For individuals who use voice input as a primary access technique, double or quadruple the minimum RAM requirement. A fast CPU is helpful. Soundcard and microphone quality are of paramount importance.
· Note: microphone quality varies from unit to unit!
· For example, position the monitor so that the student is not forced to crane the neck or bend forward to read the screen.
· Uninstall superfluous software. Free system resources (e.g., in Windows 98 and ME, use MSCONFIG to disable non-essential start-up group items.)
· Run the “Audio Setup Wizard,” coach on proper microphone placement, coach on dictation techniques, and conduct initial training. Once completed, the student can begin experimenting with dictation.
· Tweak the voice files to match the student’s vocabulary and writing style. (1) Create vocabulary files containing lists of the student’s unique words, names and expressions. Hint: base the list on custom dictionary files. (2) Assemble files containing samples of the student’s writing. Run the “Vocabulary Builder” on both sets of files.
· Focus on learning basic dictation techniques and improving voice file accuracy. Resist the temptation to produce perfectly formatted, error-free documents. Staying relaxed, cultivating healthy work habits, speaking clearly and fixing misrecognitions are the best ways to gain precision, competence, and comfort with speech input.
· Good vocal hygiene includes performing “warm-up” and “cool-down” exercises, drinking fluids, limiting total time on a voice input system, and taking regular breaks.
· The voice files will already be quite accurate and will not improve significantly by correcting common words. Focus on learning to revise documents.
· Experiment. Tune the voice files by rerunning “Audio Wizard;” dictating for one or two minutes in “General Training;” removing nuisance words from the lexicon; and updating word-usage and style data by running the “Vocabulary Building” with newly-created files.
· Follow-up after two weeks, one month, and three months. Difficulties noted during follow-up can usually be traced to skipping or not completing an earlier stage.
Many individuals become disillusioned when faced with the reality of voice recognition. Others become competent at dictating simple texts (Stage 8 or 9), but cannot use features that would enable them to work more effectively. When accommodating a student with speech recognition, it may be necessary to emphasize the following principles:
Once good accuracy is achieved, do not try to perfect voice files. They will never be perfect.
Learning to think and speak in grammatically-correct sentences and paragraphs takes discipline and effort. Some people cannot do it.
Copying text into one’s usual word processor after a dictation session is easy.
To use speech recognition effectively, one must be able to do both.
It is faster to undo, select an entire utterance and dictate it again, or use “Resume With...”
Use “Insert Before...” and “Insert After...” instead of “Move” commands. Specifying two or more target words can increase precision, e.g., “Insert After can increase precision,” “Insert Before accept the limitations.”
Revise using “Select” instead of “Backspace...” and “Delete...” commands. Select the text and dictate; it is not necessary to explicitly delete selected text. Use “Select xxx Through yyy” to mark larger selections.
Revise phrases even if some words within a phrase are correct.
Speech recognition has tremendous — but as yet unfulfilled — potential as an educational accommodation for students with a range of disabilities, including upper-body mobility impairments, computer-induced repetitive strain injuries, and certain learning disabilities. Two strategies to help people realize the promise of the technology: (1) Be methodical and thorough in planning and implementing the accommodation so that individuals are well-prepared to operate a PC by voice; and (2) clearly understand how current voice input technologies do (and do not) work.
Cantor, Alan. (1996). The Future of Workplace Accommodations: Containing Costs and Maximizing Effectiveness. National Conference on Disability and Work Conference Proceedings. October 1996. Toronto.
Cantor, Alan. (1998). Disability in the Workplace: Effective and Cost-effective Accommodation Planning. NATCON 1998 Conference Proceedings. Toronto.
Cantor, Alan (2001). Speech Recognition: An Accommodation Planning Perspective. 2001 CSUN Technology and Persons with Disability Conference. Los Angeles. www.interlog.com/~acantor/csun2001speech-rec-paper.htm
Kambeyanda, Singer and Cronk. (1997). Potential Problems Associated with Use of Speech Recognition Products. Assistive Technology, Volume 9, Number 2, 95 - 102.
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
CANTOR
+ASSOCIATES
workplace accommodation consultants
Alan Cantor
President
Cantor + Associates Inc.
32 Queensdale Avenue
Toronto Ontario M4J 1X9
( 416 406 5098
fax 416 406 5498
acantor@interlog.com
www.interlog.com/~acantor