Speech recognition
'''Speech recognition''' technologies allow computers equipped with a source of sound input, such as a microphone, to interpret human speech, e.g. for transcription or as an alternative method of interacting with a computer.
Classification
Such systems can be classified as to:
- whether they require the user to "train" the system to recognise their own particular speech patterns or not,
- whether the system is trained for one user only or is speaker independent,
- whether the system can recognise continuous speech or requires users to break up their speech into discrete words,
- whether the system is intended for clear speech material, or is designed to operate on distorted transfer channels (e.g. cellular phones) and possibly with background noise or other speakers talking simultaneously, and
- whether the vocabulary the system recognises is small (on the order of tens or at most hundreds of words), or large (thousands of words).
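The classification axes above can be captured in a small record type, for example when comparing systems side by side. The following is only a minimal sketch: the class name, the field names, and the 1,000-word cut-off for a "large" vocabulary are illustrative assumptions and do not come from any real system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecognizerProfile:
    """Hypothetical record of the classification axes listed above."""
    requires_training: bool      # must the user train the system first?
    speaker_independent: bool    # works for any speaker, not just one
    continuous_speech: bool      # False means discrete, word-by-word input
    noise_robust: bool           # designed for distorted channels or noise
    vocabulary_size: int         # number of words the system recognises

    def vocabulary_class(self) -> str:
        # Assumed cut-off: the text only says "tens or hundreds" versus "thousands".
        return "large" if self.vocabulary_size >= 1000 else "small"


# Example: a hypothetical telephone directory-assistance system.
phone_system = RecognizerProfile(
    requires_training=False,
    speaker_independent=True,
    continuous_speech=False,
    noise_robust=True,
    vocabulary_size=200,
)
print(phone_system.vocabulary_class())
```

A record like this makes it explicit that the axes are independent of one another: a system can be speaker-independent yet limited to a small vocabulary, as many telephone applications are.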
Use
Commercial systems for speech recognition have been available off-the-shelf since the 1990s. Despite the apparent success of the technology, however, few people use such speech recognition systems on their desktop computers. By contrast, the use of speech recognition in telephone applications, such as travel booking and information, financial account information, and directory assistance, has been increasing as the cost of implementing such voice-activated systems has dropped.
It appears that most computer users can create and edit documents more quickly with a conventional keyboard, even though most people are able to speak considerably faster than they can type. Using keyboard and speech recognition together, however, can in some cases be more efficient than using either input alone. Additionally, heavy use of the speech organs results in vocal loading. Also, the typical office environment, with a high amplitude of background speech, is among the most adverse environments for current speech recognition technologies.
Large-vocabulary systems that are speaker-independent and/or designed to operate within an adverse environment have significantly lower recognition rates. As of 2003, the typical achievable recognition rate for large-vocabulary, speaker-independent systems is about 80%-90% in a clear environment, and can be as low as 50% in scenarios such as a cellular phone with background noise.
Technical Issues
Some of the key technical problems in speech recognition are that:
- Inter-speaker differences and intra-speaker variations are often large and difficult to account for. It is not clear which characteristics of speech are speaker-independent.
- Speech recognition systems are based on simplified stochastic models that do not match real speech accurately.
- The interpretation of many phonemes, words and phrases is context-sensitive. For example, phonemes are often shorter in long words than in short words. Words have different meanings in different sentences, e.g. "Philip lies" could be interpreted either as Philip being a liar, or as Philip lying on a bed.
- Intonation and speech timbre can completely change the correct interpretation of a word or sentence, e.g. "Go!", "Go?" and "Go." can clearly be recognised by a human, but not so easily by a computer.
- Words and sentences can have several valid interpretations such that the speaker leaves the choice of the correct one to the listener.
- Written language may need punctuation according to strict rules that are not strongly present in speech, and are difficult to infer without knowing the meaning (commas, ending of sentences, quotations).
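To give a rough idea of what a simplified stochastic model that uses context might look like, the sketch below scores candidate transcriptions with a toy bigram model and picks the most probable one. The probabilities, the candidate sentences, and the function names are all invented for illustration; a real recogniser would estimate such values from large amounts of data and combine them with acoustic scores.

```python
import math

# Hypothetical bigram probabilities P(next_word | previous_word).
# These numbers are made up for illustration only.
BIGRAM_PROB = {
    ("<s>", "recognise"): 0.4,
    ("<s>", "wreck"): 0.1,
    ("recognise", "speech"): 0.5,
    ("wreck", "a"): 0.3,
    ("a", "nice"): 0.2,
    ("nice", "beach"): 0.1,
}
UNSEEN_PROB = 1e-4  # assumed floor for word pairs missing from the table


def log_score(words):
    """Sum of log bigram probabilities for a sequence of words."""
    total = 0.0
    previous = "<s>"  # start-of-sentence marker
    for word in words:
        total += math.log(BIGRAM_PROB.get((previous, word), UNSEEN_PROB))
        previous = word
    return total


def best_candidate(candidates):
    """Return the candidate word sequence with the highest score."""
    return max(candidates, key=log_score)


# Two similar-sounding candidate transcriptions of the same audio.
candidates = [
    ["recognise", "speech"],
    ["wreck", "a", "nice", "beach"],
]
print(best_candidate(candidates))
```

A model of this kind only looks at neighbouring words, so it can prefer one word sequence over another but cannot decide which sense of "Philip lies" is meant, nor can it use intonation. That is the sort of mismatch with real speech described above.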