Speech Recognition

Speech recognition is the process by which a computer, using special software, takes in what one or more people say and translates it into a form the computer can act on. Just as clicking a mouse, typing on a keyboard, or pressing a key on a phone keypad provides input to an application, speech recognition lets you provide input by talking. Speech is simply another user interface, an input method like a mouse or a keyboard. According to Webster, speech recognition simply “converts spoken words to text”. The technology also goes by other names, such as automatic speech recognition, computer speech recognition, and speech to text.

Speech recognition works like this: you speak into a microphone connected to a computer that is running a speech recognition program. Examples of such software include VoiceAttack (used in gaming), Trigamtech (with features for medical users), and WRSToolkit (for dictionary and grammar). As you say what you would like the computer to do, the words appear on your screen. One source I came upon during my research stated that “You can speak at your normal speaking speed (about 120 words per minute) and the computer "guesses" using mathematical algorithms what word you mean in that context from what it knows about the English language.” A speech recognition program will not always return exactly what you asked for; it tries to understand what it has been told and, in the end, gives back probabilities for what it thinks you were asking for.
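The idea of a computer "guessing" a word from its context and returning probabilities can be illustrated with a small sketch. This is a toy example only, not how any of the programs named above actually work: the candidate words, the acoustic scores, the word-pair counts, and the function name rank_candidates are all invented for illustration. It combines how closely each candidate matches the sound with how often that word followed the previous word in some imagined sample of English, then ranks the candidates.

```python
# Toy illustration only: the candidate words, acoustic scores, and word-pair
# counts below are invented for this example, not taken from any real system.

# How well each candidate word matches the sound that was heard (assumed values).
ACOUSTIC_SCORES = {"two": 0.34, "too": 0.33, "to": 0.33}

# How often each word followed the previous word in an imaginary sample of English.
BIGRAM_COUNTS = {
    ("want", "to"): 90,
    ("want", "two"): 9,
    ("want", "too"): 1,
}


def rank_candidates(previous_word, acoustic_scores, bigram_counts):
    """Return (word, probability) pairs, most likely first."""
    combined = {}
    for word, acoustic in acoustic_scores.items():
        # Add 1 to every count so an unseen word pair never gets probability zero.
        context = bigram_counts.get((previous_word, word), 0) + 1
        combined[word] = acoustic * context
    # Normalise the combined scores so they sum to 1 and read as probabilities.
    total = sum(combined.values())
    ranked = [(word, score / total) for word, score in combined.items()]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# The three candidates sound almost identical, so the context decides.
ranking = rank_candidates("want", ACOUSTIC_SCORES, BIGRAM_COUNTS)
for word, probability in ranking:
    print(f"{word}: {probability:.2f}")
```

Because the three words sound nearly the same, the acoustic scores barely separate them; in this made-up data it is the preceding word "want" that pushes "to" to the top of the list.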

Speech recognition has several starting points and lines of development. The technology is often traced back to Alexander Graham Bell’s inventions in the 1870s. Bell and his cousins invented a recording device, and Thomas Edison later invented a similar device, which he called the “Ediphone”. These machines recorded dictated letters for a secretary to type, a step toward a machine that could automatically transcribe the sound of a human voice. Since the time of Bell and Edison, various technologies have emerged that have moved speech recognition along. In the 1990s, Cambridge University developed the most widely used software for automatic speech recognition research. The goal of many researchers is to develop a machine that performs and responds like a human being. For now, the technology is used in providing telephone support, writing medical reports, answering financial questions, and in a variety of tasks requiring a human-to-machine interface, such as providing updated travel...
