Questions tagged [speech-recognition]

Speech recognition (SR) is the interdisciplinary subfield of computational linguistics that draws on knowledge and research from linguistics, computer science, and electrical engineering to develop methodologies and technologies that enable computers and computerized devices, such as smart devices and robots, to recognize spoken language and translate it into text.

Speech recognition describes the process of analyzing an audio signal stream and converting it to text. This includes splitting the input into meaningful bits of information (tokenization) and judging the relevance of a signal. The latter means distinguishing between irrelevant signals (e.g. noise in the background) and words to be recognized.
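
As a rough illustration of the "judging the relevance of a signal" step, the sketch below splits a sample stream into fixed-size frames and keeps only runs of frames whose RMS energy exceeds a threshold. This is a deliberately simple energy-based approach, not how any particular recognizer works; the function names, the 160-sample frame size, and the 0.1 threshold are all assumptions chosen for the example.

```python
import math


def frame_energies(samples, frame_size):
    """Split samples into non-overlapping frames and return each frame's RMS energy."""
    energies = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energies.append(math.sqrt(sum(s * s for s in frame) / frame_size))
    return energies


def speech_segments(samples, frame_size=160, threshold=0.1):
    """Return (start_frame, end_frame) pairs, end exclusive, for runs of loud frames.

    frame_size and threshold are illustrative values, not tuned ones.
    """
    energies = frame_energies(samples, frame_size)
    segments = []
    start = None
    for index, energy in enumerate(energies):
        if energy > threshold and start is None:
            start = index                      # a loud run begins
        elif energy <= threshold and start is not None:
            segments.append((start, index))    # the loud run ends
            start = None
    if start is not None:                      # signal ended while still loud
        segments.append((start, len(energies)))
    return segments


# Synthetic input: low-level background noise, a 440 Hz tone, then noise again.
quiet = [0.01] * 320
tone = [0.5 * math.sin(2 * math.pi * 440 * n / 16000) for n in range(480)]
signal = quiet + tone + quiet

print(speech_segments(signal))
```

Real systems usually combine several features (spectral shape, zero-crossing rate, trained models) rather than relying on energy alone, since loud background noise would pass this check.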

Ideally, speech recognition or speech-to-text mechanisms should not be biased towards a specific voice. They should be capable of recognizing arbitrary voices.

Further information

4857 questions
126
votes
4 answers

How can I use speech recognition without the annoying dialog in android phones

Is this possible without modifying the Android APIs? I've found an article about this. There's a comment saying that I should modify the Android APIs, but it didn't say how to make the modification. Can anybody give me some suggestions on how…
Jim31837
  • 1,599
  • 3
  • 12
  • 11
84
votes
7 answers

Remove quotes from String in Python

I have Python code that recognizes speech using the Google STT engine and gives me back the results, but I get the results as strings with "quotes". I don't want those quotes in my code, as I will use it to run many commands and it doesn't work. I…
Alok Naushad
  • 975
  • 1
  • 6
  • 9
79
votes
4 answers

What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

There are two similar namespaces and assemblies for speech recognition in .NET. I’m trying to understand the differences and when it is appropriate to use one or the other. There is System.Speech.Recognition from the assembly System.Speech (in…
Michael Levy
  • 12,767
  • 15
  • 60
  • 98
78
votes
8 answers

Offline Speech Recognition In Android (JellyBean)

It looks as though Google has made offline speech recognition available from Google Now for third-party apps. It is being used by the app named Utter. Has anyone seen any implementations of how to do simple voice commands with this offline speech…
rmooney
  • 5,597
  • 3
  • 25
  • 28
58
votes
2 answers

Delay when using android.speech.SpeechRecognizer in DICTATION_MODE

I'm using android.speech.SpeechRecognizer in DICTATION_MODE to recognize commands over a long period of time. In this mode, the call to the callback method onPartialResults is delayed much more than in normal mode. Does anybody know why this happens and…
acimutal
  • 1,935
  • 2
  • 14
  • 22
57
votes
5 answers

Android: Speech Recognition without using google server

I want to develop a speech recognizer in Android that works offline. As Android's built-in speech recognizer uses a Google server, which needs an internet connection, I want an alternative that works in the absence of internet. Please suggest me…
Swathi EP
  • 3,766
  • 6
  • 24
  • 25
55
votes
11 answers

C# Speech Recognition - Is this what the user said?

I need to write an application which uses a speech recognition engine -- either the built-in Vista one, or a third-party one -- that can display a word or phrase, and recognise when the user reads it (or an approximation of it). I also need to…
RichieACC
  • 2,116
  • 5
  • 29
  • 40
54
votes
4 answers

Android Speech Recognition as a service on Android 4.1 & 4.2

I have managed to get continuous speech recognition working (using the SpeechRecognizer class) as a service on all Android versions up to 4.1. My question concerns getting it working on versions 4.1 and 4.2 as it is known there is a problem in that…
Graham Laming
  • 1,123
  • 2
  • 13
  • 18
53
votes
3 answers

Any OpenCV-like C/C++ library for Audio processing?

Is there anything out there that resembles (in spirit) OpenCV, but for processing audio and deriving some intelligence from it? Capabilities could range from: Multiplatform audio capture and audio playback DSP - Audio filters Tone…
bdutta74
  • 2,693
  • 3
  • 25
  • 49
51
votes
3 answers

ImportError: No module named request

I am trying to install the Python SpeechRecognition package on my machine. When I try to install it with pip install SpeechRecognition, I get the following error: import json, urllib.request ImportError: No module named request. And then I…
Mulagala
  • 6,713
  • 10
  • 23
  • 45
47
votes
16 answers

Voice Recognition Software For Developers

Well the docs finally said it, I need to take it easy on my wrist for a few months. Being that I'm a .NET Developer this could end my livelihood for a little while, something I'm not anxious to do. That said, are there any good handsfree options for…
tekiegreg
  • 1,646
  • 5
  • 25
  • 41
45
votes
2 answers

How to use Speech Recognition inside the iOS SDK?

I know that there is no public API for the Siri services, but is there an API for simple speech recognition? So if I have a text field and the user taps on that text field, a keyboard with the typical microphone button appears and if he pressed…
Sebastian Boldt
  • 4,947
  • 9
  • 46
  • 60
43
votes
5 answers

Voice Recognition stops listening after a few seconds

I tried a lot but can't figure it out, so I hope you can help me. I am trying to build my own voice recognition app, which doesn't show the dialog. I already wrote some code and it works quite fine, but my problem is that the recognizer seems to…
Chris
  • 501
  • 1
  • 5
  • 14
40
votes
3 answers

What does gs protocol mean?

I'm playing with the Google Speech Recognition API. After successfully completing the Getting Started guide, I'm trying to understand and make some changes to this first example, but I don't know what the "gs" protocol is or how to set it to use my own audio…
Salvador Rueda
  • 615
  • 2
  • 6
  • 13
40
votes
4 answers

What are language codes in Chrome's implementation of the HTML5 speech recognition API?

Chrome implemented the HTML5 speech recognition API. Many languages are supported. I want to know which languages are supported and the code corresponding to each language, which is used in the HTML element's lang attribute. For instance: Polish…
weilou
  • 3,869
  • 8
  • 37
  • 56