Speech recognition (SR) is an interdisciplinary subfield of computational linguistics that draws on knowledge and research from linguistics, computer science, and electrical engineering. Its goal is to develop methodologies and technologies that enable computers and computerized devices, such as smart devices and robots, to recognize spoken language and translate it into text.
Speech recognition is the process of analyzing an audio signal stream and converting it to text. This includes splitting the input into meaningful units of information (tokenization) and judging the relevance of a signal. The latter means distinguishing between irrelevant signals (e.g. background noise) and the words to be recognized.
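As a rough illustration of judging the relevance of a signal, the sketch below splits a stream of audio samples into fixed-size frames and marks a frame as relevant when its root-mean-square (RMS) energy exceeds a threshold. This is only one simple heuristic, not a description of how any particular recognizer works. The function names, the frame size of 160 samples, the threshold of 0.1, and the synthetic test signal are all assumptions chosen for the example.

```python
import math


def frame_energies(samples, frame_size):
    """Split samples into non-overlapping frames and return each frame's RMS energy.

    Trailing samples that do not fill a complete frame are ignored.
    """
    energies = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        rms = math.sqrt(sum(s * s for s in frame) / frame_size)
        energies.append(rms)
    return energies


def relevant_frames(samples, frame_size=160, threshold=0.1):
    """Return one boolean per frame: True if the frame's RMS energy exceeds the threshold.

    Both default values are illustrative assumptions, not recommended settings.
    """
    return [energy > threshold for energy in frame_energies(samples, frame_size)]


def make_demo_signal(frame_size=160):
    """Build a synthetic signal: one quiet frame, one loud frame, one quiet frame."""
    quiet = [0.01 * math.sin(0.3 * n) for n in range(frame_size)]
    loud = [0.5 * math.sin(0.3 * n) for n in range(frame_size)]
    return quiet + loud + quiet


if __name__ == "__main__":
    # Only the loud middle frame is flagged as relevant.
    print(relevant_frames(make_demo_signal()))  # [False, True, False]
```

A fixed energy threshold cannot tell loud background noise from quiet speech, so it only approximates the distinction described above; it is meant to show the idea of filtering frames before any words are recognized.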
Ideally, a speech recognition or speech-to-text mechanism should not be biased towards a specific voice; it should be capable of recognizing arbitrary voices.
Further information