Questions tagged [mozilla-deepspeech]

Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.

Code: https://github.com/mozilla/DeepSpeech

72 questions
14
votes
2 answers

How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?

I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…
4
votes
1 answer

DeepSpeech giving bad results

I am new to DeepSpeech i followed this link to create Speech to text code, but my results are no where near to the original speech. I am using Deepspeech 0.6.1 and have installed the relevant pretrained model. I am using this link to create my wav…
Ironman
  • 1,270
  • 2
  • 12
  • 35
4
votes
1 answer

How to install Mozilla's implementation of Baidu deep speech on Windows?

I'm on Windows 10, and when I try to install Mozilla's implementation of Baidu Deep speech as shown here. Using: pip install deepspeech I get this error : "Could not find a version that satisfies the requirement deepspeech (from versions:) No…
3
votes
1 answer

How do I get started training a custom voice model with Mozilla TTS on Ubuntu 20.04?

I'd like to create a custom voice in Mozilla TTS using audio samples I have recorded but am not sure how to get started. The Mozilla TTS project has documentation and tutorials, but I'm having trouble putting the pieces together -- it seems like…
GuyPaddock
  • 1,034
  • 12
  • 14
3
votes
1 answer

Converting Mozilla DeepSpeech model to use in tensorflow.js environment

I have been attempting to convert a Mozilla Deepspeech trained model for use in tensorflow.js, in ml5.js soundClassifier layer. My understanding is that Mozilla DeepSpeech uses TensorFlow. I have been trying to follow the hints found…
Allasso
  • 595
  • 4
  • 15
3
votes
1 answer

How to use mozilla deepspeech to convert speech to text using it's pre-trained model?

I want to convert speech to text using mozilla deepspeech. But the output is really bad. I have downloaded mozilla's pre trained model and then what i have done is this: BEAM_WIDTH = 500 LM_WEIGHT = 1.50 VALID_WORD_COUNT_WEIGHT =…
Amit Joshi
  • 61
  • 1
  • 1
  • 5
2
votes
0 answers

My CTC loss model's loss stagnates and then outputs only blank characters

I am trying to implement BaiDu's DeepSpeech1 in keras using CTC loss, my code is below: class dataGen(Sequence): # data generator for Mozilla common voice def __init__(self, audiopaths, transcripts, batch_size): self.x = audiopaths …
2
votes
1 answer

Bazel build not respecting my git submodules

I'm running a bazel build in a project that is composed of git submodules, with the following structure: /work/ ├── tensorflow/ [git submodule] └── train/ └── DeepSpeech/ [git submodule] └── native_client/ The build command looks like…
2
votes
0 answers

DeepSpeech: Distinguish speakers

I try to use DeepSpeech for call center, to dictate and summary conversation. In this scenario, there are always more than one speaker, usually two speakers. Is there any way to distinguish the speakers while DeepSpeech is recognizing the…
jin chong
  • 81
  • 5
1
vote
1 answer

while I was trying to train a DeepSpeech model on google colab, I'm getting an error saying that .whl file is not suported

commands i used !wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/ds_ctcdecoder-0.9.3-cp36-cp36m-manylinux1_x86_64.whl !pip install /content/~path~/ds_ctcdecoder-0.9.3-cp36-cp36m-manylinux1_x86_64.whl this gives me an…
1
vote
0 answers

how to pass chunk of audio files to Mozilla DeepSpeech web socket?

when a live speech is going on, I want to separate it into a chunk of short mp3 files and send it to a Mozilla DeepSpeech WebSocket for transcribing (speech is conducted using the device microphone)
1
vote
1 answer

Not able to train the Deepspeech model on Windows

I have tried to train Deepspeech model on Windows, as I can not use Linux. But, I am not able to train I am getting error File "E:/deepspeech-german-master/DeepSpeech/training/deepspeech_training/train.py", line 30, in from…
1
vote
1 answer

Build TFLite without ruy for android

I'm trying to analyze DeepSpeech's (a third-party library that uses TensorFlow and TFLite) performance on android devices and had built it successfully as they mentioned in their docs. After I read the source codes, I found out that tensorflow uses…
1
vote
1 answer

DeepSpeech-Polyglot: Signals.SIGSEGV: 11

I'm trying to generate the scorer of the DeepSpeech-Polyglot-Project. I have followed every step of the documentation but when i run: python3 /DeepSpeech/data/lm/generate_lm.py --input_txt /DeepSpeech/data_prepared/texts/${LANGUAGE}/clean_vocab.txt…
Chiara
  • 11
  • 1
1
vote
1 answer

How to check the number of parameters in Mozilla's DeepSpeech model?

I failed to find a simple way to print out how many parameters in the model and proposals to print it in release notes did not find the warm…
banderlog013
  • 965
  • 11
  • 19
1
2 3 4 5