Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.
Questions tagged [mozilla-deepspeech]
72 questions
14
votes
2 answers
How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?
I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…
![](../../users/profiles/9109418.webp)
Priyesh
- 341
- 1
- 7
4
votes
1 answer
DeepSpeech giving bad results
I am new to DeepSpeech i followed this link to create Speech to text code, but my results are no where near to the original speech. I am using Deepspeech 0.6.1 and have installed the relevant pretrained model. I am using this link to create my wav…
![](../../users/profiles/5685410.webp)
Ironman
- 1,270
- 2
- 12
- 35
4
votes
1 answer
How to install Mozilla's implementation of Baidu deep speech on Windows?
I'm on Windows 10, and when I try to install Mozilla's implementation of
Baidu Deep speech as shown here.
Using:
pip install deepspeech
I get this error : "Could not find a version that satisfies the requirement deepspeech (from versions:)
No…
![](../../users/profiles/9008922.webp)
Domy Kanyik-tesh
- 81
- 1
- 6
3
votes
1 answer
How do I get started training a custom voice model with Mozilla TTS on Ubuntu 20.04?
I'd like to create a custom voice in Mozilla TTS using audio samples I have recorded but am not sure how to get started. The Mozilla TTS project has documentation and tutorials, but I'm having trouble putting the pieces together -- it seems like…
![](../../users/profiles/4342230.webp)
GuyPaddock
- 1,034
- 12
- 14
3
votes
1 answer
Converting Mozilla DeepSpeech model to use in tensorflow.js environment
I have been attempting to convert a Mozilla Deepspeech trained model for use in tensorflow.js, in ml5.js soundClassifier layer. My understanding is that Mozilla DeepSpeech uses TensorFlow. I have been trying to follow the hints found…
![](../../users/profiles/2179138.webp)
Allasso
- 595
- 4
- 15
3
votes
1 answer
How to use mozilla deepspeech to convert speech to text using it's pre-trained model?
I want to convert speech to text using mozilla deepspeech. But the output is really bad.
I have downloaded mozilla's pre trained model and then what i have done is this:
BEAM_WIDTH = 500
LM_WEIGHT = 1.50
VALID_WORD_COUNT_WEIGHT =…
![](../../users/profiles/8646585.webp)
Amit Joshi
- 61
- 1
- 1
- 5
2
votes
0 answers
My CTC loss model's loss stagnates and then outputs only blank characters
I am trying to implement BaiDu's DeepSpeech1 in keras using CTC loss, my code is below:
class dataGen(Sequence): # data generator for Mozilla common voice
def __init__(self, audiopaths, transcripts, batch_size):
self.x = audiopaths
…
![](../../users/profiles/10651242.webp)
hhaefliger
- 472
- 2
- 16
2
votes
1 answer
Bazel build not respecting my git submodules
I'm running a bazel build in a project that is composed of git submodules,
with the following structure:
/work/
├── tensorflow/ [git submodule]
└── train/
└── DeepSpeech/ [git submodule]
└── native_client/
The build command looks like…
![](../../users/profiles/5578839.webp)
mathematiguy
- 21
- 3
2
votes
0 answers
DeepSpeech: Distinguish speakers
I try to use DeepSpeech for call center, to dictate and summary conversation.
In this scenario, there are always more than one speaker, usually two speakers. Is there any way to distinguish the speakers while DeepSpeech is recognizing the…
![](../../users/profiles/4948735.webp)
jin chong
- 81
- 5
1
vote
1 answer
while I was trying to train a DeepSpeech model on google colab, I'm getting an error saying that .whl file is not suported
commands i used
!wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/ds_ctcdecoder-0.9.3-cp36-cp36m-manylinux1_x86_64.whl
!pip install /content/~path~/ds_ctcdecoder-0.9.3-cp36-cp36m-manylinux1_x86_64.whl
this gives me an…
![](../../users/profiles/15524610.webp)
chamod rathnayake
- 63
- 1
- 6
1
vote
0 answers
how to pass chunk of audio files to Mozilla DeepSpeech web socket?
when a live speech is going on, I want to separate it into a chunk of short mp3 files and send it to a Mozilla DeepSpeech WebSocket for transcribing
(speech is conducted using the device microphone)
![](../../users/profiles/15524610.webp)
chamod rathnayake
- 63
- 1
- 6
1
vote
1 answer
Not able to train the Deepspeech model on Windows
I have tried to train Deepspeech model on Windows, as I can not use Linux. But, I am not able to train I am getting error
File "E:/deepspeech-german-master/DeepSpeech/training/deepspeech_training/train.py", line 30, in
from…
![](../../users/profiles/11622160.webp)
swati sharma
- 11
- 4
1
vote
1 answer
Build TFLite without ruy for android
I'm trying to analyze DeepSpeech's (a third-party library that uses TensorFlow and TFLite) performance on android devices and had built it successfully as they mentioned in their docs.
After I read the source codes, I found out that tensorflow uses…
![](../../users/profiles/6506806.webp)
user9886
- 13
- 2
1
vote
1 answer
DeepSpeech-Polyglot: Signals.SIGSEGV: 11
I'm trying to generate the scorer of the DeepSpeech-Polyglot-Project. I have followed every step of the documentation but when i run:
python3 /DeepSpeech/data/lm/generate_lm.py --input_txt /DeepSpeech/data_prepared/texts/${LANGUAGE}/clean_vocab.txt…
![](../../users/profiles/14999469.webp)
Chiara
- 11
- 1
1
vote
1 answer
How to check the number of parameters in Mozilla's DeepSpeech model?
I failed to find a simple way to print out how many parameters in the model and proposals to print it in release notes did not find the warm…
![](../../users/profiles/7599215.webp)
banderlog013
- 965
- 11
- 19