Questions tagged [google-speech-api]

With Google Speech API you can convert speech to text file based or live stream

The https://cloud.google.com/speech/ and is part of https://cloud.google.com/products/ to allow for speech to text conversion.

When using a live stream words are returned almost realtime. This is limited by 1 minute or when paused for ~1 second. When using an asynchronous file the speech to text can be as long as 80 minutes. See https://cloud.google.com/speech/limits

For more see https://cloud.google.com/speech/docs/

805 questions
46
votes
10 answers

ImportError: No module named google.cloud

I am unable to import google.cloud.speech from google.cloud import speech I have installed it using : pip install --upgrade google-cloud-speech -t dir-name It is giving me below error while importing it from dir-name ImportError: No module named…
34
votes
1 answer

Unable to get results from Google text to speech api while streaming audio from web

I want to stream audio from the web and convert that to text using Python Google-cloud-speech API. I have integrated that in my Django channels code. For frontend, I have directly copied this code and the backend has this code (please see below).…
26
votes
4 answers

portaudio.h: No such file or directory

I got the following error while trying to install pyaudio using pip3 in ubuntu 16.04: Collecting pyaudio Downloading PyAudio-0.2.11.tar.gz Installing collected packages: pyaudio Running setup.py install for pyaudio ... error Complete output…
Monojit Sarkar
  • 433
  • 1
  • 4
  • 10
23
votes
5 answers

Request had invalid authentication credentials. Expected OAuth 2 access token error in cloud speech api

i have followed the google cloud speech api quickstart of requesting api by using curl -s -H "Content-Type: application/json" \ -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) \ …
Patel
  • 363
  • 1
  • 3
  • 10
19
votes
3 answers

Google Speech Recognition API: timestamp for each word?

It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said "one two three for five" in a WAV file.…
Basj
  • 29,668
  • 65
  • 241
  • 451
17
votes
1 answer

Streaming Audio in FLAC or AMR_WB to the Google Speech API

I need to run the google speech api in somewhat low bandwidth environments. Based on reading about best practices, it seems my best bet is to use the AMR_WB format. However, the following code produces no exceptions, and I get no responses in the…
Wesley
  • 4,955
  • 8
  • 35
  • 59
17
votes
2 answers

Is there a way to force Google Speech api to return only words as response?

I am using Googles this api :- https://www.google.com/speech-api/v2/recognize?output=json&lang="+ language_code+"&key="My key" for speech recognition and it's working very well. The issue is with numbers i.e, if I say one two three four the…
sunil sunny
  • 3,793
  • 4
  • 21
  • 49
16
votes
1 answer

Google Cloud Speech with Javascript

In documentation and tutorial for REST API (Google Sppech API for Node: https://cloud.google.com/nodejs/apis), so my question is how to use the Cloud Speech API in JavaScript. Someone used on any page with javascript? 2020-04-24 EDIT: The accepted…
16
votes
1 answer

How to use google speech recognition api in python?

Stack overflow might not be the best place to ask this question but i need help. I have an mp3 file and i want to use google's speech recognition to get the text out of that file. Any ideas where i can find documentation or examples will be…
Vivek Anand
  • 615
  • 1
  • 7
  • 15
16
votes
2 answers

Different between Google Speech API and Web Speech API

I am working on web speech recognition. And I found that Google provide a API which call "Google speech API V2" to developer. But I notice there is a limit on every day to use it. After that I found there is a native WEB Speech API also can…
Kim Wong
  • 1,717
  • 3
  • 14
  • 20
16
votes
1 answer

Google Speech API V2

I have followed this link to generate API keys, but the Speech API was not found on the console. The application example I'm following is here: https://github.com/gillesdemey/google-speech-v2/ I keep on getting an error: Your client does not have…
Gourish Goudar
  • 169
  • 1
  • 1
  • 4
13
votes
1 answer

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

I'm using for the first time the GCS Speech API for a project to convert a series of audio files to text. Each file has around 60 minutes and is a person talking continuously during the whole time. I've installed the GC SDK and I'm using it to…
CIRCLE
  • 3,695
  • 3
  • 32
  • 51
13
votes
1 answer

Is Google Cloud Platform available in China?

I want to find out if there are any restrictions accessing GCP in China. Is GCP all available to use in China? If not, why?
고정환
  • 131
  • 1
  • 1
  • 4
13
votes
2 answers

User data privacy in Google speech to text api / Google cloud speech solution

What level of data privacy is maintained when I used Google speech to text api or google cloud speech solution. I want to know this because I am working on an application which converts audio files into text and those files may contain user…
12
votes
1 answer

How to translate live streaming using google speech api?

This is the Google Speech API docs: https://cloud.google.com/speech/docs/sync-recognize I trried this API for 2 weeks, but still can't solve my main purpose (translate live streaming). I'm using PHP. (other language suggestion is allowed, I will…
plonknimbuzz
  • 2,362
  • 2
  • 15
  • 27
1
2 3
53 54