Questions tagged [google-cloud-vision]

Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy to use REST API.

Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy to use REST API. It quickly classifies images into thousands of categories (e.g., "sailboat", "lion", "Eiffel Tower"), detects individual objects and faces within images, and finds and reads printed words contained within images.

You can build metadata on your image catalog, moderate offensive content, or enable new marketing scenarios through image sentiment analysis. Analyze images uploaded in the request, and in upcoming releases, integrate with your image storage on Google Cloud Storage. Documentation

770 questions
9
votes
4 answers

Cloud Vision API - PDF OCR

I just tested the Google Cloud Vision API to read the text, if exist, in a image. Until now I installed the Maven Server and the Redis Server. I just follow the instructions in this…
Christian Salvador
  • 189
  • 1
  • 1
  • 11
8
votes
1 answer

vision-client doesn't supprt api-key

It looks that google-cloud vision Python client (google.cloud.vision.client.Client) doesn't have an option to accept api-key. https://googlecloudplatform.github.io/google-cloud-python/stable/vision-client.html How can I use the client with api-key…
etusji
  • 128
  • 3
8
votes
1 answer

AggregateException when calling GetApplicationDefaultAsync()

I am trying to run the sample Vision API project. I basically copied and pasted the code Program.cs into my application and executed it. This line (which is line #36-#37 in Program.cs) GoogleCredential credential =…
Rakete1111
  • 42,521
  • 11
  • 108
  • 141
7
votes
1 answer

Is there a way to see estimated time for training a model in Google AutoML Vision?

I'm attempting to train a model to identify certain tags in images. I tried using the 1 hour free version and after an hour the training ended. The results weren't as accurate as I would like, so I took the plunge and selected the option which did…
Jay Song
  • 73
  • 5
7
votes
3 answers

Cloud Vision API Client threw an OS Error "too many open files"

I have met an Error of "Too many open files" when I run label detection via Cloud Vision API Client with Python. When I asked this probrem on GitHub before this post, the maintainer gave me an advice that the problem is general Python issue rather…
katsuya
  • 71
  • 2
7
votes
3 answers

Google vision Text Detection response to be line by line

I am using the Google vision api to perform text recognition on receipt images. I am getting some nice results returned but the format in which the return is quite unreliable. If there is a large gap between text the readout will print the line…
Wrumble
  • 221
  • 1
  • 3
  • 13
7
votes
0 answers

Google Cloud Vision OCR API returning incorrect values for bounding box/vertices

I'm using the "TEXT_DETECTION" option from the Google Cloud Vision API to OCR some images. The bounding box around individual characters is sometimes accurate and sometimes not, often within the same image. Is this a normal side-effect of a…
Mark Bench
  • 101
  • 3
7
votes
2 answers

Does Google Cloud Vision OCR API have better accuracy and performance than Tesseract OCR API

I have integrated Google Cloud Vision API in my java application for text recognition from complex formatted documents. One of my colleague suggested to use "Tesseract API".Can anyone please give difference between these two API's.And which is…
nik
  • 1,126
  • 3
  • 14
  • 27
7
votes
1 answer

React Native component for Google Cloud Vision API - Text Detection

I am using React Native's Image Picker component to capture images on my app. Before showing the picture I want to parse it using Google Cloud Vision's Text Detection API. I've been searching on components in React Native but no result. Does anybody…
Walter
  • 71
  • 1
  • 2
6
votes
3 answers

Text extraction - line-by-line

I am using Google Vision API, primarily to extract texts. I works fine, but for specific cases where I would need the API to scan the enter line, spits out the text before moving to the next line. However, it appears that the API is using some kind…
6
votes
1 answer

How to enable Google Vision API to access Google Cloud Storage Bucket within same project

I have uploaded some test images to a Google Cloud Bucket, but don't want to make them public (which would be cheating). When I try to run a rest call for Google Vision API I get: { "responses": [ { "error": { "code": 7, …
smackenzie
  • 2,476
  • 3
  • 29
  • 79
6
votes
3 answers

Google Cloud vision API: "Request had insufficient authentication scopes."

Hi I'm trying to use google's cloud vision API, specifically this example: https://cloud.google.com/vision/docs/label-tutorial#label_tutorial_1 I followed this tutorial: https://cloud.google.com/vision/docs/getting-started#set_up_a_service_account…
6
votes
5 answers

Google Cloud Vision API "PERMISSION_DENIED"

I am trying Google Cloud Vision API (beta) and it is returning "Permission Denied" message. But the "Cloud Vision API" is enabled for the project. Any help is appreciated. Error Details from Google APIs Explorer 403 OK - Show headers - { "error":…
Binoy Mathew
  • 123
  • 1
  • 2
  • 7
5
votes
1 answer

How to detect handwriting using Google Cloud Vision API

TL;DR: how can I detect the presence of handwriting in an image? I'm using Google's Python Vision API to scan for text in images, with generally good results. Most of the time the images contain printed text, but sometimes there is handwriting. As…
5
votes
2 answers

Google Cloud Vision - Which region does Google upload the images to?

I am building an OCR based solution to extract information from certain financial documents. As per the regulation in my country (India), this data cannot leave India. Is it possible to find the region where Google Cloud Vision servers are…
1
2
3
51 52