Questions tagged [computer-vision]

Use this tag for questions related to Computer Vision -- any aspect of software that enables computers to perceive, understand and react to their environment using cameras. For questions related to image filtering and quantification, use the tag [image-processing] instead.

Computer vision enables images, or sequences of images, to be processed by a computer using algorithms. There are many aspects to computer vision, including mathematics, physics (especially optics), imaging hardware, image-processing, signal-processing and machine-learning.

Some basic techniques used in computer vision are:

Image acquisition
Pre-processing
Feature Extraction
Detection/Segmentation
High-Level processing
Decision making
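
As a rough illustration of how the stages listed above can chain together, here is a minimal sketch using only the Python standard library. Everything in it is an assumption made for illustration: the synthetic 6x6 image, the function names, and both thresholds. Real systems would use a camera frame and a library such as OpenCV.

```python
# Toy pass over the six stages using only the standard library.
# The synthetic 6x6 image and both thresholds are illustrative assumptions.

def acquire():
    """Image acquisition: stand-in for a camera frame (bright blob on a dark background)."""
    image = [[10] * 6 for _ in range(6)]
    for row in range(2, 4):
        for col in range(1, 4):
            image[row][col] = 200
    return image

def preprocess(image):
    """Pre-processing: scale 8-bit intensities to [0, 1]."""
    return [[pixel / 255.0 for pixel in row] for row in image]

def extract_features(image):
    """Feature extraction: absolute contrast of each pixel against the image mean."""
    pixels = [p for row in image for p in row]
    mean = sum(pixels) / len(pixels)
    return [[abs(p - mean) for p in row] for row in image]

def segment(features, threshold=0.3):
    """Detection/segmentation: mark pixels whose contrast exceeds the threshold."""
    return [[f > threshold for f in row] for row in features]

def describe(mask):
    """High-level processing: bounding box (top, left, bottom, right) and area of the mask."""
    coords = [(r, c) for r, row in enumerate(mask) for c, on in enumerate(row) if on]
    if not coords:
        return None, 0
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols)), len(coords)

def decide(area, min_area=4):
    """Decision making: report an object only if the region is large enough."""
    return area >= min_area

def run_pipeline():
    """Run all six stages in order and return (bounding box, area, decision)."""
    features = extract_features(preprocess(acquire()))
    bbox, area = describe(segment(features))
    return bbox, area, decide(area)

if __name__ == "__main__":
    print(run_pipeline())
```

Each stage is a separate function, so any one of them can be swapped for a more realistic implementation without touching the others.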

13127 questions

1 answer

Is there a clothing object class in the MS COCO dataset?

I want to work on clothing segmentation, so I need to train the network on a dataset that has a clothing class. Can anyone help? I cannot find anything about the classes at http://mscoco.org/

image-processing neural-network computer-vision deep-learning image-segmentation

asked Feb 27 '17 at 07:51

guoqiang Wei

2 answers

'Resource exhausted' memory error when trying to train a Keras model

I'm trying to train a VGG19 model for a binary image classification problem. My dataset doesn't fit into the memory, so I use batches and the .fit_generator function of the model. However, even when trying to train with batches, I get the following…

python tensorflow computer-vision deep-learning keras

asked Feb 18 '17 at 12:44

Denis Yakovenko

1 answer

TensorFlow: Does each session run initiate a different batch of data in a graph?

That is, if I have a graph like the following: images, labels = load_batch(...) with slim.arg_scope(inception_resnet_v2_arg_scope()): logits, end_points = inception_resnet_v2(images, num_classes = dataset.num_classes, is_training =…

python machine-learning tensorflow computer-vision deep-learning

asked Feb 10 '17 at 12:55

kwotsin

1 answer

How to find the angle formed by blades of a wind turbine when the yaw is changed?

This is a continuation of the question "How to find angle formed by the blades of a wind turbine with respect to a horizontal imaginary axis?" I've decided to use the following methodology: getting a frame from a camera and…

c++ opencv math computer-vision angle

asked Feb 09 '17 at 04:12

AGP

1 answer

Deep learning for face detection in mobile device

I want to create a face detection mobile app, and I want to do it with regular deep learning (a convolutional network). I will train it on my computer and use the trained data in the mobile app. My question is: can I get very fast computation in…

mobile machine-learning neural-network computer-vision deep-learning

asked Feb 04 '17 at 01:17

Fcoder

1 answer

HOG Person Detector: False Positive detections on background subtracted images

I am working on a project which requires detection of people in a scene. Initially, after running the HOG detector on the original frames, a particular background object was being detected as a person in all the frames, giving me 3021 false positive…

opencv image-processing computer-vision detection background-subtraction

asked Feb 03 '17 at 00:04

Nik

5 answers

Faster-RCNN, why don't we just use only RPN for detection?

As we know, Faster-RCNN has two main parts: one is the region proposal network (RPN), and the other is Fast-RCNN. My question is: given that the region proposal network (RPN) can output class scores and bounding boxes and is trainable, why do we need…

machine-learning computer-vision object-detection

asked Feb 01 '17 at 09:32

user3094631

2 answers

Tensorflow: Is preprocessing on TFRecord files faster than real-time data preprocessing?

In Tensorflow, it seems that preprocessing could be done either at training time, when the batch is created from raw images (or data), or when the images are already static. Given that theoretically, the preprocessing should take roughly…

python machine-learning tensorflow computer-vision deep-learning

asked Jan 23 '17 at 12:14

kwotsin

0 answers

How to get the z coordinate (distance) from a Kinect v2 RGB coordinate?

I am trying to convert libfreenect RGB coordinates into DEPTH coordinates, in order to find the z coordinate (distance of object). I tried so many things but I was not able to get the resolutions right. Any ideas? I am using rgbmat in order to apply…

computer-vision kinect openkinect kinect-v2 libfreenect2

asked Jan 13 '17 at 13:13

Engo

4 answers

Drawing contours using cv2.approxPolyDP() in python

I'm unable to draw complete contours of a shape using cv2.approxPolyDP(). I'm getting the following result: But I want output like this: Here's my code: import cv2 im = cv2.imread('C:\Python27\Test\Targets\s1.jpg') # read picture imgray =…

python image opencv computer-vision opencv3.0

asked Jan 10 '17 at 19:27

Vikas Tomar

1 answer

What is the difference between contours and contours[0] when len(contours)=1?

I want to find the contours of an image and then draw its convex hull. I load the image, threshold it, find its contours, and then draw the convex hull. gray = cv2.imread(test_paths[i], 0) ret, thresh = cv2.threshold(gray, 127,…

python opencv computer-vision opencv-contour

asked Jan 10 '17 at 15:07

Manuel Lagunas

1 answer

What to do with negative rho values in hough transform?

Here is my code for creating the hough accumulator for lines in image : void hough_lines_acc(cv::Mat img_a_edges, std::vector > &hough_acc) { for (size_t r = 0; r < img_a_edges.rows; r++) { for (size_t c = 0; c <…
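
One common convention for negative rho values, sketched here in Python rather than the C++ of the question, is to let rho range over [-D, D], where D is the image diagonal, and to shift every rho by D before using it as an accumulator row index. The signature below, the `edge_points` list of (row, col) pairs, and the one-degree theta step are assumptions for illustration; this is not the asker's code and not necessarily how any particular library does it.

```python
import math

def hough_lines_acc(edge_points, height, width, n_theta=180):
    """Build a Hough accumulator for lines, with theta sampled over [0, pi).

    rho = x*cos(theta) + y*sin(theta) can be negative for theta > pi/2,
    so rows of the accumulator are indexed by round(rho) + diag.
    Returns (accumulator, diag); row index i corresponds to rho = i - diag.
    """
    diag = int(math.ceil(math.hypot(height, width)))  # largest possible |rho|
    n_rho = 2 * diag + 1                              # bins for -diag .. +diag
    acc = [[0] * n_theta for _ in range(n_rho)]
    for row, col in edge_points:
        x, y = col, row                               # x is the column, y is the row
        for t in range(n_theta):
            theta = math.pi * t / n_theta
            rho = x * math.cos(theta) + y * math.sin(theta)
            acc[int(round(rho)) + diag][t] += 1       # offset keeps the index >= 0
    return acc, diag

if __name__ == "__main__":
    # Four edge pixels on the line x = y + 4 in an 8x8 image (assumed test data).
    points = [(0, 4), (1, 5), (2, 6), (3, 7)]
    acc, diag = hough_lines_acc(points, 8, 8)
    print(acc[diag - 3][135])  # votes for rho close to -3 at theta = 135 degrees
```

When reading peaks back out, subtracting `diag` from the row index recovers the signed rho.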

c++ opencv computer-vision

asked Jan 05 '17 at 16:01

darth vader

4 answers

Is it possible for either Microsoft Computer Vision API or Google's Cloud Vision API to get a location for objects?

I am trying to develop an application that needs to know the location of tagged objects in an image. Knowing that there is a "piano" in an image is not enough; I need to know where that piano is in the image. Both Microsoft's Computer Vision API…

api computer-vision microsoft-cognitive google-cloud-vision

asked Dec 23 '16 at 20:56

abagshaw

2 answers

Detecting individual boxes in W2 with opencv - python

I've done extensive research and cannot find a combination of techniques that will achieve what I need. I have a situation where I need to perform OCR on hundreds of W2s to extract the data for a reconciliation. The W2s are very poor quality, as…

python python-2.7 opencv computer-vision edge-detection

asked Dec 14 '16 at 23:38

keyoung1

1 answer

Crop image with corrected distortion in OpenCV (Python)

I've been looking for an answer in other posts and have tried several approaches, but I couldn't find anything that solves my problem. I'm trying to correct optical distortion in an image, and that part works, but now I want to crop my image in order to delete…

python opencv computer-vision crop distortion

asked Dec 14 '16 at 18:53

Carlos
