Questions tagged [computer-vision]

Use this tag for questions related to Computer Vision -- any aspect of software that enables computers to perceive, understand and react to their environment using cameras. For questions related to image filtering and quantification, use the tag [image-processing] instead.

Computer vision enables images, or sequences of images, to be processed by a computer using algorithms. There are many aspects to computer vision, including mathematics, physics (especially optics), imaging hardware, image-processing, signal-processing and machine-learning.

Some basic techniques used in computer vision are:

Image acquisition
Pre-processing
Feature Extraction
Detection/Segmentation
High-Level processing
Decision making

13127 questions

votes

0 answers

OpenNI Intrinsic and Extrinsic calibration

How would one extract the components of the intrinsic and extrinsic calibration parameters from OpenNI for a device such as the PrimeSense? After some searching I only seem to find how to do it through ROS, but it is not clear how it would be done…

computer-vision openni 3d-reconstruction

asked Dec 12 '16 at 22:31

Jack H

2,116
3
32
52

votes

2 answers

What is the relationship between color space RGB, XYZ and the color matching function?

What is the relationship between color spaces (RGB, XYZ) and the color matching function? Let's say we have some color matching function in the color space XYZ (3 row matrix). We also have the transformation matrix which translates from XYZ…

colors computer-vision rgb vision

asked Dec 02 '16 at 18:04

RebeccaK375

votes

2 answers

Python OpenCV pure white background

I have a DIY home photo studio to take photos. I want them to be on pure white background (#FFFFFF) with minimum user interaction This is original picture i have (no any processing, just raw JPG from camera) I implement simple python program with…

python opencv computer-vision

asked Nov 29 '16 at 19:58

striker

1,223
3
15
25

votes

1 answer

What is the definition of "high-capacity cnn" or "high-capacity architecture"?

I found the phrase "high-capacity cnn" in these two papers: 1.Rich feature hierarchies for accurate object detection and semantic segmentation 2.Region-based Convolutional Networks for Accurate Object Detection and Segmentation I've searched it up…

machine-learning computer-vision deep-learning

asked Oct 31 '16 at 06:29

Shieh Kai Yong

votes

2 answers

Keras imageGenerator Exception: output of generator should be a tuple (x, y, sample_weight) or (x, y). Found: None

I'm currently trying to follow the example here using a dataset I generated by myself. The back end is run using Theano. The directory structure is exactly the same: image_sets/ dogs/ dog001.jpg dog002.jpg ... cats/ …

machine-learning computer-vision theano keras

asked Oct 17 '16 at 18:05

Andros Wong

votes

2 answers

Issues with shaping Tensorflow/TFLearn inputs/outputs for images

To learn more about deep learning and computer vision, I'm working on a project to perform lane-detection on roads. I'm using TFLearn as a wrapper around Tensorflow. Background The training inputs are images of roads (each image represented as a…

python machine-learning computer-vision neural-network tensorflow

asked Oct 15 '16 at 21:30

Janum Trivedi

1,530
2
13
24

votes

0 answers

How do I minimize global error across multiple image homographies?

I am stitching together multiple images with arbitrary 3D views of a planar surface. I have some estimation of which images overlap and a coarse estimate of each pairwise homography between pairs of overlapping images. However, I need to refine my…

computer-vision homography image-stitching levenberg-marquardt

asked Oct 06 '16 at 04:31

user2446426

votes

2 answers

Recognition of an animal in pictures

I am facing a challenging problem. On the courtyard of company I am working is a camera trap which takes a photo of every movement. On some of these pictures there are different kinds of animals (mostly deep gray mice) that cause damages to our…

computer-vision real-time video-processing image-recognition

asked Oct 05 '16 at 15:22

Lukas Hruby

votes

2 answers

How to calculate % score from ORB algorithm?

I am using the ORB algorithm of OpenCV 2.4.9 with Python to compare images. The ORB algorithm does not return the similarity score as a percentage. Is there any way to do this? My code to compare images using ORB is as follows img1 =…

python opencv image-processing computer-vision orb

asked Sep 16 '16 at 09:25

user93

1,776
1
23
43

votes

1 answer

Dlib frontal face detection for small faces

I am using Dlib's frontal face detector to detect faces in an images; however, it cannot detect faces smaller than 80 by 80 pixels. Dlib's example in face_detection_ex.cpp upsamples the input image using pyramid_up() to increase the face sizes.…

computer-vision face-detection dlib

asked Sep 09 '16 at 19:57

mhaghighat

1,073
16
27

votes

1 answer

What is the order of mean values in Caffe's train.prototxt?

In my Caffe 'train.prototxt' I'm doing some input data transformation, like this: transform_param { mirror: true crop_size: 321 mean_value: 104 # Red ? mean_value: 116 # Blue ? mean_value: 122 # Green ? } Now I want to…

machine-learning computer-vision neural-network deep-learning caffe

asked Sep 03 '16 at 11:01

mcExchange

4,998
12
43
81

votes

2 answers

2D Coordinate to 3D world coordinate

I want to convert 2D Image coordinates to 3D world coordinates. I am using the ZED camera which is a stereo camera and the sdk shipped with it, provides the disparity map. Hence I have depth. The two cameras are parallel to each other. Although this…

opencv computer-vision

asked Aug 12 '16 at 04:02

Shashwat Verma

votes

1 answer

Outermost contour extraction from silhouette

I need to retrieve the outermost contour of several silhouettes, possibly storing the contour coordinates in a clockwise or counterclockwise order. From what I've read, this kind of result can be archived by using OpenCV's Canny + findContours.…

c++ opencv computer-vision

asked Aug 01 '16 at 11:51

Izzy88

votes

2 answers

Distance Transform in OpenCV Python automatically converting CV_8UC3 to CV_32SC1 creating an assertion error

I am trying to apply the WaterShed algorithm to an image as per the tutorial: OpenCv WaterShed Docs . I have earlier applied Otsu's thresholding after Gaussian filtering and Morpholigical Transformations on a greyscale image to improve Image quality…

python opencv computer-vision opencv3.0 watershed

asked Jul 07 '16 at 09:07

Anindita Bhowmik

votes

2 answers

How to flip only one axis of transformation matrix?

I have a 4x4 transformation matrix. However, after trying out the transformation I noticed that movement and rotation of the Y axis is going the opposite way. The rest is correct. I got this matrix from some other API so probably it is the…

opengl graphics computer-vision

asked Jun 29 '16 at 10:16

5argon

2,697
3
23
50

Prev 1 2 3

…

100 Next