Questions tagged [apple-vision]

Apple Vision is a high-level computer vision framework used to identify faces, detect and track features, and classify images, video, tabular data, audio, and motion sensor data.

The Apple Vision framework performs face and face-landmark detection on input images and video, barcode recognition, image registration, text detection, and feature tracking. The Vision API also allows the use of custom Core ML models for tasks like classification or object detection.
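As a rough illustration of the API shape described above, here is a minimal face-detection sketch. It assumes a `CGImage` obtained elsewhere (e.g. from `UIImage.cgImage`) and requires an Apple platform to run:

```swift
import Vision

// Minimal sketch: run face-rectangle detection on a CGImage and
// return the resulting observations (normalized bounding boxes).
func detectFaces(in cgImage: CGImage) throws -> [VNFaceObservation] {
    let request = VNDetectFaceRectanglesRequest()
    let handler = VNImageRequestHandler(cgImage: cgImage, options: [:])
    try handler.perform([request])
    return request.results ?? []
}
```

The same handler-plus-request pattern applies to the other request types (text, barcode, rectangle detection); only the `VNRequest` subclass changes.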

145 questions
87
votes
2 answers

iOS revert camera projection

I'm trying to estimate my device's position relative to a QR code in space. I'm using ARKit and the Vision framework, both introduced in iOS 11, but the answer to this question probably doesn't depend on them. With the Vision framework, I'm able to get…
Guig
  • 8,612
  • 5
  • 47
  • 99
57
votes
8 answers

Converting a Vision VNTextObservation to a String

I'm looking through Apple's Vision API documentation and I see a couple of classes that relate to text detection in UIImages: 1) class VNDetectTextRectanglesRequest 2) class VNTextObservation. It looks like they can detect characters, but I don't…
Adrian
  • 14,925
  • 16
  • 92
  • 163
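The question above predates Vision's built-in text recognition; on iOS 13 and later, the usual way to get actual strings (rather than just `VNTextObservation` rectangles) is `VNRecognizeTextRequest`. A minimal sketch, assuming a `CGImage` input and an Apple platform:

```swift
import Vision

// Minimal sketch (iOS 13+ / macOS 10.15+): run text recognition and
// collect the top candidate string from each recognized observation.
func recognizeText(in cgImage: CGImage) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate
    try VNImageRequestHandler(cgImage: cgImage, options: [:]).perform([request])
    let observations = request.results ?? []
    return observations.compactMap { $0.topCandidates(1).first?.string }
}
```

`topCandidates(_:)` returns recognition candidates ordered by confidence, so taking the first gives the most likely transcription per observation.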
23
votes
3 answers

Apple Vision framework – Text extraction from image

I am using the Vision framework in iOS 11 to detect text in an image. The text is detected successfully, but how can we get the detected text?
Abhishek
  • 4,857
  • 5
  • 20
  • 25
11
votes
3 answers

Classify faces from VNFaceObservation

I'm working with the Vision framework to detect faces and objects in multiple images, and it works fantastically. But I have a question that I can't find in the documentation. The Photos app on iOS classifies faces, and you can tap on a face to show all the images…
mhergon
  • 1,488
  • 1
  • 16
  • 35
10
votes
3 answers

Apple Vision image recognition

As many other developers, I have plunged myself into Apple's new ARKit technology. It's great. For a specific project however, I would like to be able to recognise (real-life) images in the scene, to either project something on it (just like…
9
votes
3 answers

ARKit and Vision frameworks for Object Recognition

I would really like some guidance on combining Apple's new Vision API with ARKit in a way that enables object recognition. This would not need to track the moving object, just recognize it at a stable position in 3D space for the AR experience to react…
cnzac
  • 415
  • 2
  • 12
8
votes
3 answers

Convert VNRectangleObservation points to other coordinate system

I need to convert the CGPoints received from a VNRectangleObservation (bottomLeft, bottomRight, topLeft, topRight) to another coordinate system (e.g. a view's coordinates on screen). I define a request: // Rectangle Request let…
mihaicris
  • 151
  • 2
  • 8
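The core difficulty in the question above is that Vision reports normalized coordinates with the origin at the bottom-left, while UIKit views put the origin at the top-left. A minimal sketch of the conversion, assuming the image fills the view exactly (no aspect-fit letterboxing):

```swift
import Foundation

// Convert a Vision normalized rect (origin bottom-left, values 0...1)
// into a view's coordinate space (origin top-left).
func convertToViewRect(_ normalized: CGRect, viewSize: CGSize) -> CGRect {
    let x = normalized.origin.x * viewSize.width
    let w = normalized.size.width * viewSize.width
    let h = normalized.size.height * viewSize.height
    // Flip the y-axis: Vision's origin is bottom-left, UIKit's is top-left.
    let y = (1 - normalized.origin.y - normalized.size.height) * viewSize.height
    return CGRect(x: x, y: y, width: w, height: h)
}
```

On Apple platforms, `VNImageRectForNormalizedRect` performs the scaling part of this for image coordinates, but the y-flip into a top-left view space still has to be done by hand.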
8
votes
2 answers

VNTrackRectangleRequest internal error

I'm trying to get a simple rectangle-tracking controller going, and I can get rectangle detection working just fine, but the tracking request always ends up failing for a reason I can't quite find. Sometimes the tracking request will fire its…
Andy Heard
  • 1,462
  • 14
  • 24
8
votes
3 answers

Vision Framework Barcode detection for iOS 11

I've been implementing a test of the new Vision framework which Apple introduced at WWDC 2017. I am specifically looking at barcode detection: after scanning an image from the camera or gallery, I've been able to determine whether or not it's a barcode image.…
Hitesh Arora
  • 81
  • 1
  • 4
7
votes
2 answers

How can I tell which languages are available for text recognition in Apple's Vision framework?

I'm trying to add an option to my app to allow for different languages when using Apple's Vision framework for recognising text. There seems to be a function for programmatically returning the supported languages, but I'm not sure if I'm calling it…
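For reference, on iOS 15 / macOS 12 and later, `VNRecognizeTextRequest` exposes an instance method for this query (earlier releases used the class method `supportedRecognitionLanguages(for:revision:)`). A minimal sketch, runnable only on an Apple platform:

```swift
import Vision

// Minimal sketch: ask a configured VNRecognizeTextRequest which
// language codes it supports at its current recognition level.
func availableRecognitionLanguages() -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate
    return (try? request.supportedRecognitionLanguages()) ?? []
}
```

The returned list depends on the recognition level and request revision, so querying the same configured request you intend to perform gives the most accurate answer.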
7
votes
1 answer

Apple Vision – Can't recognize a single number as region

I want to use VNDetectTextRectanglesRequest from the Vision framework to detect regions in an image containing only one character, the number '9', on a white background. I'm using the following code to do this: private func performTextDetection() { …
AndrzejZ
  • 215
  • 2
  • 8
6
votes
0 answers

How to convert BoundingBox from VNRequest to CVPixelBuffer Coordinate

I'm trying to crop a CVImageBuffer (from AVCaptureOutput) using the boundingBox of a face detected by Vision (VNRequest). When I draw over the AVCaptureVideoPreviewLayer using: let origin = previewLayer.layerPointConverted(fromCaptureDevicePoint:…
Alak
  • 1,187
  • 2
  • 10
  • 17
6
votes
1 answer

ARKit – sceneView renders its content at 120 fps (but I need 30 fps)

I'm developing an ARKit app along with the Vision/AVKit frameworks. My app recognizes hand gestures ("Victory", "Okey", "Fist") for controlling a video, so I'm using an MLModel for classification of my hand gestures. The app works fine, but the view's content…
Andy Fedoroff
  • 26,838
  • 8
  • 85
  • 144
6
votes
1 answer

ARKit & Vision frameworks – Detecting wall edges

I wonder whether it's theoretically possible to detect wall edges/lines (like in the picture). All I could achieve is detecting the vertices of rectangles that are visible in the camera preview. But we can't consider real walls as rectangles. So, is there…
arturdev
  • 10,228
  • 2
  • 34
  • 62
6
votes
1 answer

How can I take a photo of a detected rectangle in Apple Vision framework

How can I take a photo (get a CIImage) from a successful VNRectangleObservation object? I have a video capture session running, and in func captureOutput(_ output: AVCaptureOutput, didOutput sampleBuffer: CMSampleBuffer, from connection:…
denis631
  • 1,596
  • 1
  • 14
  • 35