I'm using Tesseract but I don't know whether it neglects any nontext area and targets text only. Do I have to remove any nontext area as a preprocessing step for better output?
Asked
Active
Viewed 2,573 times
1 Answers
2
Tesseract has a pretty good algorithm to detect text, but it will eventually give false-positive matches.
Ideally, you would pre-process the image before submitting it to tesseract. Some time ago I engaged in a similar task, so I suggest you take a look at the following material:
![](../../users/profiles/-1.webp)
Community
- 1
- 1
![](../../users/profiles/176769.webp)
karlphillip
- 87,606
- 33
- 227
- 395