Getting the coordinates of a “text” bounding box as a gray image using command line in linux

Question

Getting the coordinates of a “text” bounding box as a gray image using command line in linux

What the title says.

Strictly speaking, what I define as a “text” bounding box for a gray image is a set of 4 coordinates (x, y, x + width, y + height) that should define the area of the rectangle in this image that has the maximum the number of non-white pixels and at the same time the smallest possible number of white pixels (excluding the maximum number of non-white pixels). I have quoted text, as the images do not actually contain text, because the images contain only pixels with colors.

By installing ImageMagick in my Ubuntu and typing the command: in the terminal $convert input.png -trim ouput.png, I get:

Open the two images in new tabs in your web browser, and you will understand their difference, and you will also understand what I define as a bounding box. Output.png has actually the width and height that I am looking for. I do not know how to get the x and y coordinates.

The answer presented here (1) for pdf pages does not meet my criteria, since the "text" bounding box that gs gives me has large white margins (and as far as I can understand, what gs defines as " the text "bounding box for pdf is something different from my definition of the bounding box" text "for an image).

+4

linux bash imagemagick ghostscript superuser

liaguridio 27 . '15 7:08

2

" ", , , .

PDF , , . , , .

" " " , ". , , . , . , , , , OCR, , OCR .

0

KenS 27 . '15 8:26

Mark Setchell · Accepted Answer · 2015-09-27T08:47:46+0000

, , , , -trim , :

identify -format "%@" image.png
200x100+10+20

,

identify -format "%@" paper.png
406x620+38+68

, 38 68 , 406 620 .

, :

convert paper.png -stroke red -fill none -draw "rectangle 38,68 444,688" result.png

, convert identify:

convert -format %@ paper.png info:
406x620+38+68

Getting the coordinates of a “text” bounding box as a gray image using command line in linux

More articles: