79

This is really a two part question, since I don't fully understand how these things work just yet:

My situation: I'm writing a web app which lets the user upload an image. My app then resizes to something displayable (eg: 640x480-ish) and saves the file for use later.

My questions:

  1. Given an arbitrary JPEG file, is it possible to tell what the quality level is, so that I can use that same quality when saving the resized image?
  2. Does this even matter?? Should I be saving all the images at a decent level (eg: 75-80), regardless of the original quality?

I'm not so sure about this because, as I figure it: (let's take an extreme example), if someone had a 5 megapixel image saved at quality 0, it would be blocky as anything. Reducing the image size to 640x480, the blockiness would be smoothed out and barely less noticeable... until I saved it with quality 0 again...

On the other end of the spectrum, if there was an image which was 800x600 with q=0, resizing to 640x480 isn't going to change the fact that it looks like utter crap, so saving with q=80 would be redundant.

Am I even close?

I'm using GD2 library on PHP if that is of any use

nickf
  • 499,078
  • 194
  • 614
  • 709
  • 2
    nickf Could you please accept @DuyLuc's answer which uses identify -format '%Q'? It answers the problem, unlike the currently accepted answer. Just tried it on my files and it works. Thanks! – Alex I May 29 '20 at 08:22

11 Answers11

127

You can view compress level using the identify tool in ImageMagick. Download and installation instructions can be found at the official website.

After you install it, run the following command from the command line:

identify -format '%Q' yourimage.jpg

This will return a value from 0 (low quality, small filesize) to 100 (high quality, large filesize).

Information source

M. Justin
  • 5,160
  • 1
  • 38
  • 65
DuyLuc
  • 1,476
  • 2
  • 10
  • 9
  • 2
    The specific item you're looking for in the output is the Quality: item which appears right before the Properties: section with all the EXIF data. With maximum quality this value will be 100. – Oran Dennison May 31 '14 at 00:18
  • 4
    @OranDennison Your comment was ambiguous enough to make me look for a "Quality" tag in EXIF data. But there is not : "Quality" is a feature of `identify -verbose`'s output. – Skippy le Grand Gourou Nov 10 '16 at 15:19
  • 5
    I don't understand why this answer has not been marked as solution??? It really nails it down without any blabla. – basZero Aug 17 '17 at 13:20
  • @basZero because although **ImageMagick returns a quality assessment number that doesn't mean it's correct.** See http://www.faqs.org/faqs/jpeg-faq/part1/section-5.html from the jpeg creators themselves, and https://photo.stackexchange.com/questions/88167/is-it-possible-to-find-out-what-compression-ratio-was-used-for-a-particular-jpeg – matt wilkie Feb 18 '21 at 23:25
27
  1. JPEG is a lossy format. Every time you save a JPEG same image, regardless of quality level, you will reduce the actual image quality. Therefore even if you did obtain a quality level from the file, you could not maintain that same quality when you save a JPEG again (even at quality=100).

  2. You should save your JPEG at as high a quality as you can afford in terms of file size. Or use a loss-less format such as PNG.

Low quality JPEG files do not simply become more blocky. Instead colour depth is reduced and the detail of sections of the image are removed. You can't rely on lower quality images being blocky and looking ok at smaller sizes.

According to the JFIF spec. the quality number (0-100) is not stored in the image header, although the horizontal and vertical pixel density is stored.

Community
  • 1
  • 1
Ash
  • 57,515
  • 31
  • 146
  • 168
  • 6
    +1, it's true in general that resaving a JPEG will tend to reduce quality, but in some restricted cases this can be worked around -- see http://en.wikipedia.org/wiki/Jpeg#Lossless_editing. E.g. Irfanview has a plugin for lossless JPEG cropping/rotating. – j_random_hacker Jan 08 '10 at 02:33
  • do you know if the same applies to mpeg videos? – johnny Jan 11 '10 at 18:02
  • @johnny, MPEG2 (and I believe MPEG4) also uses lossy compression, so re-encoding multiple times will reduce the quality. Not sure if the compression level is stored in the MPEG headers though. – Ash Jan 12 '10 at 01:43
  • The only places in the compression process where loss actually happens is (potentially) rounding error and the quantization step. If the quatization can be performed identically and the math is done with enough precision, then it is possible to guarantee that the quality does not decrease.The major problem here is that the standard does not precisely define AFAIK how the quantization step is to be performed.We'd have to make certain guesses for files not encoded with a known encoder. There are of course other factors in practice, but for the OP's use case it is at least technically feasible. – Tim Seguine Nov 02 '12 at 14:52
  • While it is technically true that every cycle of re-saving a JPEG file reduces the quality further, the quality loss is minimal after the first save. Saving a RAW image with 80%-90% JPEG quality will cause a easily visible quality loss. However, if you then re-open and re-save the file many times, the additional quality loss is almost invisible (if you stick to the same quality setting). – Peter Oct 29 '16 at 19:08
  • 3
    How can this answer be the solution? It doesn't answer the question at all. – basZero Aug 17 '17 at 13:18
12

For future visitors, checking the quality of a given jpeg, you could just use imagemagick tooling:

$> identify -format '%Q' filename.jpg
   92%
Joe Lencioni
  • 9,595
  • 17
  • 50
  • 65
JAR.JAR.beans
  • 8,075
  • 3
  • 39
  • 52
10

Jpeg compression algorithm has some parameters which influence on the quality of the result image.

One of such parameters are quantization tables which defines how many bits will be used on each coefficient. Different programs use different quatization tables.

Some programs allow user to set quality level 0-100. But there is no common defenition of this number. The image made with Photoshop with 60% quality takes 46 KB, while the image made with GIMP takes only 26 KB.

Quantization tables are also different.

There are other parameters such subsampling, dct method and etc.

So you can't describe all of them by single quality level number and you can't compare quality of jpeg images by single number. But you can create such number like photoshop or gimp which will describe compromiss between size on quality.

More information: http://patrakov.blogspot.com/2008/12/jpeg-quality-is-meaningless-number.html

Common practice is that you resize the image to appropriate size and apply jpeg after that. In this case huge and middle images will have the same size and quality.

ton4eg
  • 1,327
  • 1
  • 10
  • 10
6

As there are already two answers using identify, here's one that also outputs the file name (for scanning multiple files at once):

If you wish to have a simple output of filename: quality for use on multiple images, you can use

identify -format '%f: %Q' *

to show the filename + compression of all files within the current directory.

serv-inc
  • 29,557
  • 9
  • 128
  • 146
4

Here is a formula I've found to work well:

  1. jpg100size (the size it should not exceed in bytes for 98-100% quality) = width*height/1.7

  2. jpgxsize = jpg100size*x (x = percent, e.g. 0.65)

so, you could use these to find out statistically what quality your jpg was last saved at. if you want to get it down to let's say 65% quality and if you want to avoid resampling, you should compare the size initially to make sure it's not already too low, and only then reduce the quality

Raul Popa
  • 51
  • 2
  • 1
    Cool to see that someone else has been playing around with the same idea as me. I'm trying to figure it what is a "good" size for a jpeg (or png) image, when used in our CMS. I've done some statistics on what's already in our database, and I've also been playing around with some figures.. As of now I will start warning users uploading a JPEG-image which is larger than 4K and 32x32 pixels and have an Area:Bytes-ratio <1. – elgholm May 23 '16 at 09:18
  • 2
    Raul, how'd you calculate the `1.7` coefficient? – Hassan Baig Feb 06 '18 at 00:33
  • more like trial and error, but it was very close for a straight linear formula – Raul Popa Sep 01 '19 at 13:27
3

So, there are basically two cases you care about:

  1. If an incoming image has quality set too high, it may take up an inappropriate amount of space. Therefore, you might want, for example, to reduce incoming q=99 to q=85.

  2. If an incoming image has quality set too low, it might be a waste of space to raise it's quality. Except that an image that's had a large amount of data discarded won't magically take up more space when the quality is raised -- blocky images will compress very nicely even at high quality settings. So, in my opinion it's perfectly OK to raise incoming q=1 to q=85.

From this I would think simply forcing a decent quality setting is a perfectly acceptable thing to do.

Eric Seppanen
  • 5,653
  • 26
  • 24
  • 3
    I realize this is a rather old thread, but your second point isn't correct. [Compressed images will often eat more space when re-compressed](http://stackoverflow.com/a/12557675/1096794). – Ben D Sep 24 '12 at 01:32
  • @BenD If you increase the quality between compression you don't magically get frequency components which weren't in there in the first compressed file. If this happens in practice, then the compressor is not being very intelligent, since the data is a priori already appropriately quantized. I think the issue here is that the standard allows for small differences in the decoded image, which then has a potentially different DCT the second time. With an appropriately coded jpeg library, the point #2 should be possible in theory AND in practice. – Tim Seguine Nov 02 '12 at 15:19
  • 1
    @Tim I may be misunderstanding your critique, but I believe that you're wrong on this point. Because JPEG compressions often result in noise, and the resulting JPEG does not store it's current compression level, secondary compressions (i.e Original -> Quality=60 ->Quality=80) will often be larger than the primary compression. I've run this test using both GD and desktop compressors (see my original link)... Because the compressor cannot know the image's current compression state it cannot be intelligent when running subsequent compressions. If you know of a way around this I'd love to know. – Ben D Nov 03 '12 at 18:14
  • As a test, try compressing an image to very low quality (say, q=10). The result will be mess (blurry blocks, etc), but it will have a small file size. Now try recompressing your new JPG to q=90... the new compression will result in a substantially larger file with worse quality, because it is trying to faithfully preserve all the new artifacts. I'm sure you could get around this with a simple `if filesize(original) – Ben D Nov 03 '12 at 18:38
  • 1
    @BenD The biggest problem here is the quantization step in the compression. Unless you know intimate details about the compressor used, it might be impossible to guess the tradeoffs made during quantization. Also since the standard allows for a 1 bit difference per block IIRC during decompression, and the DCT isn't required to be done with perfect rounding accuracy means that you are mostly right. But these are pretty much practical problems only. If you decompress a jpeg with accuracy up to the machine epsilon and then perform the DCT again without quantization, – Tim Seguine Nov 05 '12 at 19:55
  • the resulting coefficents should be almost identical to the ones in the compressed file. Again all I really meant is that the coefficients shouldn't change in the absence of other factors. I don't know of a specific way of getting such results, because I think all encoders preform quantization and not necessarily the same way as the first encoder. – Tim Seguine Nov 05 '12 at 19:58
  • 1
    This is a fascinating discussion. I almost don't mind the fact that it means my original answer is wrong. :) – Eric Seppanen Nov 08 '12 at 01:03
2

Every new save of the file will further decrease overall quality, by using higher quality values you will preserve more of image. Regardless of what original image quality was.

Alexander Taran
  • 6,549
  • 2
  • 37
  • 59
2

If you resave a JPEG using the same software that created it originally, using the same settings, you'll find that the damage is minimized - the algorithm will tend to throw out the same information it threw out the first time. I don't think there's any way to know what level was selected just by looking at the file; even if you could, different software almost guarantees different parameters and rounding, making a match almost impossible.

Mark Ransom
  • 271,357
  • 39
  • 345
  • 578
2

This may be a silly question, but why would you be concerned about micromanaging the quality of the document? I believe if you use ImageMagick to do the conversion, it will manage the quality of the JPEG for you for best effect. http://www.php.net/manual/en/intro.imagick.php

bryanjonker
  • 3,231
  • 3
  • 21
  • 37
  • I sometimes want to make sure an artist actually delivered an uncompressed image (several times they have accidentally given me a compressed JPEG). – David Dunham Feb 25 '16 at 18:59
0

Here are some ways to achieve your (1) and get it right.

  1. There are ways to do this by fitting to the quantization tables. Sherloq - for example - does this:

    https://github.com/GuidoBartoli/sherloq

    The relevant (python) code is at https://github.com/GuidoBartoli/sherloq/blob/master/gui/quality.py

  2. There is another algorithm written up in https://arxiv.org/abs/1802.00992 - you might consider contacting the author for any code etc.

  3. You can also simulate file_size(image_dimensions,quality_level) and then invert that function/lookup table to get quality_level(image_dimensions,file_size). Hey presto!

  4. Finally, you can adopt a brute-force https://en.wikipedia.org/wiki/Error_level_analysis approach by calculating the difference between the original image and recompressed versions each saved at a different quality level. The quality level of the original is roughly the one for which the difference is minimized. Seems to work reasonably well (but is linear in the for-loop..).

Most often the quality factor used seems to be 75 or 95 which might help you to get to the result faster. Probably no-one would save a JPEG at 100. Probably no-one would usefully save it at < 60 either.

I can add other links for this as they become available - please put them in the comments.

jtlz2
  • 4,913
  • 6
  • 43
  • 84