I am using libjpeg (or libjpeg-turbo) for JPEG encoding, and I was wondering whether there is any performance benefit to passing multiple scanlines to jpeg_write_scanlines in a single call instead of one line at a time. I ran some tests on 720x288 images, but I only gain about 0.5% when passing the entire image in one call.
I assume this gain comes only from the reduced function-call overhead, but I expected a bit more, at least with libjpeg-turbo.
The measurements were taken with Callgrind (part of Valgrind), so maybe I missed something. Or perhaps I have misunderstood how a JPEG encoder works.
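For reference, the two call patterns being compared can be sketched roughly as follows. This is a minimal sketch, not the exact benchmark code: `build_row_pointers`, `write_one_by_one`, `write_all_at_once` and the `USE_LIBJPEG` guard are hypothetical names introduced here, and the sketch assumes an 8-bit build where `JSAMPLE` is `unsigned char` and the image is stored as one tightly packed, interleaved buffer. The libjpeg part compiles only when `USE_LIBJPEG` is defined and the library headers are available.

```c
#include <stddef.h>

/* Hypothetical helper (not part of libjpeg): make rows[y] point at the
 * start of scanline y inside a tightly packed, interleaved image buffer. */
static void build_row_pointers(unsigned char *image, size_t width,
                               size_t height, size_t components,
                               unsigned char **rows)
{
    size_t stride = width * components;   /* bytes per scanline */
    for (size_t y = 0; y < height; ++y)
        rows[y] = image + y * stride;
}

#ifdef USE_LIBJPEG
#include <stdio.h>      /* jpeglib.h expects FILE to be declared first */
#include <jpeglib.h>

/* Variant A: hand the encoder exactly one scanline per call.
 * Assumes jpeg_start_compress() has already been called on cinfo. */
void write_one_by_one(j_compress_ptr cinfo, JSAMPARRAY rows)
{
    while (cinfo->next_scanline < cinfo->image_height)
        (void)jpeg_write_scanlines(cinfo, &rows[cinfo->next_scanline], 1);
}

/* Variant B: offer all remaining scanlines in each call. The function may
 * accept fewer lines than offered, so the loop is kept and the progress is
 * read back from cinfo->next_scanline. */
void write_all_at_once(j_compress_ptr cinfo, JSAMPARRAY rows)
{
    while (cinfo->next_scanline < cinfo->image_height)
        (void)jpeg_write_scanlines(cinfo, &rows[cinfo->next_scanline],
                                   cinfo->image_height - cinfo->next_scanline);
}
#endif /* USE_LIBJPEG */
```

For a 720x288 RGB image, `rows` would hold 288 pointers spaced 720 * 3 bytes apart. Both variants feed the encoder the same pixel data; they differ only in how many calls are made.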