Bug 702092

Summary: opj_tcd_decode_tile() allocates wastefully compared to luratech.
Product: Ghostscript Reporter: Julian Smith <julian.smith>
Component: JPX/JBIG2 encode/decodeAssignee: Default assignee <ghostpdl-bugs>
Status: RESOLVED DUPLICATE    
Severity: normal    
Priority: P4    
Version: master   
Hardware: All   
OS: All   
Customer: Word Size: ---

Description Julian Smith 2020-02-05 15:00:37 UTC
Memory use of this command is much larger for normal builds than luratech builds:

    gs -sOutputFile=/dev/null -dBATCH -dNOPAUSE -r300 -sDEVICE=ppmraw ../tests_private/comparefiles/Bug694621.pdf

In a normal build, this has maximum heap usage (from valgrind --tool=massif) of 150MB, of which 109,511,619B is from opj_tcd_decode_tile() where it allocates a UINT32 for each pixel.

Luratech's maximum heap usage for the same input file is 40MB. It looks like luratech's equivalent to opj_tcd_decode_tile() is s_jpxd_process() which allocates 27,377,904B, almost exactly a quarter of normal build.

Luratech's code does not default to 32 bits per pixel. Instead it appears to explicitly look at the colorspace and allocate accordingly, hence reducing memory use significantly. The callback s_jpxd_write_data() uses colorspace when generating the output data.

So it might be worth improving opj_tcd_decode_tile() to use a similar technique to luratech, in order to avoid the excessive memory use for this sort of input file.
Comment 1 Julian Smith 2020-02-07 18:08:37 UTC

*** This bug has been marked as a duplicate of bug 702096 ***