Summary: | ps2ascii fails on PDF generated by Adobe InDesign | ||
---|---|---|---|
Product: | Ghostscript | Reporter: | Jason Rhinelander <ghostscript> |
Component: | PDF Interpreter | Assignee: | Alex Cherepanov <alex> |
Status: | NOTIFIED FIXED | ||
Severity: | normal | ||
Priority: | P2 | ||
Version: | 8.14 | ||
Hardware: | PC | ||
OS: | Linux | ||
Customer: | Word Size: | --- | |
Attachments: |
Simple PDF causing problem
patch |
Description
Jason Rhinelander
2004-05-14 15:47:10 UTC
Created attachment 666 [details]
Simple PDF causing problem
Created attachment 667 [details]
patch
There's no way to recover ASCII from the strings encofed for a
CID font. The patch attached fixes the PostScript error but generates
wrong results. It just dumps the strings in the unmodified encoding.
Extraction of text from PDF should be done before conversion to PostScript
using /ToUnicode CMap. The latter is an enhancement request, not a bug.
We should apply the patch and close the bug, but open a new enhancement request for the Unicode mode. The patch is committed to head branch. An enhancement request (bug 687492) was creates to track the development of ps2ascii utility. There are 2 issues here: (1) Decode source strings with well-known CMap files into Unicode or ASCII when possible. (2) Use ToUnicode CMap if possible, but first we need to pass it from PDF to PostScript level (bug 685335). |