Extracting text from PDF

Rolf Kutz rkutz at mst-aerospace.de
Tue Jul 3 06:08:39 PDT 2007


Kurt Pfeifle schrieb:
>>> Most likely, your PDF contains what text you see on screen (or on
>>> paper, when printed) only in the form of bitmaps, not proper fonts...
>> How can I check this?
> 
> In acroread or in kpdf look for the menu entry where you can look at the document properties. There you should see a tab which allows you to check for the fonts.
> 
> See if the fonts are there, and what kind of names they have.

There are fonts there:

QPHYDB+Nimbus_Roman_No9_L.Regular.0.0.Set0
RCISND+Numbus_Sans_L.Bold.0.0.Set0
VKJNGT+Nimbus_Sans_L.Regular.0.0.Set0

> That said, this problem ("a bitmap font was used") usually does not appear with Firefox. Helge's guess about the root of the problem may be a much better one.
> 
> If you use your firefox to "print to file" your job, please upload the resulting PostScript. I/we can then try to convert with a CUPS/pstops + Ghostscript commandline chain (using different versions of Ghostscript and parameter variations) to see if we find one which does not show your problem....

I will post an URL later.

regards, Rolf




More information about the cups mailing list