搜索 | 用户支持

防范以用户支持为名的诈骗。我们绝对不会要求您拨打电话或发送短信,及提供任何个人信息。请使用“举报滥用”选项报告涉及违规的行为。

Learn More

Does Firefox Automatically perform OCR on PDF Documents?

more options

My bank delivers monthly statements as rasterized copies of their paper statements. They are clearly pixelated and not text. However, when I open one of these PDFs in Firefox I am able to select the rasterized text, as you can see from the attached screenshot clip.

How is this possible?

My bank delivers monthly statements as rasterized copies of their paper statements. They are clearly pixelated and not text. However, when I open one of these PDFs in Firefox I am able to select the rasterized text, as you can see from the attached screenshot clip. How is this possible?
已附加屏幕截图

所有回复 (9)

more options

I assume that your bank actually sends real PDF files. If you use Print then in some cases Firefox converts the page to an image.

more options

I would have assumed the same thing except that Sumatra won't t allow me to highlight and copy text and Acrobat will select it but won't copy it. Firefox allows both.

more options

Also I've never seen a pixelated PDF that still contains text. Will wonders never cease?!

由Helmanfrow于修改

more options

If the PDF consists purely of a series of full-page images, unfortunately, Firefox's PDF viewer doesn't have the ability to OCR it.

I suspect your bank applied "security" to the PDF to prevent certain actions, such as copying, editing, and/or printing. (https://helpx.adobe.com/acrobat/how-to/password-protect-pdf.html)

Firefox's PDF viewer is based on the pdf.js JavaScript library, which ignores these "security" restrictions by default. It is a bit of an annoyance to people who create the PDFs, but Mozilla doesn't seem inclined to enforce the restrictions in Firefox.

more options

jscher2000 - Support Volunteer said

I suspect your bank applied "security" to the PDF to prevent certain actions, such as copying, editing, and/or printing. (https://helpx.adobe.com/acrobat/how-to/password-protect-pdf.html)

Yes, I did a little more digging and that's apparently what it is. The document is protected from editing and apparently this can sometimes present text as pixelated images.

由Helmanfrow于修改

more options

jscher2000 - Support Volunteer said

I suspect your bank applied "security" to the PDF to prevent certain actions, such as copying, editing, and/or printing. (https://helpx.adobe.com/acrobat/how-to/password-protect-pdf.html)

Yes, the document is password-protected so that's probably it.

more options

By the way, when you select text in Firefox's PDF viewer, you are selecting a transparent layer of text positioned in front of the page image.

more options

It's funny that "security" can be partially bypassed by simply ignoring it in code.

more options

Helmanfrow said

It's funny that "security" can be partially bypassed by simply ignoring it in code.

Once upon a time, basing "security" on the honor system actually worked, I guess.