Question

This thread was archived. Please ask a new question if you need help.

Why Firefox pdf reader can't find or copy russian symbols across file?

6 replies
1 has this problem
144 views
Last reply by jscher2000 - Support Volunteer

3 years ago

8/19/20, 10:44 PM

So, I try to migrate from Chrome. I open pdf in Chrome, select some text "альтаир" and now I have exactly "альтаир" in buffer (clipboard). Then find "альтаир". So ok, I finded all "альтаир". I open pdf in Firefox, select some text "альтаир" and there is problem with coding Î ̧Ú‡Ë . So, of course, if I write "альтаир" in find window - it will not work. WAIDW ? Thanks!

Attached screenshots

Chosen solution

Hopefully that doesn't affect very many documents. Not sure there's any way of knowing.

👍 1

Answer 1 · 2020-08-19 22:44:08

jscher2000 - Support Volunteer

Top 10 Contributor

8/20/20, 1:32 AM

Hi Orbb, can you share a link to a PDF that has this problem?

Note: This forum diverts posts containing a URL to a link moderation queue, so it's normal that there is a delay of several minutes before your reply appears if you include a URL.

Answer 2 · 2020-08-19 22:44:08

Orbb Question owner

8/20/20, 2:54 AM

jscher2000 said

Hi Orbb, can you share a link to a PDF that has this problem? Note: This forum diverts posts containing a URL to a link moderation queue, so it's normal that there is a delay of several minutes before your reply appears if you include a URL.

https://krasheninin.tech/books/charles-petzold-code.pdf

Answer 3 · 2020-08-19 22:44:08

jscher2000 - Support Volunteer

Top 10 Contributor

8/20/20, 3:22 AM

I am attaching comparison screenshots between that PDF and another one that popped up in a search. This shows the HTML of the transparent text layer used for searching and selection (and the nonsense characters).

I suspect the problem in your example is that the document that was converted to PDF did not use Unicode encoding but instead used one of the older methods of character substitution that were common before Unicode became standardized and widespread. But I haven't dug into the PDF in detail.

When you are viewing a web page, Firefox allows changing among character encodings to see which one works best, but this isn't an option for PDFs (grayed out).

You could test the latest version of the PDF.js viewer by saving that PDF and loading it into the web app version here:

https://mozilla.github.io/pdf.js/web/viewer.html

(Look for the file folder icon on the viewer's toolbar.)

If the same problem occurs, then this bug probably needs to be fixed "upstream" in the PDF.js project. You can submit an issue here:

https://github.com/mozilla/pdf.js/issues/

If the problem does not occur in the web app version then either a bug fix will arrive in Firefox eventually, or there is a separate bug in how the viewer is implemented in Firefox.

Modified August 20, 2020, 3:23:11 AM PDT by jscher2000 - Support Volunteer

Answer 4 · 2020-08-19 22:44:08

cor-el

Moderator
Top 10 Contributor

8/20/20, 3:37 AM

Yes, it doesn't find the 'Altair' text, but I can find the '8800' text.

Answer 5 · 2020-08-19 22:44:08

Orbb Question owner

8/20/20, 6:51 AM

jscher2000 said

the document that was converted to PDF did not use Unicode encoding

But Chrome can handle such bad files. That's why I thought I was doing something wrong.

jscher2000 said

You could test the latest version of the PDF.js viewer by saving that PDF and loading it into the web app version here: https://mozilla.github.io/pdf.js/web/viewer.html

I can, but the result is the same obviously. Thus, if I correctly understood, the short answer is "it just doesn't work" . Thank you for you time!

Answer 6 · 2020-08-19 22:44:08

jscher2000 - Support Volunteer

Top 10 Contributor

8/20/20, 9:20 AM

Chosen Solution

Hopefully that doesn't affect very many documents. Not sure there's any way of knowing.

Search Support

Why Firefox pdf reader can't find or copy russian symbols across file?

Chosen solution

All Replies (6)

Chosen Solution

Ask a Question

Explore Our Help Articles

Mozilla Account

Search Support

Why Firefox pdf reader can't find or copy russian symbols across file?

Chosen solution

All Replies (6)

Chosen Solution