Image object not recognized #3336
Unanswered
flange-ipb asked this question in Q&A
Replies: 1 comment
-
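This specific page references only one actual image object. Figure 3 is not an image: it is drawn with plain vector drawing commands in the page's content stream, so it does not appear in PageObject.images. To extract it, you would have to render the page to a raster image, which is out of scope for pypdf.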
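To illustrate the difference, here is a minimal sketch that counts XObject invocations versus path-painting operators in a content stream. It assumes you already have the decompressed stream as bytes; the function name summarize_operators and the sample stream are hypothetical, and the whitespace tokenizer is deliberately naive.

```python
# Path-painting operators defined by the PDF content-stream syntax
# (stroke, fill, and combined fill+stroke variants).
PATH_PAINT_OPS = {b"S", b"s", b"f", b"F", b"f*", b"B", b"B*", b"b", b"b*"}


def summarize_operators(content: bytes) -> dict:
    """Count "Do" invocations and path-painting operators.

    Naive sketch: splits on whitespace only, so string literals and
    inline images are not handled. Note that "Do" paints any XObject,
    which may be a form XObject rather than an image.
    """
    tokens = content.split()
    return {
        "xobject_do": sum(1 for t in tokens if t == b"Do"),
        "path_paint": sum(1 for t in tokens if t in PATH_PAINT_OPS),
    }


# Hypothetical stream: a filled rectangle, a stroked line, one XObject.
sample = b"0 0 100 50 re f 10 10 m 90 40 l S q 1 0 0 1 0 0 cm /Im1 Do Q"
print(summarize_operators(sample))
```

A page whose figure shows many path-painting operators and no matching "Do" is most likely drawn as vector graphics. With pypdf, the decoded bytes can usually be obtained via page.get_contents().get_data(), but check that get_contents() did not return None first.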
-
I'm extracting images from scientific papers. For this PDF I'm having trouble extracting Fig. 3 on page 10: this image object is not included in PageObject.images. I have the same issue in PyMuPDF, see PyMuPDF#4577.
Environment
Python version: 3.13.3
Code + PDF
Extracts all images from the given document:
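A minimal sketch of such a script, assuming pypdf's PdfReader and PageObject.images. The names extract_images and output_name and the file-naming scheme are hypothetical, and this is not necessarily the exact code from the question.

```python
from pathlib import Path


def output_name(page_number: int, index: int, image_name: str) -> str:
    """Build a flat file name from the page number, index, and image name."""
    safe = image_name.replace("/", "_")
    return f"page{page_number:03d}_{index:02d}_{safe}"


def extract_images(pdf_path, out_dir):
    """Write every image that pypdf reports for each page into out_dir."""
    # Third-party dependency (pip install pypdf); imported here so the
    # naming helper above stays usable without it.
    from pypdf import PdfReader

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    reader = PdfReader(pdf_path)
    for page_number, page in enumerate(reader.pages, start=1):
        # page.images only lists image XObjects; figures drawn with
        # vector commands will not show up here.
        for index, image in enumerate(page.images):
            target = out / output_name(page_number, index, image.name)
            target.write_bytes(image.data)
            written.append(target)
    return written
```

Calling extract_images("paper.pdf", "images") with a hypothetical file name returns the list of written paths, which makes it easy to see which pages yielded no images.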
The PDF in question can be found here. I am not the author of this document. It is published under CC-BY 4.0 and the license terms are included in the document. This license is not viral, so I think it's legal to include it in your test dataset.
Traceback
No traceback