Table image extraction for reference #755

rajuptvs · 2025-01-15T23:20:54Z

Requested feature

Background

Currently, the system supports referencing images using URIs in markdown formatting, which has proven valuable for many data pipeline implementations. For example:

Proposed Enhancement

I propose extending this URI reference functionality to table images as well. This addition would provide more flexibility in document handling, particularly in cases where the generated markdown tables may not be correct.

Technical Implementation

I've already prototyped a similar functionality using the following approach:

Store the image data in item.image and its URI in item.image.uri, obtaining the image with item.get_image()
Implement reference handling through the existing image processing pipeline:

elif image_mode == ImageRefMode.REFERENCED:
    new_doc = self._with_pictures_refs(
        image_dir=artifacts_dir, reference_path=reference_path
    )

I think this would make pipelines built on docling more extensible, and it would be very beneficial for various kinds of post-processing on table images.
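To make the proposal concrete, here is a minimal sketch of what a referenced table image could look like in the markdown output. It is not docling code: the helper name `table_image_markdown`, the `table_NNNNNN.png` file naming, and the `![Table N](...)` alt text are all assumptions chosen for illustration. Only the idea of an artifacts directory and a reference path comes from the snippet above.

```python
import os
from pathlib import Path


def table_image_markdown(index, artifacts_dir, reference_path=None):
    """Return (image_path, markdown_line) for the index-th table image.

    Hypothetical helper: the file naming and alt text are illustrative only.
    """
    image_path = Path(artifacts_dir) / f"table_{index:06}.png"
    if reference_path is not None:
        # Make the URI relative to the directory the markdown file lives in.
        uri = os.path.relpath(image_path, start=reference_path)
    else:
        uri = str(image_path)
    # Use forward slashes so the reference also works in markdown on Windows.
    uri = Path(uri).as_posix()
    return image_path, f"![Table {index}]({uri})"
```

A pipeline would save the cropped table to `image_path` and write the returned line in place of (or next to) the generated markdown table, so that downstream steps can pick up the image for post-processing.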

...

Alternatives

...

wcool1 · 2025-01-16T09:44:55Z

Hello. You say that referencing images using URIs in markdown formatting has proven valuable for many data pipeline implementations. Why are URIs better than the text/table that OCR produces in the markdown output? Could you give some use cases or evidence?

jyothisv · 2025-01-21T19:37:01Z

This feature would be quite useful for me. Alternatively, is there an easy way to use another model (say an LLM) specifically to convert table images into markdown and inject the output into the document?

PeterStaar-IBM · 2025-01-28T07:40:42Z

@rajuptvs Actually, this feature is already supported. Any DocItem from the DoclingDocument can be cropped from the original document; just ensure that you keep the page_images at conversion.

josippavicic · 2025-01-28T10:24:06Z

Can someone give a code example for this supported feature?

PeterStaar-IBM · 2025-01-28T10:29:16Z

@josippavicic Just call this get_image (https://github.com/DS4SD/docling-core/blob/b787d53173e9e2325f25f03a7e442d5b4194e5a4/docling_core/types/doc/document.py#L568) on any DocItem in the Document.

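from docling_core.types.doc.document import DocItem

# get_image() takes the parent document so it can look up the page image
for item, level in true_doc.iterate_items():
    if isinstance(item, DocItem):
        pil_image = item.get_image(true_doc)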
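Putting the two hints together (keep the page images at conversion, then call get_image on the items), a fuller sketch might look like the following. It assumes a recent docling release in which `PdfPipelineOptions.generate_page_images` controls whether page images are kept and `get_image` takes the document as its argument. The helper names `convert_keeping_page_images` and `save_table_images` and the `table-N.png` naming are hypothetical.

```python
from pathlib import Path


def convert_keeping_page_images(pdf_path):
    """Convert a PDF with docling while retaining the page images."""
    # Imported lazily so save_table_images() can be used without docling.
    from docling.datamodel.base_models import InputFormat
    from docling.datamodel.pipeline_options import PdfPipelineOptions
    from docling.document_converter import DocumentConverter, PdfFormatOption

    options = PdfPipelineOptions()
    options.generate_page_images = True  # required so items can be cropped later
    converter = DocumentConverter(
        format_options={InputFormat.PDF: PdfFormatOption(pipeline_options=options)}
    )
    return converter.convert(pdf_path).document


def save_table_images(doc, out_dir):
    """Crop every table in doc, save it as PNG, and return the written paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for index, table in enumerate(doc.tables, start=1):
        image = table.get_image(doc)  # None without page image or provenance
        if image is None:
            continue
        path = out_dir / f"table-{index}.png"
        image.save(path, "PNG")
        paths.append(path)
    return paths
```

Typical use would be `save_table_images(convert_keeping_page_images("report.pdf"), "artifacts")`, where `report.pdf` stands in for your own file. The saved paths can then be referenced from markdown or passed to another model for post-processing.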

rajuptvs added the enhancement (New feature or request) label Jan 15, 2025

PeterStaar-IBM closed this as completed Jan 28, 2025
