Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

build(deps): update docling[tesserocr] requirement from <=2.8.3,>=2.4.2 to >=2.4.2,<=2.18.0 #540

Conversation

dependabot[bot]
Copy link
Contributor

@dependabot dependabot bot commented on behalf of github Feb 4, 2025

Updates the requirements on docling[tesserocr] to permit the latest version.

Release notes

Sourced from docling[tesserocr]'s releases.

v2.18.0

Feature

Fix

  • markdown: Fix parsing if doc ending with table (#873) (5ac2887)
  • markdown: Add support for HTML content (#855) (94751a7)
  • docx: Merged table cells not properly converted (#857) (0cd81a8)
  • Processing of placeholder shapes in pptx that have text but no bbox (#868) (eff16b6)
  • KeyError in tableformer prediction (#854) (b1cf796)
  • Fixed docx import with headers that are also lists (#842) (2c037ae)
  • Use new add_code in html backend and add more typing hints (#850) (2a1f8af)
  • markdown: Fix empty block handling (#843) (bccb022)
  • Fix for the crash when encountering WMF images in pptx and docx (#837) (fea0a99)

Documentation

  • Updated the readme with upcoming features (#831) (d7c0828)
  • Add example for inspection of picture content (#624) (f9144f2)
Changelog

Sourced from docling[tesserocr]'s changelog.

v2.18.0 - 2025-02-03

Feature

Fix

  • markdown: Fix parsing if doc ending with table (#873) (5ac2887)
  • markdown: Add support for HTML content (#855) (94751a7)
  • docx: Merged table cells not properly converted (#857) (0cd81a8)
  • Processing of placeholder shapes in pptx that have text but no bbox (#868) (eff16b6)
  • KeyError in tableformer prediction (#854) (b1cf796)
  • Fixed docx import with headers that are also lists (#842) (2c037ae)
  • Use new add_code in html backend and add more typing hints (#850) (2a1f8af)
  • markdown: Fix empty block handling (#843) (bccb022)
  • Fix for the crash when encountering WMF images in pptx and docx (#837) (fea0a99)

Documentation

  • Updated the readme with upcoming features (#831) (d7c0828)
  • Add example for inspection of picture content (#624) (f9144f2)

v2.17.0 - 2025-01-28

Feature

  • CLI: Expose code and formula models in the CLI (#820) (6882e6c)
  • Add platform info to CLI version printout (#816) (95b293a)
  • ocr: Expose rec_keys_path in RapidOcrOptions to support custom dictionaries (#786) (5332755)
  • Introduce automatic language detection in TesseractOcrCliModel (#800) (3be2fb5)

Fix

  • Fix single newline handling in MD backend (#824) (5aed9f8)
  • Use file extension if filetype fails with PDF (#827) (adf6353)
  • Parse html with omitted body tag (#818) (a112d7a)

Documentation

... (truncated)

Commits
  • b5da408 chore: bump version to 2.18.0 [skip ci]
  • 5ac2887 fix(markdown): fix parsing if doc ending with table (#873)
  • a40544a chore: clean up top-level file (#872)
  • 94751a7 fix(markdown): add support for HTML content (#855)
  • 6a76b49 feat: Expose equation exports (#869)
  • 0cd81a8 fix(docx): merged table cells not properly converted (#857)
  • eff16b6 fix: Processing of placeholder shapes in pptx that have text but no bbox (#868)
  • b1cf796 fix: KeyError in tableformer prediction (#854)
  • 70d68b6 feat: Add option to define page range (#852)
  • d727b04 feat(docx): Support of SDTs in docx backend (#853)
  • Additional commits viewable in compare view

Most Recent Ignore Conditions Applied to This Pull Request
Dependency Name Ignore Conditions
docling[tesserocr] [>= 2.10.dev0, < 2.11]
docling[tesserocr] [>= 2.14.dev0, < 2.15]
docling[tesserocr] [>= 2.15.dev0, < 2.16]
docling[tesserocr] [>= 2.16.dev0, < 2.17]
docling[tesserocr] [>= 2.17.dev0, < 2.18]

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Updates the requirements on [docling[tesserocr]](https://github.com/DS4SD/docling) to permit the latest version.
- [Release notes](https://github.com/DS4SD/docling/releases)
- [Changelog](https://github.com/DS4SD/docling/blob/main/CHANGELOG.md)
- [Commits](DS4SD/docling@v2.4.2...v2.18.0)

---
updated-dependencies:
- dependency-name: docling[tesserocr]
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file python Pull requests that update Python code labels Feb 4, 2025
@mergify mergify bot added the ci-failure label Feb 4, 2025
@bbrowning
Copy link
Contributor

@dependabot ignore this minor version

@dependabot dependabot bot closed this Feb 4, 2025
Copy link
Contributor Author

dependabot bot commented on behalf of github Feb 4, 2025

OK, I won't notify you about version 2.18.x again, unless you re-open this PR.

@dependabot dependabot bot deleted the dependabot/pip/docling-tesserocr--gte-2.4.2-and-lte-2.18.0 branch February 4, 2025 11:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci-failure dependencies Pull requests that update a dependency file python Pull requests that update Python code
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant