Skip to content

Commit

Permalink
chore: make tests lighter (#228)
Browse files Browse the repository at this point in the history
Signed-off-by: Panos Vagenas <[email protected]>
  • Loading branch information
vagenas authored Nov 4, 2024
1 parent 244ca69 commit 8fb445f
Show file tree
Hide file tree
Showing 32 changed files with 1,423 additions and 11,439 deletions.
2 changes: 1 addition & 1 deletion .github/workflows/checks.yml
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ jobs:
run: |
for file in docs/examples/*.py; do
# Skip batch_convert.py
if [[ "$(basename "$file")" == "batch_convert.py" ]]; then
if [[ "$(basename "$file")" =~ ^(batch_convert|minimal|export_multimodal|custom_convert|develop_picture_enrichment).py ]]; then
echo "Skipping $file"
continue
fi
Expand Down
3 changes: 1 addition & 2 deletions docs/examples/batch_convert.py
Original file line number Diff line number Diff line change
Expand Up @@ -106,8 +106,7 @@ def main():
Path("./tests/data/2206.01062.pdf"),
Path("./tests/data/2203.01017v2.pdf"),
Path("./tests/data/2305.03393v1.pdf"),
Path("./tests/data/redp5110.pdf"),
Path("./tests/data/redp5695.pdf"),
Path("./tests/data/redp5110_sampled.pdf"),
]

# buf = BytesIO(Path("./test/data/2206.01062.pdf").open("rb").read())
Expand Down
1,883 changes: 0 additions & 1,883 deletions tests/data/groundtruth/docling_v1/redp5110.doctags.txt

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v1/redp5110.json

This file was deleted.

2,732 changes: 0 additions & 2,732 deletions tests/data/groundtruth/docling_v1/redp5110.md

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v1/redp5110.pages.json

This file was deleted.

299 changes: 299 additions & 0 deletions tests/data/groundtruth/docling_v1/redp5110_sampled.doctags.txt

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions tests/data/groundtruth/docling_v1/redp5110_sampled.json

Large diffs are not rendered by default.

421 changes: 421 additions & 0 deletions tests/data/groundtruth/docling_v1/redp5110_sampled.md

Large diffs are not rendered by default.

Large diffs are not rendered by default.

460 changes: 0 additions & 460 deletions tests/data/groundtruth/docling_v1/redp5695.doctags.txt

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v1/redp5695.json

This file was deleted.

726 changes: 0 additions & 726 deletions tests/data/groundtruth/docling_v1/redp5695.md

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v1/redp5695.pages.json

This file was deleted.

1,883 changes: 0 additions & 1,883 deletions tests/data/groundtruth/docling_v2/redp5110.doctags.txt

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v2/redp5110.json

This file was deleted.

2,615 changes: 0 additions & 2,615 deletions tests/data/groundtruth/docling_v2/redp5110.md

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v2/redp5110.pages.json

This file was deleted.

299 changes: 299 additions & 0 deletions tests/data/groundtruth/docling_v2/redp5110_sampled.doctags.txt

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions tests/data/groundtruth/docling_v2/redp5110_sampled.json

Large diffs are not rendered by default.

395 changes: 395 additions & 0 deletions tests/data/groundtruth/docling_v2/redp5110_sampled.md

Large diffs are not rendered by default.

Large diffs are not rendered by default.

460 changes: 0 additions & 460 deletions tests/data/groundtruth/docling_v2/redp5695.doctags.txt

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v2/redp5695.json

This file was deleted.

666 changes: 0 additions & 666 deletions tests/data/groundtruth/docling_v2/redp5695.md

This file was deleted.

1 change: 0 additions & 1 deletion tests/data/groundtruth/docling_v2/redp5695.pages.json

This file was deleted.

Binary file removed tests/data/redp5110.pdf
Binary file not shown.
Binary file added tests/data/redp5110_sampled.pdf
Binary file not shown.
Binary file removed tests/data/redp5695.pdf
Binary file not shown.
2 changes: 1 addition & 1 deletion tests/test_backend_docling_parse.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ def _get_backend(pdf_doc):


def test_text_cell_counts():
pdf_doc = Path("./tests/data/redp5695.pdf")
pdf_doc = Path("./tests/data/redp5110_sampled.pdf")

doc_backend = _get_backend(pdf_doc)

Expand Down
2 changes: 1 addition & 1 deletion tests/test_backend_docling_parse_v2.py
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ def _get_backend(pdf_doc):


def test_text_cell_counts():
pdf_doc = Path("./tests/data/redp5695.pdf")
pdf_doc = Path("./tests/data/redp5110_sampled.pdf")

doc_backend = _get_backend(pdf_doc)

Expand Down
2 changes: 1 addition & 1 deletion tests/test_backend_pdfium.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ def _get_backend(pdf_doc):


def test_text_cell_counts():
pdf_doc = Path("./tests/data/redp5695.pdf")
pdf_doc = Path("./tests/data/redp5110_sampled.pdf")

doc_backend = _get_backend(pdf_doc)

Expand Down

0 comments on commit 8fb445f

Please sign in to comment.