chore: use shared containers for integration tests #924

gruuya · 2025-01-30T13:10:19Z

Closes #923

This PR makes integration tests re-use the same set of docker containers, hence allowing them to be run concurrently

all existing tests are moved to a shared_tests module, with a single top-level shared.rs file, thus requiring only one compilation step (as opposed to compiling every test file individually)
set_test_fixture is made sync, since there's no async code in it
given 1 and 2, the test fixture is wrapped in a OnceLock, so that it is invoked only once for the resulting (shared) test binary; it also has a corresponding destructor to spin down the containers after the tests run
if there's a need for integration tests which don't use the shared docker containers, their test files should be placed alongside shared.rs (top-level); otherwise they should be created in the shared_tests module
one (shared) file can now contain an arbitrary number of tests that utilize the shared set of containers (without having to resort to decorating them with #[serial])

Finally, with this change my integration test runs go from taking ~3.5 minutes down to 30 seconds.

EDIT: The CI unit workflow run duration also seems ~10min shorter now.

gruuya · 2025-01-30T13:15:11Z

crates/integration_tests/src/lib.rs

 pub struct TestFixture {
    pub _docker_compose: DockerCompose,
-    pub rest_catalog: RestCatalog,
+    pub catalog_config: RestCatalogConfig,


Since the tests now share this fixture but they're all executed in different runtimes, they can't share the same client, as those can get dropped prematurely when a test ends, resulting in dispatch task is gone: runtime dropped the dispatch task

crates/integration_tests/tests/shared/append_partition_data_file_test.rs

kevinjqliu

generally lgtm. Since we're using a shared catalog, are there any concerns with side effects?
In pyiceberg we use a table identifier fixture to generate table names so they dont conflict

gruuya · 2025-01-30T16:27:02Z

generally lgtm. Since we're using a shared catalog, are there any concerns with side effects? In pyiceberg we use a table identifier fixture to generate table names so they dont conflict

Good point, I could also extract the namespace as a shared fixture among the tests (in the PR i just ignore the result of create_namespace assuming it was already created if it was an error).

EDIT: I've extracted two common fixtures now: the apple-ios namespace and the foo-bar-baz schema.

xxchan

Not sure if I understand it correctly:

Before this PR, tests are compiled into separated binaries. In theory they can run concurrently (e.g., if use cargo nextest), but just cargo test will run them in serial.
In this PR, we put them into 1 binary. And then they are run concurrently by #[tokio::tests].
- We use "shared container" to also save the time of spinning up and down containers. But need to take care of potential conflicts.

crates/integration_tests/tests/shared_tests/scan_all_type.rs

gruuya · 2025-02-02T06:53:09Z

Before this PR, tests are compiled into separated binaries. In theory they can run concurrently (e.g., if use cargo nextest), but just cargo test will run them in serial.

Correct; at present the tests can't be run concurrently because they depend on separate docker container sets. (Also even if they didn't, cargo nextest would compile each top-level test file separately.)

In this PR, we put them into 1 binary. And then they are run concurrently by #[tokio::tests].

We use "shared container" to also save the time of spinning up and down containers. But need to take care of potential conflicts.

Yep, that's it in a nutshell. Using the shared container set is the biggest time-saver (eliminating the docker build-start-stop overhead for each test). Having made them use shared containers, the next simple improvement is to make them all compile to one test binary (thus eliminating multiple compilations), as is done through having a single top-level shared.rs file, and consequently run them concurrently as well.

xxchan

Code changes and the idea LGTM 👍 Maybe we could rename the PR title to "use shared container for integration tests" instead as it's the biggest time-saver?

kevinjqliu · 2025-02-02T18:41:43Z

thanks! generally lgtm.
Does this also affect CI runs? if so, would be great to also include how much time we saved there too.

gruuya · 2025-02-03T07:48:40Z

thanks! generally lgtm. Does this also affect CI runs? if so, would be great to also include how much time we saved there too.

Good question; a brief glance suggests it may have shaved about ~5 minutes from the unit CI workflow, e.g. by comparing the Test step duration from a job on this PR(3m 51s) and a job on another PR(8m 45s).

The difference would certainly grow as more and more integration tests are added.

EDIT: It's actually closer to 10 minutes, since the integration tests are run twice, once in the Test step and once in the Async-std Test step.

kevinjqliu · 2025-02-03T16:15:23Z

thats amazing! Thanks for looking into that, i like faster CI :)

gruuya · 2025-02-04T07:07:27Z

Hey @ZENOTME @Fokko, can you also take a look at this proposal?

By extension make them use the same set of docker containers.

ZENOTME · 2025-02-06T17:49:19Z

Hey @ZENOTME @Fokko, can you also take a look at this proposal?

Thanks! @gruuya It's a great job to improve our ci. the idea and code change LGTM.

ZENOTME · 2025-02-06T17:53:20Z

crates/integration_tests/tests/shared_tests/mod.rs

+    ns
+}
+
+fn test_schema() -> Schema {


Each test is an independent test and I think it's common for them to have different schemas, e.g. scal_all_type. Do we really need to extract this as a public function? 🤔

Hey, thanks for the review! So initially I found that append_data_file_test.rs, append_partition_data_file_test.rs and conflict_commit_test.rs use the same schema, and I figured it made sense to extract it simply for de-duplication (and allowing any future tests where schema isn't that important to re-use it as well).

I don't have any strong opinions on it though—would you prefer for the schemas to be constructed directly in each of those tests as before?

I figured it made sense to extract it simply for de-duplication (and allowing any future tests where schema isn't that important to re-use it as well).

Both way looks good to me. I'm fine to keep it now for reuse by 3 case and we can remove them when need to evolve schema indenpently.

ZENOTME · 2025-02-07T09:06:47Z

cc @liurenjie1024 @Xuanwo @sdd

Xuanwo

Thank you @gruuya for working on this!

gruuya commented Jan 30, 2025

View reviewed changes

gruuya force-pushed the concurrent-integration-tests branch from fd569a9 to 110ce33 Compare January 30, 2025 13:17

gruuya mentioned this pull request Jan 30, 2025

chore: Improve efficiency of Docker fixtures in integration_tests #923

Closed

kevinjqliu reviewed Jan 30, 2025

View reviewed changes

gruuya force-pushed the concurrent-integration-tests branch from fcf379f to 7fb4d98 Compare January 31, 2025 08:56

xxchan reviewed Feb 2, 2025

View reviewed changes

crates/integration_tests/tests/shared_tests/scan_all_type.rs Outdated Show resolved Hide resolved

xxchan approved these changes Feb 2, 2025

View reviewed changes

gruuya changed the title ~~chore: make integration tests run concurrently~~ chore: use shared containers for integration tests Feb 3, 2025

gruuya added 2 commits February 5, 2025 14:04

Make integration tests run concurrently

b5cd038

By extension make them use the same set of docker containers.

Extract the common namespace and schema as fixtures for the shared tests

42bc45d

gruuya force-pushed the concurrent-integration-tests branch from 7587992 to ac1821f Compare February 5, 2025 13:05

Make shared tests use random namespaces

3d9efa5

gruuya force-pushed the concurrent-integration-tests branch from ac1821f to 3d9efa5 Compare February 5, 2025 13:09

ZENOTME reviewed Feb 6, 2025

View reviewed changes

ZENOTME approved these changes Feb 7, 2025

View reviewed changes

Xuanwo approved these changes Feb 8, 2025

View reviewed changes

Xuanwo merged commit b454ce6 into apache:main Feb 8, 2025
18 checks passed

gruuya mentioned this pull request Feb 8, 2025

feat(datafusion): Expose DataFusion statistics on an IcebergTableScan #880

Open

chore: use shared containers for integration tests #924

chore: use shared containers for integration tests #924

Uh oh!

Conversation

gruuya commented Jan 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gruuya Jan 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kevinjqliu left a comment

Choose a reason for hiding this comment

Uh oh!

gruuya commented Jan 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xxchan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gruuya commented Feb 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xxchan left a comment

Choose a reason for hiding this comment

Uh oh!

kevinjqliu commented Feb 2, 2025

Uh oh!

gruuya commented Feb 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kevinjqliu commented Feb 3, 2025

Uh oh!

gruuya commented Feb 4, 2025

Uh oh!

ZENOTME commented Feb 6, 2025

Uh oh!

ZENOTME Feb 6, 2025

Choose a reason for hiding this comment

Uh oh!

gruuya Feb 7, 2025

Choose a reason for hiding this comment

Uh oh!

ZENOTME Feb 7, 2025

Choose a reason for hiding this comment

Uh oh!

ZENOTME commented Feb 7, 2025

Uh oh!

Xuanwo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

gruuya commented Jan 30, 2025 •

edited

Loading

gruuya commented Jan 30, 2025 •

edited

Loading

gruuya commented Feb 2, 2025 •

edited

Loading

gruuya commented Feb 3, 2025 •

edited

Loading