Use tmpfs for integration tests #5959

flouthoc · 2025-01-30T20:19:39Z

What type of PR is this?

/kind api-change
/kind bug
/kind cleanup
/kind deprecation
/kind design
/kind documentation
/kind failing-test
/kind feature
/kind flake
/kind other

What this PR does / why we need it:

How to verify it

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

None

flouthoc · 2025-01-31T19:28:30Z

@Luap99 It seems increasing CPU for vfs has some improvement.

flouthoc · 2025-01-31T19:29:00Z

Also containerized_integration came from 27 something to 19m

Luap99 · 2025-02-03T10:43:19Z

cirrus graph for the container task seems broken as this usage makes no sense.

Looking at another int test we do not seem max out the cpu much so I am not sure if 8 cores make much sense. Also memory usage seems quite low so I think we can reduce that as well.

Overall the first question is what is the goal here? 30 mins total CI like podman? What is the target time we are aiming for?
Once we know that we can see how much we want to bump the cores, because we also need to keep in mind of costs. If double the cores means 2/3 of the time then the costs will most likely be higher.

flouthoc · 2025-02-03T15:03:35Z

Overall the first question is what is the goal here? 30 mins total CI like podman

I think we are already under 30mins, this PR was just an experiment. Increasing cores reduces time by a lot but yeah I don't think it's a good idea to keep throwing money at the problem if its not needed.

Luap99 · 2025-02-03T15:37:51Z

Overall the first question is what is the goal here? 30 mins total CI like podman

I think we are already under 30mins, this PR was just an experiment. Increasing cores reduces time by a lot but yeah I don't think it's a good idea to keep throwing money at the problem if its not needed.

I am talking about total time for CI to finish. Not just a single task time. There is a bit of overhead in scheduling tasks.

You need to look at the cirrus build page at the top it shows you Finished in 57:37 on this PR (because it was not rebased on the unit test speedup)

flouthoc · 2025-02-03T16:12:11Z

Overall the first question is what is the goal here? 30 mins total CI like podman

I think we are already under 30mins, this PR was just an experiment. Increasing cores reduces time by a lot but yeah I don't think it's a good idea to keep throwing money at the problem if its not needed.

I am talking about total time for CI to finish. Not just a single task time. There is a bit of overhead in scheduling tasks.

You need to look at the cirrus build page at the top it shows you Finished in 57:37 on this PR (because it was not rebased on the unit test speedup)

Yes I am only talking about integration tests, waiting for this to merge for unit tests here #5954

flouthoc · 2025-02-03T19:19:50Z

@Luap99 I see minor improvement in other tests but not in containerized_integration.

flouthoc · 2025-02-06T14:57:27Z

@Luap99 @nalind PTAL

Luap99

Does the tmpfs change actually do anything? AFAICT we set TMPDIR=/var/tmp so nothing ends up on tmpfs?

In general a PR like this benefits from precise numbers before/after. A commit like "bump cpu and memory" is not helping any future reader. It is missing the why we do this and the numbers on much we safe.

contrib/cirrus/lib.sh

contrib/cirrus/setup.sh

.cirrus.yml

Luap99 · 2025-02-07T17:16:43Z

Actually never mind we only set TMPDIR: '/var/tmp' on the conformance vfs task per 8b0ecd7

So I think the tests were already using tmpfs

TomSweeneyRedHat · 2025-02-17T22:02:01Z

LGTM

TomSweeneyRedHat · 2025-02-17T22:02:17Z

/lgtm

TomSweeneyRedHat · 2025-02-17T22:02:48Z

@flouthoc you may need a rebase here.

flouthoc · 2025-02-17T22:23:59Z

@flouthoc you may need a rebase here.

Done.

flouthoc · 2025-02-18T13:49:26Z

@rhatdan @nalind @TomSweeneyRedHat PTAL

imagebuildah/stage_executor.go

nalind · 2025-02-18T21:32:16Z

LGTM

Use regular `cat` to test the same functionality instead of using python image specifically for this part of test. Signed-off-by: flouthoc <[email protected]>

Signed-off-by: flouthoc <[email protected]>

use /tmp as TMPDIR so tests use tmpfs Signed-off-by: flouthoc <[email protected]>

Add `run_with_log` to mkcw tests. Add `sleep 1` during cleanup between attempting `luksClose` and unmounting the filesystem mounted on the device /dev/mapper/"$uuid". Without this somehow we end up in a state where mount is still being used by the kernel because when we do `lsof /dev/mapper/"$uuid"` it shows nothing but `dmsetup info -c $uuid` shows the device is still under use. Adding `sleep 1` in between somehow fixes this. Also this problem with `cryptsetup` is pretty common for reference one thread which I found https://lore.kernel.org/all/[email protected]/T/ Signed-off-by: flouthoc <[email protected]>

flouthoc · 2025-02-18T22:50:49Z

Rebased with main.

giuseppe

/lgtm

giuseppe · 2025-02-19T10:11:05Z

/approve

rhatdan · 2025-02-19T14:00:00Z

/approve

openshift-ci · 2025-02-19T14:00:08Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: flouthoc, giuseppe, Luap99, rhatdan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [rhatdan]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

flouthoc force-pushed the integrate-experiment branch from 0f7b48c to 1c2b97f Compare January 30, 2025 20:20

flouthoc marked this pull request as draft January 30, 2025 21:16

openshift-ci bot added the do-not-merge/work-in-progress label Jan 30, 2025

flouthoc force-pushed the integrate-experiment branch 2 times, most recently from 7ad3de4 to df76c15 Compare January 31, 2025 16:50

flouthoc force-pushed the integrate-experiment branch 5 times, most recently from 5020a4b to c7dbcd8 Compare February 3, 2025 18:23

flouthoc force-pushed the integrate-experiment branch 4 times, most recently from e548e62 to fe0750f Compare February 6, 2025 14:56

flouthoc marked this pull request as ready for review February 6, 2025 14:56

openshift-ci bot removed the do-not-merge/work-in-progress label Feb 6, 2025

flouthoc changed the title ~~time integration tests~~ Use tmpfs for integration tests and bump resources Feb 6, 2025

flouthoc requested a review from Luap99 February 7, 2025 16:44

flouthoc assigned nalind Feb 7, 2025

Luap99 reviewed Feb 7, 2025

View reviewed changes

flouthoc force-pushed the integrate-experiment branch from fe0750f to 4547c96 Compare February 7, 2025 20:01

openshift-ci bot assigned TomSweeneyRedHat Feb 17, 2025

openshift-ci bot added the lgtm label Feb 17, 2025

flouthoc force-pushed the integrate-experiment branch from 91fc35a to 391ba38 Compare February 17, 2025 22:23

openshift-ci bot removed the lgtm label Feb 17, 2025

This was referenced Feb 18, 2025

Test mkcw-commit flakes sometimes #5983

Open

Test mkcw-convert flakes a lot in CI #5980

Open

nalind reviewed Feb 18, 2025

View reviewed changes

imagebuildah/stage_executor.go Outdated Show resolved Hide resolved

flouthoc force-pushed the integrate-experiment branch 2 times, most recently from 1d432ff to a8c42b1 Compare February 18, 2025 21:26

flouthoc requested a review from nalind February 18, 2025 21:27

flouthoc added 4 commits February 18, 2025 14:49

test: heredoc remove python dependency from test

c86f554

Use regular `cat` to test the same functionality instead of using python image specifically for this part of test. Signed-off-by: flouthoc <[email protected]>

heredoc: create temp subdirs for each build

efb28dc

Signed-off-by: flouthoc <[email protected]>

test: use /tmp as TMPDIR

d7d7878

use /tmp as TMPDIR so tests use tmpfs Signed-off-by: flouthoc <[email protected]>

flouthoc force-pushed the integrate-experiment branch from a8c42b1 to c87fd8e Compare February 18, 2025 22:50

giuseppe approved these changes Feb 19, 2025

View reviewed changes

openshift-ci bot assigned giuseppe Feb 19, 2025

openshift-ci bot added the lgtm label Feb 19, 2025

openshift-ci bot added the approved label Feb 19, 2025

openshift-merge-bot bot merged commit 3d14858 into containers:main Feb 19, 2025
34 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use tmpfs for integration tests #5959

Use tmpfs for integration tests #5959

flouthoc commented Jan 30, 2025

flouthoc commented Jan 31, 2025

flouthoc commented Jan 31, 2025

Luap99 commented Feb 3, 2025

flouthoc commented Feb 3, 2025

Luap99 commented Feb 3, 2025

flouthoc commented Feb 3, 2025

flouthoc commented Feb 3, 2025

flouthoc commented Feb 6, 2025

Luap99 left a comment

Luap99 commented Feb 7, 2025

TomSweeneyRedHat commented Feb 17, 2025

TomSweeneyRedHat commented Feb 17, 2025

TomSweeneyRedHat commented Feb 17, 2025

flouthoc commented Feb 17, 2025

flouthoc commented Feb 18, 2025

nalind commented Feb 18, 2025

flouthoc commented Feb 18, 2025

giuseppe left a comment

giuseppe commented Feb 19, 2025

rhatdan commented Feb 19, 2025

openshift-ci bot commented Feb 19, 2025

Use tmpfs for integration tests #5959

Use tmpfs for integration tests #5959

Conversation

flouthoc commented Jan 30, 2025

What type of PR is this?

What this PR does / why we need it:

How to verify it

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

flouthoc commented Jan 31, 2025

flouthoc commented Jan 31, 2025

Luap99 commented Feb 3, 2025

flouthoc commented Feb 3, 2025

Luap99 commented Feb 3, 2025

flouthoc commented Feb 3, 2025

flouthoc commented Feb 3, 2025

flouthoc commented Feb 6, 2025

Luap99 left a comment

Choose a reason for hiding this comment

Luap99 commented Feb 7, 2025

TomSweeneyRedHat commented Feb 17, 2025

TomSweeneyRedHat commented Feb 17, 2025

TomSweeneyRedHat commented Feb 17, 2025

flouthoc commented Feb 17, 2025

flouthoc commented Feb 18, 2025

nalind commented Feb 18, 2025

flouthoc commented Feb 18, 2025

giuseppe left a comment

Choose a reason for hiding this comment

giuseppe commented Feb 19, 2025

rhatdan commented Feb 19, 2025

openshift-ci bot commented Feb 19, 2025