syncobj: use eventfd instead of stalling fd checks #9437

gulafaran · 2025-02-18T17:40:33Z

use eventfd and add it to the event loop and when it recieves a signal release the the queued surfacestate, this means we dont stall entire compositor when waiting for materilisation of the fd. and change its related usage.

should solve a lot of "compositor 5 fps while ingame overlay shows 300fps" and other similiar weird stalls.

on nvidia this shuffles compositor stalls to sometimes a noticeable rendering stutter, technically the acquire point isnt signaled fast enough so the buffer is stuck and nothing gets rendered until it is. pretty much the same stall as seen before only now its per surface rendering. but that i contribute to nvidia driver being simply shit, and is visible on other compositors.

NVIDIA/open-gpu-kernel-modules#777
NVIDIA/open-gpu-kernel-modules#743

fixes #7857
fixes #9340
fixes #9340
fixes #8588
fixes #7643
fixes #7317
fixes #9376
fixes #6844
fixes #6912

potentially helps:
#6617
#9011

remaining quirks to be fixed

[ x ] resizing rendering window like vkcube is delayed hard
its because we are getting a gazzillion xdg shell configure requests that fill the wayland event loop, and eventfd signal gets behind in the queue.
[ x ] walker crashes https://paste.pika-os.com/upload/bat-tiger-jaguar
Doesn't seem to work on GTK4:4.16 wmww/gtk4-layer-shell#50 requires gtk4-layer-shell 1.0.4
test and see if implicit/explicit sync didnt break hard or some other card/vendors AMD/Intel/etc
test and see if gpu/cpu usage didnt regress into other areas. not resetting damage or similiar
[ x ] syncobj points for async buffers

vaxerski

huge if works

src/protocols/DRMSyncobj.hpp

src/protocols/core/Compositor.cpp

ikalco · 2025-02-26T05:12:15Z

wait... for ds, shouldn't we release the wl_buffer whenever DRM signals the out fence?
so the client can reuse that buffer, and the output won't artifact cause drm said scanout is done

edit:
uuuh it seems HL just isn't using the out fence lol, am I missing something?
for reference (https://docs.kernel.org/gpu/drm-kms.html)

“OUT_FENCE_PTR”:
Use this property to pass a file descriptor pointer to DRM. Once the Atomic Commit request call returns OUT_FENCE_PTR will be filled with the file descriptor number of a Sync File. This Sync File contains the CRTC fence that will be signaled when all framebuffers present on the Atomic Commit * request for that given CRTC are scanned out on the screen.

gulafaran · 2025-02-26T06:23:25Z

wait... for ds, shouldn't we release the wl_buffer whenever DRM signals the out fence? so the client can reuse that buffer, and the output won't artifact cause drm said scanout is done

edit: uuuh it seems HL just isn't using the out fence lol, am I missing something? for reference (https://docs.kernel.org/gpu/drm-kms.html)

“OUT_FENCE_PTR”:
Use this property to pass a file descriptor pointer to DRM. Once the Atomic Commit request call returns OUT_FENCE_PTR will be filled with the file descriptor number of a Sync File. This Sync File contains the CRTC fence that will be signaled when all framebuffers present on the Atomic Commit * request for that given CRTC are scanned out on the screen.

There is a lot wrong at the moment, explicit sync currently relies on new buffer attached, the client can commit multiple times on the same buffer if it wants to, and does sometimes, and this PR probably exposed that quirk even more, and yeah direct scanout i havent begun reading into but removed that buffer manual buffer shenanigans because it was conflicting with the changes i made, there is also a few other things im checking that has popped up while testing

gulafaran · 2025-02-26T06:29:50Z

wait... for ds, shouldn't we release the wl_buffer whenever DRM signals the out fence? so the client can reuse that buffer, and the output won't artifact cause drm said scanout is done

edit: uuuh it seems HL just isn't using the out fence lol, am I missing something? for reference (https://docs.kernel.org/gpu/drm-kms.html)

“OUT_FENCE_PTR”:
Use this property to pass a file descriptor pointer to DRM. Once the Atomic Commit request call returns OUT_FENCE_PTR will be filled with the file descriptor number of a Sync File. This Sync File contains the CRTC fence that will be signaled when all framebuffers present on the Atomic Commit * request for that given CRTC are scanned out on the screen.

And "releasing" when it comes to explicit sync is sending the release point not the buffer .release

UjinT34 · 2025-02-26T09:06:43Z

output->state->enableExplicitOutFenceForNextCommit() should be called before the commit if out fence is needed. It fills output->state->state().explicitOutFence.
Who should handle that fd?

Note that OUT_FENCE_PTR shouldn't be used when tearing is allowed.

JunaidQrysh · 2025-02-26T14:18:58Z

I tested this pr & it fixed glmark2 lag but minecraft lag is still the same and also it introduced slight lag in other areas like workspace switching and cursor lag when wlogout is opened.

ikalco · 2025-02-26T15:37:07Z

output->state->enableExplicitOutFenceForNextCommit() should be called before the commit if out fence is needed. It fills output->state->state().explicitOutFence. Who should handle that fd?

Note that OUT_FENCE_PTR shouldn't be used when tearing is allowed.

we use that fd, i think we import it as a sync_file, then wait for it to signal, meaning aq plane is done with the primary buffer

gulafaran · 2025-02-26T16:10:07Z

I tested this pr & it fixed glmark2 lag but minecraft lag is still the same and also it introduced slight lag in other areas like workspace switching and cursor lag when wlogout is opened.

yeah got a few very broken things to figure out, im trying tho.

JunaidQrysh · 2025-03-02T09:39:41Z

@gulafaran you did amazing work, now everything runs smoothly. I ran various gpu stress tests, they ran without any lag.
This will be a major change in hyprland for nvidia users. I can finally ditch kde plasma for gaming.

ikalco · 2025-03-02T15:58:48Z

for this part

// #TODO does this apply to explicit sync?
    if (!syncobj && previousBuffer && previousBuffer->buffer && !previousBuffer->buffer->isSynchronous()) {

the lockedByBackend is unlocked when page flip buffer swap happens, so technically that's when we should signal release point since DRM scanout is no longer using the buffer
but also, im pretty sure this is what the DRM out fence does, so you could use that instead of lockedByBackend

czM1K3 · 2025-03-02T18:12:35Z

I can also say that this fixed my issue with my RX 580 so it might fix something like #8119 and others.

gulafaran · 2025-03-02T18:21:23Z

I can also say that this fixed my issue with my RX 580 so it might fix something like #8119 and others.

Probably will, and also some minecraft flickering and other things, im pushing soon a fix for a gtk issue and hyprlock issue so its not quite flawless yet 👍

gulafaran · 2025-03-03T03:47:11Z

I tested this pr & it fixed glmark2 lag but minecraft lag is still the same and also it introduced slight lag in other areas like workspace switching and cursor lag when wlogout is opened.

should be in a better state now, but not done yet.

src/helpers/sync/SyncReleaser.cpp

src/protocols/DRMSyncobj.cpp

src/helpers/Monitor.cpp

src/helpers/sync/SyncReleaser.cpp

src/protocols/LinuxDMABUF.cpp

src/protocols/core/Compositor.cpp

src/protocols/core/Compositor.hpp

vaxerski

fine by me. Runs alright on my end, but I don't know about others.

fufexan · 2025-03-06T16:31:49Z

WFM as well, fixed desync problems.

src/protocols/core/Compositor.cpp

cleanup a bit missing removals if resource not good, erasing from containers etc. make use of unique ptrs instead. and add default destructors.

remove early buffer release that was breaking explicit sync, the buffer needs to exist until the surface commit event has been emitted and draw calls added egl sync points, move to eventfd signaling instead of stalling sync point checks, and recommit pending commits if waiting on a signal. add a CDRMSyncPointState helper class. move a few weak pointers to shared pointers so they dont destruct before we need to use them.

eventfd requires us to queue pending stats until ready and then apply to current, and also when no ready state exist commit the client commit on the current existing buffer, if there is one.

clear current buffer damage on current buffer commits.

remove unused code, and ensure we dont commit a empty texture causing locksession protocol and gtk4-layer-shell misbehaving.

ensure the containers having the various buffers actually gets cleaned up from their containers, incase the CSignal isnt signaled because of expired smart pointers or just wrong order destruction because mishaps. also move the acquire/point setting to buffer attaching. instead of on precommit.

remove unused code and merge sync fds if fence is valid, remove manual directscanout buffer dropping that signals release point on pageflip, it can cause us to signal the release point while still keeping the current buffer and rendering it yet again causing wrong things.

delay buffer releases on non syncobj surfaces until next commit, and check on async buffers if syncobj and drop and signal the release point on backend buffer release.

ensure we follow protocol by replacing acquire/release points if they arrive late and replace already existing ones. also remove unneded brackets, and dont try to manual lock/release buffers when it comes to explicit protocol. it doesnt care about buffer releases only about acquire and release points and signaling them.

set points in precommit, before checking protocol errors and we catch any pending acquire/release points arriving late.

remove destructor resource destroying, let resources destroys them on their events, and move SSurfaceStates to types/SurfaceState.hpp

have to actually store the mergedfd to use it.

ensure the current asynchronous buffer is actually released on pageflip not the previous. cleanup a bit FD handling in commitPendingAndDoExplicitSync, and reuse the in fence when syncing surfaces.

calling resetexplicitfence without properly ensuring the FD is closed before will leak it, store it per monitor and let it close itself with the CFileDescriptor class.

buffers were never being sent released properly.

ensure the infence fd survives the scope of attemptdirectscanout so it doesnt close before it should have.

we might hit a race to finish on exit where the timeline just has destructed but the buffer waiter is still pending. and such we removeAllWaiters null dereferences.

remove quack comment, change to m_foo and use a std::vector and weakpointer in the waiter for removal instead of a std::list.

vaxerski

lgtm, are there any issues left? Works fine for me

remove unused async buffer drop, only related to directscanout and is handled elsewhere.

vaxerski

lets merge this shit

gulafaran marked this pull request as draft February 18, 2025 17:41

github-actions bot added core protocols render helpers labels Feb 18, 2025

vaxerski reviewed Feb 19, 2025

View reviewed changes

src/protocols/DRMSyncobj.hpp Outdated Show resolved Hide resolved

src/protocols/core/Compositor.cpp Outdated Show resolved Hide resolved

src/protocols/core/Compositor.cpp Outdated Show resolved Hide resolved

gulafaran force-pushed the eventfd branch from 7d74ba2 to 46dbd22 Compare February 22, 2025 12:55

github-actions bot added the config label Feb 22, 2025

gulafaran force-pushed the eventfd branch 4 times, most recently from 53df780 to e331b6c Compare February 24, 2025 10:20

gulafaran force-pushed the eventfd branch from e331b6c to a836b2f Compare February 28, 2025 14:43

gulafaran force-pushed the eventfd branch from c1c1a56 to e7bf039 Compare March 3, 2025 03:32

nnyyxxxx reviewed Mar 5, 2025

View reviewed changes

src/helpers/sync/SyncReleaser.cpp Outdated Show resolved Hide resolved

src/protocols/DRMSyncobj.cpp Outdated Show resolved Hide resolved

src/helpers/Monitor.cpp Outdated Show resolved Hide resolved

vaxerski reviewed Mar 5, 2025

View reviewed changes

vaxerski previously approved these changes Mar 6, 2025

View reviewed changes

ikalco reviewed Mar 12, 2025

View reviewed changes

src/protocols/core/Compositor.cpp Outdated Show resolved Hide resolved

gulafaran and others added 19 commits March 14, 2025 09:41

syncobj: cleanup and use uniqueptrs

5313104

cleanup a bit missing removals if resource not good, erasing from containers etc. make use of unique ptrs instead. and add default destructors.

syncobj: queue pending states for eventfd

70b662c

eventfd requires us to queue pending stats until ready and then apply to current, and also when no ready state exist commit the client commit on the current existing buffer, if there is one.

syncobj: clear current buffer damage

1cb97c4

clear current buffer damage on current buffer commits.

syncobj: cleanup code and fix hyprlock

1d58101

remove unused code, and ensure we dont commit a empty texture causing locksession protocol and gtk4-layer-shell misbehaving.

syncobj: delay buffer release on non syncobj

4c88419

delay buffer releases on non syncobj surfaces until next commit, and check on async buffers if syncobj and drop and signal the release point on backend buffer release.

syncobj: lets not complicate things

a887626

set points in precommit, before checking protocol errors and we catch any pending acquire/release points arriving late.

syncobj: move SSurfaceState to types

3a2ee96

remove destructor resource destroying, let resources destroys them on their events, and move SSurfaceStates to types/SurfaceState.hpp

syncobj: actually store the merged fd

8e973b0

have to actually store the mergedfd to use it.

syncobj: cleanup a bit around fences

4215a9f

ensure the current asynchronous buffer is actually released on pageflip not the previous. cleanup a bit FD handling in commitPendingAndDoExplicitSync, and reuse the in fence when syncing surfaces.

syncobjs: ensure fence FD doesnt leak

d3548fa

calling resetexplicitfence without properly ensuring the FD is closed before will leak it, store it per monitor and let it close itself with the CFileDescriptor class.

syncobj: ensure buffers are actually released

75bde16

buffers were never being sent released properly.

types: Defer buffer sync releaser until unlock

c41afa2

syncobj: store directscanout fence in monitor

51c5d7e

ensure the infence fd survives the scope of attemptdirectscanout so it doesnt close before it should have.

syncobj: check if if acquire is expired

02bf184

we might hit a race to finish on exit where the timeline just has destructed but the buffer waiter is still pending. and such we removeAllWaiters null dereferences.

syncobj: code style changes

bba8290

remove quack comment, change to m_foo and use a std::vector and weakpointer in the waiter for removal instead of a std::list.

gulafaran force-pushed the eventfd branch from a14d37c to 466b83e Compare March 14, 2025 08:45

vaxerski previously approved these changes Mar 14, 2025

View reviewed changes

syncobj: remove unused async buffer drop

76b5dcd

remove unused async buffer drop, only related to directscanout and is handled elsewhere.

gulafaran dismissed vaxerski’s stale review via 76b5dcd March 14, 2025 13:49

gulafaran force-pushed the eventfd branch from 466b83e to 76b5dcd Compare March 14, 2025 13:49

vaxerski approved these changes Mar 14, 2025

View reviewed changes

vaxerski merged commit 6ffde36 into hyprwm:main Mar 14, 2025
12 checks passed

vaxerski mentioned this pull request Mar 14, 2025

Breaking changes tracker #8424

Open

tomkoid mentioned this pull request Mar 14, 2025

Enabling vsync in games causes screen tearing when moving the mouse (Nvidia gpu) #7652

Closed

gulafaran mentioned this pull request Mar 14, 2025

Artifacts, Glitching in Hyprland #6994

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

syncobj: use eventfd instead of stalling fd checks #9437

syncobj: use eventfd instead of stalling fd checks #9437

gulafaran commented Feb 18, 2025 •

edited by vaxerski

Loading

vaxerski left a comment

ikalco commented Feb 26, 2025 •

edited

Loading

gulafaran commented Feb 26, 2025

gulafaran commented Feb 26, 2025

UjinT34 commented Feb 26, 2025

JunaidQrysh commented Feb 26, 2025

ikalco commented Feb 26, 2025

gulafaran commented Feb 26, 2025

JunaidQrysh commented Mar 2, 2025

ikalco commented Mar 2, 2025 •

edited

Loading

czM1K3 commented Mar 2, 2025

gulafaran commented Mar 2, 2025

gulafaran commented Mar 3, 2025

vaxerski left a comment

fufexan commented Mar 6, 2025

vaxerski left a comment

vaxerski left a comment

syncobj: use eventfd instead of stalling fd checks #9437

syncobj: use eventfd instead of stalling fd checks #9437

Conversation

gulafaran commented Feb 18, 2025 • edited by vaxerski Loading

remaining quirks to be fixed

vaxerski left a comment

Choose a reason for hiding this comment

ikalco commented Feb 26, 2025 • edited Loading

gulafaran commented Feb 26, 2025

gulafaran commented Feb 26, 2025

UjinT34 commented Feb 26, 2025

JunaidQrysh commented Feb 26, 2025

ikalco commented Feb 26, 2025

gulafaran commented Feb 26, 2025

JunaidQrysh commented Mar 2, 2025

ikalco commented Mar 2, 2025 • edited Loading

czM1K3 commented Mar 2, 2025

gulafaran commented Mar 2, 2025

gulafaran commented Mar 3, 2025

vaxerski left a comment

Choose a reason for hiding this comment

fufexan commented Mar 6, 2025

vaxerski left a comment

Choose a reason for hiding this comment

vaxerski left a comment

Choose a reason for hiding this comment

gulafaran commented Feb 18, 2025 •

edited by vaxerski

Loading

ikalco commented Feb 26, 2025 •

edited

Loading

ikalco commented Mar 2, 2025 •

edited

Loading