Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) #16985

Phylliida · 2025-11-04T00:22:49Z

This adds extra functions

ggml_conv_2d_circular
ggml_conv_2d_dw_circular
ggml_conv_2d_dw_direct_circular
ggml_conv_transpose_2d_p0_circular
ggml_conv_2d_direct_circular
ggml_pad_circular
ggml_pad_ext_circular

That have equivalent signatures to the non-circular versions (I considered modifying the existing ones, but didn't want to break existing code). Instead of padding with zeros, they act "on a torus" and loop x and y around.

I implemented this for CUDA, CPU, and Vulkan, as those are the primary backends people use in KoboldCpp/Stable Diffusion Cpp to generate images. For other backends, it'll fall back to non-circular.

This can be used to make seamless textures, see leejet/stable-diffusion.cpp#914 for an example and the changes needed on the image generation side. For some models (Stable Diffusion) simply calling the circular functions is sufficient, for other models (Qwen Image) you need to modify Rope embeddings slightly as well (so they cleanly loop).

I ran CI tests and added tests for these, but happy to answer any questions/modify things as needed.

jeffbolznv · 2025-11-04T02:21:48Z

ggml/include/ggml.h

            int                   d1); // dilation dimension 1

+
+    GGML_API struct ggml_tensor * ggml_conv_2d_circular(


I'd personally prefer the wrapping to be an option on existing commands (either add an optional parameter to existing functions, or do something like ggml_mul_mat_set_prec to modify it after it's created. But the core maintainers should decide. I just don't want to end up with 2^N different convolution functions as these additional options keep getting added.

I don't think an optional parameter is an option (iiuc) since they are c api? (could hack with macros, but not ideal) But I'm open to some state modifying thing if that's what they want

jeffbolznv · 2025-11-04T02:26:45Z

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp

            uint32_t H_idx = OH_idx * p.s1 + KH_idx_b * p.d1 - p.p1;
            uint32_t W_idx = OW_idx * p.s0 + KW_idx_b * p.d0 - p.p0;
 #endif
+            H_pos            = (p.circular != 0) ? wrap_coord(int(H_idx), p.H) : H_idx;


This causes a pretty significant slowdown in test-backend-ops -o CONV_2D perf for some cases. Once #16978 lands, circular should become a spec constant and this problem will go away.

Acly · 2025-11-04T09:44:09Z

I am wondering, is it possible to add only a variant of ggml_pad with circular padding, use that as separate operation before the convolutions, then do the convolution without padding? How much slower is that?

Adding circular padding natively to all convolutions on all/most backends is a lot of investment. I'm not sure how common it is, so it would be interesting to know the trade-off.

Phylliida added 5 commits November 3, 2025 13:27

Feat: Added vulkan circular tiling support

f6ac084

Feat: Added cpu circular

d7f5958

Feat: Added cuda kernels

1b62b49

Added tests

60bed3b

Added tests

5700a4e

Phylliida requested review from 0cc4m, ggerganov and slaren as code owners November 4, 2025 00:22

github-actions bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Nov 4, 2025

DajanaV mentioned this pull request Nov 4, 2025

UPSTREAM PR #16985: Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) auroralabs-loci/llama.cpp#67

Open

This was referenced Nov 4, 2025

Seamless texture generation support for qwen image leejet/stable-diffusion.cpp#914

Open

Add circular tiling support (for making seamless textures) ggml-org/ggml#1374

Open

jeffbolznv reviewed Nov 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) #16985

Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) #16985

Uh oh!

Phylliida commented Nov 4, 2025

Uh oh!

jeffbolznv Nov 4, 2025

Uh oh!

Phylliida Nov 4, 2025 •

edited

Loading

Uh oh!

jeffbolznv Nov 4, 2025

Uh oh!

Acly commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		int d1); // dilation dimension 1


		GGML_API struct ggml_tensor * ggml_conv_2d_circular(

Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) #16985

Are you sure you want to change the base?

Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) #16985

Uh oh!

Conversation

Phylliida commented Nov 4, 2025

Uh oh!

jeffbolznv Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Phylliida Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jeffbolznv Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Acly commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Phylliida Nov 4, 2025 •

edited

Loading