@vpietila-amd
Contributor

Proposed changes

Adds merging of multiple forward convolution groups into a single GEMM batch. Most of the required components were already available; the only major code changes are to the group offset calculations in the CK Tile grouped forward convolution kernel.
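
For reviewers unfamiliar with group merging, here is a minimal sketch of how the group offsets might change when several groups share one GEMM batch. It is an illustration under stated assumptions, not the code in this PR: `GroupStrides`, `GroupOffsets`, `num_gemm_batches`, and `merged_group_offsets` are hypothetical names, and it assumes each tensor has a uniform per-group element stride and that consecutive groups are merged.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical per-group element strides (distance between consecutive
// groups in each tensor). These are NOT the CK Tile kernel's actual types.
struct GroupStrides {
    std::int64_t input;
    std::int64_t weight;
    std::int64_t output;
};

// Element offsets of the first group handled by one GEMM batch.
struct GroupOffsets {
    std::int64_t input;
    std::int64_t weight;
    std::int64_t output;
};

// Number of GEMM batches when `groups_per_batch` consecutive groups are merged.
// Rounds up so a trailing partial batch is still counted.
inline std::int64_t num_gemm_batches(std::int64_t num_groups,
                                     std::int64_t groups_per_batch) {
    assert(num_groups >= 0 && groups_per_batch > 0);
    return (num_groups + groups_per_batch - 1) / groups_per_batch;
}

// Without merging, batch b starts at group b. With merging (assumed scheme),
// batch b starts at group b * groups_per_batch, so every offset is scaled
// by the merge factor.
inline GroupOffsets merged_group_offsets(std::int64_t batch_idx,
                                         std::int64_t groups_per_batch,
                                         const GroupStrides& strides) {
    assert(batch_idx >= 0 && groups_per_batch > 0);
    const std::int64_t first_group = batch_idx * groups_per_batch;
    return GroupOffsets{first_group * strides.input,
                        first_group * strides.weight,
                        first_group * strides.output};
}
```

With `groups_per_batch == 1` this reduces to the usual one-group-per-batch offsets, which is why only the offset calculation would need to change in such a scheme.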
