Conversation
Pull Request Overview
Adds support for Apple Metal GPU by introducing a Metal backend alongside existing CUDA and CPU paths, updating the generic backend interface, and providing matching build configurations and documentation.
- Introduce `metal-cpp/Foundation` headers to mirror core Foundation APIs for Metal.
- Implement `MetalKops` in `kernel_ops.cpp` to handle Metal compute pipelines, and update CUDA/CPU ops to use tensor-based division.
- Modify `BackendOps` (`div`, `alloc`) signatures and update tooling (README, VSCode settings) to enable Metal GPU builds.
Reviewed Changes
Copilot reviewed 131 out of 131 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| backends/gpu/metal/metal-cpp/Foundation/NSDictionary.hpp | Added Foundation dictionary wrapper |
| backends/gpu/metal/kernel_ops.cpp | Implemented MetalKops for creating and encoding Metal pipelines |
| backends/gpu/cuda/kernel.cu | Switched div kernel to a tensor-based version with safety tweak |
| backends/backend_ops.h | Updated virtual div and alloc methods to accept tensor inputs |
| README.md | Documented Metal support and new build_mac_gpu.sh script |
| .vscode/settings.json | Expanded IntelliSense header list for Metal |
Comments suppressed due to low confidence (3)
backends/gpu/metal/metal-cpp/Foundation/NSBundle.hpp:37
- [nitpick] The second parameter is unnamed, which hurts readability. Consider naming it (e.g., `pComment`) to clarify its purpose.
class String* LocalizedString(const String* pKey, const String*);
README.md:61
- The documentation references `build_mac_gpu.sh`, but this script isn't tracked in the repo. Either include the script or update the instructions to match the available files.
./build_mac_gpu.sh
.vscode/settings.json:51
- [nitpick] Including an extensive list of standard headers for IntelliSense can slow down editor performance. Consider trimming this list to only the most critical entries.
"typeinfo": "cpp",
```cpp
    throw std::runtime_error("Failed to create compute command encoder");
}
NS::Error* error = nullptr;
MTL::ComputePipelineState* pipelineState = device->newComputePipelineState(function, &error);
```
The created `pipelineState` is never released, leading to a Metal resource leak. Consider calling `pipelineState->release()` after encoding, or wrapping it in an autorelease pool.
```cpp
NS::Error* error = nullptr;
MTL::ComputePipelineState* pipelineState = device->newComputePipelineState(function, &error);
if (!pipelineState) {
    std::cerr << "Error creating compute pipeline state: " << error->localizedDescription()->utf8String() << std::endl;
```
If `error` remains null, calling `localizedDescription()` will crash. Add a null check for `error` before dereferencing it.
Suggested change:

```cpp
if (error) {
    std::cerr << "Error creating compute pipeline state: " << error->localizedDescription()->utf8String() << std::endl;
} else {
    std::cerr << "Error creating compute pipeline state: Unknown error." << std::endl;
}
```
Cheer up! Metal is now supported!