
Upgrade init_tensor API to return a ggml_status #11854

Open · wants to merge 1 commit into base: master

Conversation

WilliamTambellini
Contributor

To prepare for an 'abort-free' ggml, as agreed with Diego in the ggml repo, upgrade the backend init_tensor APIs to return a ggml_status.


@github-actions github-actions bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Feb 14, 2025
@WilliamTambellini
Contributor Author

@slaren review please. Thanks.

@WilliamTambellini
Contributor Author

Thanks @slaren.
Ready for review again.

@WilliamTambellini WilliamTambellini force-pushed the init_tensor branch 3 times, most recently from 29998fc to e2486eb Compare February 18, 2025 22:04
To prepare for an 'abort-free' ggml
(ggml should not abort on OOMs but return an OOM status),
as agreed with Diego in the ggml repo,
upgrade the init_tensor() and view_init() APIs
to return a ggml_status.
graehl

This comment was marked as outdated.


@graehl graehl left a comment


OK, so ggml_backend_*_buffer_init_tensor can only return success for most backends, but since it's called through the interface's init_tensor pointer they still need to return a status. Was the plan to eventually make cuda_init_tensor sometimes return an error?

@WilliamTambellini
Contributor Author

Thanks @graehl

OK, so ggml_backend_*_buffer_init_tensor can only return success for most backends, but since it's called through the interface's init_tensor pointer they still need to return a status. Was the plan to eventually make cuda_init_tensor sometimes return an error?

Yes, but that's for another PR in the ggml repo.

@WilliamTambellini
Contributor Author

@slaren, ready for review again please. Best.

Contributor

@matiaslin matiaslin left a comment


A good step forward towards the goal of returning an error instead of crashing.
