Sparse tensor data structures #1186

aestriplex · 2025-09-28T23:26:51Z

This PR implements a first draft of the data structures required to support sparse tensors in ACL.
Here are three points worth discussing:

1. In the COOTensor and CSRTensor classes, I implemented two different strategies for storing indices. In COO tensors, indices are stored in a std::vector (i.e., not in the buffer provided by the tensor allocator, which holds only the values). In CSR tensors, on the other hand, I stored them in the allocator buffer and kept only the corresponding (integer) offsets as class members. The CSR approach is more efficient when working with the tensor (e.g., in a kernel/operator), so I am thinking of extending it to the COO tensor as well. Would it be a problem if this approach is too "low level"?
2. The test class should be converted into a fixture.
3. As Pablo said in his comment on my design document on the mailing list, we need to make sure that validation of the current operators fails when they are given sparse tensors. A good idea, in my opinion, would be to create a "validate" method in ICpuOperator (and in the other kinds of base operators), so that it can be called within the validate of classes that implement this interface and do not (yet) support sparse tensors (e.g., CpuGemmConv2d). Using a shared method, rather than performing the direct check !src->info()->is_sparse() in each operator, allows these checks to be extended in the future without having to modify every operator.
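
The third point (a shared validation method in the operator base class) could be sketched roughly as follows. This is a minimal, self-contained illustration and not ACL code: TensorFormat, TensorInfoStub, StatusStub, OperatorBase, GemmConv2dStub and validate_dense_only are hypothetical names standing in for the real tensor info, status and operator types.

```cpp
#include <initializer_list>
#include <string>

// Hypothetical stand-ins for ACL types; names and fields are assumptions.
enum class TensorFormat { Dense, COO, CSR };

struct TensorInfoStub
{
    TensorFormat format;
    bool is_sparse() const { return format != TensorFormat::Dense; }
};

struct StatusStub
{
    bool        ok;
    std::string message;
};

// Hypothetical base class playing the role of ICpuOperator.
struct OperatorBase
{
    // Shared check: reject any sparse tensor. Changing the rule later
    // (e.g. allowing CSR inputs) only touches this one function.
    static StatusStub validate_dense_only(std::initializer_list<const TensorInfoStub *> infos)
    {
        for (const TensorInfoStub *info : infos)
        {
            if (info != nullptr && info->is_sparse())
            {
                return {false, "Sparse tensors are not supported by this operator"};
            }
        }
        return {true, ""};
    }
};

// An operator without sparse support calls the shared check first.
struct GemmConv2dStub : OperatorBase
{
    static StatusStub validate(const TensorInfoStub *src, const TensorInfoStub *weights, const TensorInfoStub *dst)
    {
        const StatusStub st = validate_dense_only({src, weights, dst});
        if (!st.ok)
        {
            return st;
        }
        // ... operator-specific checks would follow here ...
        return {true, ""};
    }
};
```

With this shape, an operator that later gains sparse support simply stops calling the shared check, and the other operators are left untouched.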

COOTensor init
CSRTensor init
added SparseTensorAllocator
moved SparseTensor to arm_compute/core
fixed to_dense method for COOTensor
fixed to_sparse and to_dense methods for CSRTensor
changed CSR indices; print function moved to the base class
implemented get_value for CSR and COO tensors
added SparseIterator
Change-Id: Id5ed095587662b750b89cbabd600a286ab9bf53b

aestriplex · 2025-09-28T23:27:35Z

arm_compute/runtime/COOTensor.h

+     */
+    COOTensor(const ITensor *tensor);
+
+    std::vector<Coordinates> _indices;


See point 1 in the PR description

aestriplex · 2025-09-28T23:27:45Z

arm_compute/runtime/CSRTensor.h

+     */
+    CSRTensor(const ITensor *tensor);
+
+    size_t _crow_bytes; /**< Row index size in bytes */


See point 1 in the PR description
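
To make point 1 of the PR description concrete, here is a minimal sketch of what extending the CSR strategy to COO might look like: indices and values share one contiguous buffer, and the class keeps only sizes and offsets as members. PackedCoo and its layout are hypothetical and are not the PR's COOTensor; the sketch assumes 32-bit indices, float values, and that every coordinate has exactly rank entries.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical packed COO container for float values.
// Buffer layout: [ nnz * rank int32 indices | nnz float values ].
class PackedCoo
{
public:
    // Assumes coords.size() == values.size() and coords[i].size() == rank.
    PackedCoo(std::size_t rank, const std::vector<std::vector<std::int32_t>> &coords, const std::vector<float> &values)
        : _rank(rank),
          _nnz(values.size()),
          _indices_bytes(_nnz * rank * sizeof(std::int32_t)),
          _buffer(_indices_bytes + _nnz * sizeof(float))
    {
        for (std::size_t i = 0; i < _nnz; ++i)
        {
            // Copy the i-th coordinate into its slot in the index region.
            std::memcpy(_buffer.data() + i * _rank * sizeof(std::int32_t), coords[i].data(),
                        _rank * sizeof(std::int32_t));
        }
        if (_nnz > 0)
        {
            // Values start right after the index region.
            std::memcpy(_buffer.data() + _indices_bytes, values.data(), _nnz * sizeof(float));
        }
    }

    std::int32_t index(std::size_t i, std::size_t dim) const
    {
        std::int32_t v = 0;
        std::memcpy(&v, _buffer.data() + (i * _rank + dim) * sizeof(std::int32_t), sizeof(v));
        return v;
    }

    float value(std::size_t i) const
    {
        float v = 0.0f;
        std::memcpy(&v, _buffer.data() + _indices_bytes + i * sizeof(float), sizeof(v));
        return v;
    }

    std::size_t nnz() const { return _nnz; }
    std::size_t indices_bytes() const { return _indices_bytes; }

private:
    std::size_t               _rank;
    std::size_t               _nnz;
    std::size_t               _indices_bytes; // Offset of the values region
    std::vector<std::uint8_t> _buffer;        // Stands in for the allocator buffer
};
```

A kernel would then need only the buffer pointer and the indices_bytes offset to reach both regions, which is the property that makes the CSR-style layout attractive.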

tests/validation/cpu/unit/SparseTensor.cpp

morgolock

Hi @aestriplex

I had a quick look at this and I think it's a good starting point. Please have a look and let me know your thoughts.

arm_compute/core/IReducibleTensor.h

morgolock · 2025-10-01T14:29:18Z

src/runtime/COOTensor.cpp

+#ifdef ARM_COMPUTE_ASSERTS_ENABLED
+void COOTensor::print(std::ostream &os) const
+{
+    const uint8_t *data = static_cast<const uint8_t *>(buffer());
+
+    if(_indices.empty())
+    {
+        os << "index: [] values: []" << std::endl;
+        return;
+    }
+
+    for(size_t i = 0; i < _indices.size(); ++i)
+    {
+        const Coordinates &coord = _indices[i];
+        os << "index: [";
+        for (size_t j = 0; j < coord.num_dimensions(); ++j)
+        {
+            os << coord[j];
+            if (j < coord.num_dimensions() - 1) os << ", ";
+        }
+        os << "]  values: ";
+        print_values(os, data, i, dense_volume(sparse_dim()));
+    }
+}
+#endif // ARM_COMPUTE_ASSERTS_ENABLED


We already have a print method in https://github.com/ARM-software/ComputeLibrary/blob/main/src/core/ITensor.cpp

We should move this into void ITensor::print(std::ostream &s, IOFormatInfo io_fmt) const.

aestriplex · 2025-10-06

I could move both this print function and CSR's to ITensor. In particular, something like this:

void ITensor::print(std::ostream &s, IOFormatInfo io_fmt) const
{
    // initialize variables and set precision ...
    switch (this->info()->tensor_format())
    {
        case TensorFormat::Dense:
            print_dense(s, io_fmt);
            break;
        case TensorFormat::CSR:
            print_csr(s);
            break;
        case TensorFormat::COO:
            print_coo(s);
            break;
        default:
            ARM_COMPUTE_ERROR("Unsupported tensor format");
            break;
    }

    // Restore output stream flags
    s.copyfmt(stream_status);
}

where print_coo and print_csr are the two functions moved from the two final classes.

The main problem is that those functions use internals of the two classes, for example _indices, _crow_bytes and _col_bytes. Defining them in the base class seems misleading to me. Wouldn't it be cleaner to keep each function in its own class? In that case I would add the IOFormatInfo io_fmt parameter to both, so that they have exactly the same signature as ITensor::print, i.e. they match the base class method exactly.
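
The alternative described here, keeping each print function in its own class with the base-class signature, could look roughly like the sketch below. It assumes the base method can be declared virtual, which should be checked against the actual ITensor declaration. TensorBase, CooTensorStub, IOFormatInfoStub and print_to_string are hypothetical names used only for illustration.

```cpp
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for IOFormatInfo; only one field is modelled.
struct IOFormatInfoStub
{
    std::string element_delim;
};

// Hypothetical base class playing the role of ITensor.
class TensorBase
{
public:
    virtual ~TensorBase() = default;

    // Dense printing stays in the base class.
    virtual void print(std::ostream &s, IOFormatInfoStub io_fmt) const
    {
        (void)io_fmt;
        s << "dense";
    }
};

// Hypothetical COO tensor: print() has the same signature as the base method
// but uses members that only this class knows about.
class CooTensorStub : public TensorBase
{
public:
    CooTensorStub(std::vector<std::vector<int>> indices, std::vector<float> values)
        : _indices(std::move(indices)), _values(std::move(values))
    {
    }

    void print(std::ostream &s, IOFormatInfoStub io_fmt) const override
    {
        for (std::size_t i = 0; i < _indices.size(); ++i)
        {
            s << "index: [";
            for (std::size_t j = 0; j < _indices[i].size(); ++j)
            {
                s << _indices[i][j];
                if (j + 1 < _indices[i].size())
                {
                    s << io_fmt.element_delim;
                }
            }
            s << "] value: " << _values[i] << "\n";
        }
    }

private:
    std::vector<std::vector<int>> _indices;
    std::vector<float>            _values;
};

// Callers holding a base reference get the format-specific output.
std::string print_to_string(const TensorBase &t, const IOFormatInfoStub &fmt)
{
    std::ostringstream os;
    t.print(os, fmt);
    return os.str();
}
```

The trade-off against the switch-based version is that no format enum has to be inspected in the base class, at the cost of a virtual call and of print logic being spread across the tensor classes.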

morgolock · 2025-10-01T14:30:36Z

src/runtime/CSRTensor.cpp

+void CSRTensor::print(std::ostream &os) const
+{
+    const uint8_t *_row_offsets = _allocator.data();
+    const uint8_t *_col_indices = _allocator.data() + _crow_bytes;
+    const uint8_t       *values = _allocator.data() + _crow_bytes + _col_bytes;
+
+    os << "r_offsets: [";
+    for(size_t i = 0; i < _crow_bytes / index_size; ++i)
+    {
+        os << *reinterpret_cast<const int32_t *>(_row_offsets + (i * index_size));
+        if (i < _crow_bytes / index_size - 1)
+        {
+            os << ", ";
+        }
+    }
+    os << "] cols: [";
+    for(size_t i = 0; i < _col_bytes / index_size; ++i)
+    {
+        os << *reinterpret_cast<const int32_t *>(_col_indices + (i * index_size));
+        if (i < _col_bytes / index_size - 1)
+        {
+            os << ", ";
+        }
+    }
+    os << "] values: ";
+    print_values(os, values, 0, nnz());
+}
+#endif // ARM_COMPUTE_ASSERTS_ENABLED


Move this to the existing void ITensor::print(std::ostream &s, IOFormatInfo io_fmt) const.

tests/validation/cpu/unit/SparseTensor.cpp

morgolock · 2025-10-01T14:54:46Z

src/runtime/SparseTensorAllocator.cpp

+
+namespace arm_compute
+{
+SparseTensorAllocator::SparseTensorAllocator(IMemoryManageable *owner) : _owner(owner), _associated_memory_group(nullptr), _memory(), _values_bytes(0), _indices_bytes(0)


I think we should use the existing TensorAllocator to allocate any buffers.

aestriplex · 2025-10-06

I thought a lot about this one. The main reason I created a new class is that the original one (i.e. TensorAllocator) sizes the memory buffer from the TensorShape contained in TensorInfo. For sparse tensors, however, the amount of memory we need to allocate is smaller than the dense size dim_1 x dim_2 x ... x dim_n; otherwise there would be no advantage in using sparse tensors over dense ones.
Since I didn't want to touch the allocation mechanism of TensorAllocator, on which the memory allocation of all other tensors is based, I thought it would be wiser to create a dedicated class.
It has the same internal mechanism (i.e. it inherits from ITensorAllocator, and it uses Memory, MemoryGroup and IMemoryManageable), but it exposes this init method:

void init(const TensorInfo &input, size_t values_bytes, size_t indices_bytes, size_t alignment = 0);
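
To put rough numbers on the size argument, the sketch below compares the bytes a shape-derived dense allocation needs with the bytes a 2D CSR layout needs. The helpers csr_sizes and dense_bytes are hypothetical and not part of ACL. The sketch assumes standard CSR with rows + 1 row offsets and 32-bit indices (matching the int32_t reads in the CSR print code above), and that indices_bytes would cover both the row offsets and the column indices.

```cpp
#include <cstddef>
#include <cstdint>

// Byte counts for a 2D CSR tensor; a hypothetical helper.
struct CsrSizes
{
    std::size_t crow_bytes;   // (rows + 1) row offsets
    std::size_t col_bytes;    // One column index per non-zero
    std::size_t values_bytes; // One element per non-zero

    std::size_t total() const { return crow_bytes + col_bytes + values_bytes; }
};

// Assumes 32-bit indices and the standard rows + 1 offsets.
inline CsrSizes csr_sizes(std::size_t rows, std::size_t nnz, std::size_t element_size)
{
    const std::size_t index_size = sizeof(std::int32_t);
    return CsrSizes{(rows + 1) * index_size, nnz * index_size, nnz * element_size};
}

// Bytes a dense allocation derived from the full 2D shape would need.
inline std::size_t dense_bytes(std::size_t rows, std::size_t cols, std::size_t element_size)
{
    return rows * cols * element_size;
}
```

For a 1000 x 1000 float matrix with 10,000 non-zeros, these helpers give 84,004 bytes for CSR (4,004 + 40,000 + 40,000) against 4,000,000 bytes for the dense allocation.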

Change-Id: I3a3fca2b842702672c915e1357602bf0fe47bd47

morgolock requested changes Oct 1, 2025

cr fixes

6d993d1

Change-Id: I3a3fca2b842702672c915e1357602bf0fe47bd47
