CDRIVER-6055 Audit array allocations #2098

Julia-Garland · 2025-08-26T15:36:18Z

Motivation

"There are several points in the codebase that use bson_malloc(sizeof(T) * N) to allocate arrays of objects. This should not do an in-situ multiplication, since a large value of N will cause integer overflow and result in either an allocation failure or a bogus allocation size. Also, N = 0 can cause issues since malloc(0) is undefined/unspecified."

Summary

Array allocations of the form bson_malloc(sizeof(T) * N) now use a new function bson_array_alloc() which handles multiplication of the two terms internally. Likewise, equivalent calls to 'bson_malloc0(sizeof(T) * N) now usebson_array_alloc0()`.

…nment no longer used

src/libbson/src/bson/memory.h

vector-of-bool · 2025-08-29T15:46:19Z

src/libbson/src/bson/memory.h

+bson_array_alloc(size_t type_size, size_t num_elems);
+BSON_EXPORT(void *)
+bson_array_alloc0(size_t type_size, size_t num_elems);


Since this is very similar to calloc, I recommend consolidating these into a single new function bson_calloc that uses the calloc function on the BSON vmemtable, rather than adding two new public APIs. This will also defer the calloc logic to the underlying allocator, which may have more smarts based on object size.

vector-of-bool · 2025-08-29T15:52:53Z

src/libmongoc/src/mongoc/mongoc-set.c

@@ -27,7 +27,7 @@ mongoc_set_new(size_t nitems, mongoc_set_item_dtor dtor, void *dtor_ctx)
   mongoc_set_t *set = (mongoc_set_t *)bson_malloc(sizeof(*set));

   set->items_allocated = BSON_MAX(nitems, 1);
-   set->items = (mongoc_set_item_t *)bson_malloc(sizeof(*set->items) * set->items_allocated);
+   set->items = (mongoc_set_item_t *)bson_array_alloc(sizeof(*set->items), set->items_allocated);


Since the pattern everywhere is almost always (T*)bson_array_alloc(sizeof(T), N), this is a good time to use a function-like macro:

// Plain function void* _bson_alloc_n_impl(size_t item_size, size_t count); // Macro that does the right thing every time: #define bson_alloc_n(Type, Count) \ ((Type*)_bson_alloc_n_impl(sizeof(Type), (Count))

This also ensures the returned pointer type is correctly used, rather than allowing the implicit-cast from void*.

Since most of the original function calls I changed were to bson_malloc, not bson_malloc0, would consolidating into a calloc wrapper where memory is always zeroed out be cause for any efficiency concerns?

I would rather keep a distinct non-zero alloc for cases that are performance sensitive. For example allocations in mongoc-set.c might be arbitrarily large. Maybe add a bson_alloc_n0?

Julia-Garland added 2 commits August 26, 2025 10:28

Add bson_array_alloc function

09ffe7e

Replace in situ multiplication with array_alloc function call

b06e491

Julia-Garland self-assigned this Aug 26, 2025

Julia-Garland added 2 commits August 26, 2025 12:06

Use correct alignof macro

cccae5d

Split bson_alloc_array into two versions for bson_malloc0 calls; alig…

b4c8bb8

…nment no longer used

Julia-Garland force-pushed the audit-array-allocations.cdriver-6055 branch from 6cdb790 to b4c8bb8 Compare August 27, 2025 14:04

Julia-Garland added 2 commits August 27, 2025 11:30

Fix incorrect call

34fa159

Create docs page for bson_alloc_array(0)

de6fd91

Julia-Garland marked this pull request as ready for review August 27, 2025 16:17

Julia-Garland requested a review from a team as a code owner August 27, 2025 16:17

Julia-Garland requested review from connorsmacd and vector-of-bool August 27, 2025 16:17

Julia-Garland added 2 commits August 27, 2025 12:48

Extend title underline

1a955ac

Fix function name in docs

fcd73e6

Julia-Garland force-pushed the audit-array-allocations.cdriver-6055 branch from 0342bfe to fcd73e6 Compare August 27, 2025 17:33

connorsmacd requested changes Aug 28, 2025

View reviewed changes

src/libbson/src/bson/memory.h Show resolved Hide resolved

vector-of-bool requested changes Aug 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CDRIVER-6055 Audit array allocations #2098

CDRIVER-6055 Audit array allocations #2098

Julia-Garland commented Aug 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

vector-of-bool Aug 29, 2025

Uh oh!

vector-of-bool Aug 29, 2025

Uh oh!

Julia-Garland Aug 29, 2025

Uh oh!

kevinAlbs Sep 2, 2025

Uh oh!

Uh oh!

CDRIVER-6055 Audit array allocations #2098

Are you sure you want to change the base?

CDRIVER-6055 Audit array allocations #2098

Conversation

Julia-Garland commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Summary

Uh oh!

Uh oh!

vector-of-bool Aug 29, 2025

Choose a reason for hiding this comment

Uh oh!

vector-of-bool Aug 29, 2025

Choose a reason for hiding this comment

Uh oh!

Julia-Garland Aug 29, 2025

Choose a reason for hiding this comment

Uh oh!

kevinAlbs Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Julia-Garland commented Aug 26, 2025 •

edited

Loading