chore: flush after serializing big string #5401

Merged: 4 commits merged into main on Jul 17, 2025

Conversation

kostasrim (Contributor):

Entries of a bucket serialized during a snapshot may contain large strings that cross the serialization threshold enforced by FlushIfNeeded(). If the next element in the bucket is a big value (say, a hash table), we would first serialize the first element of that container and only then flush, which can lead to memory pressure. To avoid that, we now call FlushIfNeeded() after serializing each string.

Resolves #5394
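
For context, here is a minimal sketch of the scenario and the fix. It is not the actual RdbSerializer code: the class, the bucket loop, and the threshold are simplified stand-ins, and only FlushIfNeeded / FlushState::kFlushEndEntry correspond to the real call shown in the diff below.

#include <cstddef>
#include <string>
#include <vector>

// Simplified stand-in for the serializer; only the FlushIfNeeded call mirrors the real change.
enum class FlushState { kFlushEndEntry };

class BucketSerializerSketch {
 public:
  void SerializeBucket(const std::vector<std::string>& entries) {
    for (const auto& entry : entries) {
      buf_.append(entry);  // a big string can push buf_ well past the flush threshold
      // Before this change, the oversized buffer would only be flushed after the *next*
      // entry started serializing (e.g. after the first field of a hash), raising peak
      // memory. Flushing right after each string keeps the buffer bounded.
      FlushIfNeeded(FlushState::kFlushEndEntry);
    }
  }

 private:
  void FlushIfNeeded(FlushState) {
    if (buf_.size() >= kFlushThreshold)
      buf_.clear();  // the real serializer writes the buffer out to the sink here
  }

  static constexpr std::size_t kFlushThreshold = 32 * 1024;  // made-up limit for illustration
  std::string buf_;
};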

@kostasrim kostasrim self-assigned this Jul 2, 2025
// We flush here because if the next element in the bucket we are serializing is a container,
// it will first serialize the first entry and then flush the internal buffer, even if
// crossed the limit.
FlushIfNeeded(FlushState::kFlushEndEntry);
Contributor Author:

@adiholden This is a general issue, not only limited to strings, right? The same applies to other data types that are not chunked (treated as big values)? For example, we don't flush after we serialize a JSON value. What if all the elements in the bucket are JSON data types? Shall I take care of that as well?

Contributor Author:

P.S. I could also add a test, but 🤷

Contributor:

Why here and not in RdbSerializer::SaveEntry, at the end of the function? That way you can be sure it is actually after you finished saving the entry and not in the middle, in case there is some other serialization after SaveValue (which I see we don't have today, but might have in the future). By moving it to the end of SaveEntry it will also apply to all data types.
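
Roughly what this suggestion amounts to, as a hedged sketch; the function shapes and bodies below are made up for illustration, only the idea of a single flush at the end of the entry reflects the comment:

#include <string_view>
#include <system_error>

enum class FlushState { kFlushEndEntry };

// Stand-ins for the real serialization steps.
std::error_code SaveKey(std::string_view) { return {}; }
std::error_code SaveValue(std::string_view) { return {}; }
void FlushIfNeeded(FlushState) { /* flush the internal buffer to the sink if over the limit */ }

std::error_code SaveEntry(std::string_view key, std::string_view value) {
  if (auto ec = SaveKey(key)) return ec;
  if (auto ec = SaveValue(value)) return ec;
  // Flushing once at the end of the entry covers every data type (strings, json, ...)
  // and any serialization step that might be added after SaveValue in the future.
  FlushIfNeeded(FlushState::kFlushEndEntry);
  return {};
}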

Contributor:

P.S. Add a test.

@@ -64,7 +64,7 @@ size_t SliceSnapshot::GetThreadLocalMemoryUsage() {
}

bool SliceSnapshot::IsSnaphotInProgress() {
-  return tl_slice_snapshots.size() > 0;
+  return !tl_slice_snapshots.empty();
Contributor Author:

I could not resist; I will refactor all those cases eventually.

@kostasrim kostasrim requested a review from adiholden July 2, 2025 09:06
await wait_for_replicas_state(c_replica)

# make sure capacity hasn't changed after seeding
new_capacity = await get_memory(c_master, "prime_capacity")
Contributor:

Let's move this right after line 3419.

@dfly_args({"proactor_threads": 1})
async def test_big_strings(df_factory):
master = df_factory.create(proactor_threads=1, serialization_max_chunk_size=1)
replica = df_factory.create(proactor_threads=1, serialization_max_chunk_size=1)
Contributor:

Nit: serialization_max_chunk_size has no effect on the replica.

# inbetween. This is not a great test for this but we are limited because we can't fill
# a bucket with big strings as the memory in the gh runner is fairly limited. We at
# least check for correctness and *some* improvement in the memory foot print.
assert peak_memory < used_memory + five_mb
Contributor:

I feel this is not very stable. If we want to check that we actually flushed the data, we can add a statistic for the serializer's peak buffered bytes (updated every time we call SerializerBase::FlushToSink), print it at the end of replication as part of the statistics, and read it in this test to make sure the buffered bytes never exceed a single entry's size.
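
One possible shape for such a statistic, as a sketch; all names below are hypothetical, only SerializerBase::FlushToSink is referenced from the comment above:

#include <algorithm>
#include <cstddef>

// Hypothetical counter: remember the largest buffer ever handed to the sink, so a test
// can assert the serializer never buffered much more than one entry at a time.
class SerializerPeakBytesSketch {
 public:
  // Would be called at the start of SerializerBase::FlushToSink with the current buffer size.
  void OnFlushToSink(std::size_t buffered_bytes) {
    peak_flush_bytes_ = std::max(peak_flush_bytes_, buffered_bytes);
  }

  // Printed at the end of replication as part of the statistics and read by the test.
  std::size_t peak_flush_bytes() const { return peak_flush_bytes_; }

 private:
  std::size_t peak_flush_bytes_ = 0;
};

The test could then read this value after replication finishes and assert it stays within a single big entry's size, instead of comparing peak memory against used memory plus a 5 MB slack.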

@kostasrim kostasrim requested a review from adiholden July 17, 2025 07:28
@kostasrim kostasrim merged commit da59e40 into main Jul 17, 2025
10 checks passed
@kostasrim kostasrim deleted the kpr10 branch July 17, 2025 12:34
Successfully merging this pull request may close these issues.

Big value serialization - Add Flushing Support While Iterating Bucket Entries