add compaction for batch by primary key fields #1479

typik89 · 2025-03-03T13:47:26Z

Problem

I'm using jdbc sink task to save data to Postgresql. I wanted to use configuration property of jdbc driver reWriteBatchedInserts=true to optimize saving data.
This property toggles converting my separate upsert queries to one multi-value upsert. But there is problem when such multi-value query contains updates for same primary key.

Solution

I think it would be useful to have a feature that, before forming batches of database queries, reduces batches of records by keeping only the most recent record for a primary key.

record(primaryKey,value): (1,1),(2,2),(1,3) -> (2,2),(1,3)

It might be practical not only for my case and potentially could reduce number of queries when there are updates by same primary key.

Testing done:

Unit tests
Integration tests
System tests
Manual tests

confluent-cla-assistant · 2025-03-03T13:47:34Z

❌ Error getting contributor login(s).
Please ensure the email address associated with this commit is added to your Github account.

typik89 · 2025-03-03T22:07:12Z

❌ Error getting contributor login(s). Please ensure the email address associated with this commit is added to your Github account.

done

add compaction for batch by primary key fields

b2fd81d

typik89 requested a review from a team as a code owner March 3, 2025 13:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

add compaction for batch by primary key fields #1479

add compaction for batch by primary key fields #1479

Uh oh!

typik89 commented Mar 3, 2025

Uh oh!

confluent-cla-assistant bot commented Mar 3, 2025

Uh oh!

typik89 commented Mar 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

add compaction for batch by primary key fields #1479

Are you sure you want to change the base?

add compaction for batch by primary key fields #1479

Uh oh!

Conversation

typik89 commented Mar 3, 2025

Problem

Solution

Testing done:

Uh oh!

confluent-cla-assistant bot commented Mar 3, 2025

Uh oh!

typik89 commented Mar 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant