Conversation

@tanishq-chugh (Contributor)

…e of HMS instance running initiator crash

What changes were proposed in this pull request?

Compactor cleaner fix to address duplicate directories created when multiple jobs run the same compaction

Why are the changes needed?

To address the race condition where multiple jobs running the same compaction lead to duplicate data

Does this PR introduce any user-facing change?

No

How was this patch tested?

Manual Testing after reproducing the issue on a deployed cluster
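
To illustrate the race (the write-id range and visibility txn ids below match the test case added later in this PR; directory names are simplified rather than Hive's zero-padded form): when two workers pick up the same compaction, the partition can end up with two compacted deltas that cover the same write-id range and differ only in their visibility txn id, for example

    <table>/<partition>/
        delta_22_23_v24    <- written by the first worker
        delta_22_23_v25    <- written by a second worker that ran the same compaction

The visibilityTxnId tie-break in the comparator shown below gives such overlapping duplicates a deterministic order, which appears to be what allows the cleaner to deal with them consistently.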

        return 1;
      }
    }
    else if (visibilityTxnId != parsedDelta.visibilityTxnId) {
      return visibilityTxnId < parsedDelta.visibilityTxnId ? 1 : -1;
    }
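
Read on its own, this tie-break sorts the delta with the higher visibilityTxnId first whenever two deltas agree on everything else. Below is a minimal, self-contained sketch of that ordering; DeltaStub, DeltaOrderSketch and the minWriteId/maxWriteId comparison rules are illustrative assumptions rather than the actual Hive ParsedDelta code, and only the visibilityTxnId branch mirrors the change above.

import java.util.Arrays;

// Illustrative stand-in for a parsed compacted delta; only the fields needed for ordering.
final class DeltaStub implements Comparable<DeltaStub> {
    final long minWriteId;
    final long maxWriteId;
    final long visibilityTxnId;

    DeltaStub(long minWriteId, long maxWriteId, long visibilityTxnId) {
        this.minWriteId = minWriteId;
        this.maxWriteId = maxWriteId;
        this.visibilityTxnId = visibilityTxnId;
    }

    @Override
    public int compareTo(DeltaStub other) {
        if (minWriteId != other.minWriteId) {
            return Long.compare(minWriteId, other.minWriteId);   // assumption: lower minWriteId sorts first
        }
        if (maxWriteId != other.maxWriteId) {
            return Long.compare(other.maxWriteId, maxWriteId);   // assumption: wider delta sorts first
        }
        if (visibilityTxnId != other.visibilityTxnId) {
            // Mirrors the change above: the delta with the higher visibilityTxnId sorts first.
            return visibilityTxnId < other.visibilityTxnId ? 1 : -1;
        }
        return 0;
    }

    @Override
    public String toString() {
        return "delta_" + minWriteId + "_" + maxWriteId + "_v" + visibilityTxnId;
    }
}

public class DeltaOrderSketch {
    public static void main(String[] args) {
        // Two compacted deltas covering the same write-id range, produced by two
        // workers that ran the same compaction (same ids as in the test case below).
        DeltaStub[] deltas = {
            new DeltaStub(22L, 23L, 24L),
            new DeltaStub(22L, 23L, 25L)
        };
        Arrays.sort(deltas);
        // Prints [delta_22_23_v25, delta_22_23_v24]: a deterministic order for the duplicates.
        System.out.println(Arrays.toString(deltas));
    }
}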

@tanishq-chugh (Contributor Author)

Made this change in 8711dcd

Member

we shouldn't allow concurrent compaction with above properties

@kuczoram (Contributor)

Hi @tanishq-chugh ,
thanks a lot for this PR. It looks good, there is only one thing I am missing. Could you please add a test case for this scenario? I think in the TestCleaner tests you can define a directory structure and see if the cleaner properly cleans it up.

@tanishq-chugh (Contributor Author)

Hi @kuczoram
Thanks for pointing this out. I have added a new test case in TestCleaner for this scenario.
Added in commit - 611d9a7


@kuczoram left a comment

Thanks a lot @tanishq-chugh! It looks good to me.

@kuczoram merged commit 96cf347 into apache:master on Sep 29, 2025
2 checks passed
@deniskuzZ (Member) commented on Sep 29, 2025

hi @tanishq-chugh, how could we get into the situation with concurrent minor compactions? If that is related to compaction running on HMS, I already mentioned to you that we should just drop the support (it's already deprecated):
#6068 (comment)

    && next.minWriteId == prev.minWriteId
-   && next.statementId == prev.statementId) {
+   && next.statementId == prev.statementId
+   && (next.isDeleteDelta || prev.isDeleteDelta)) {
@deniskuzZ (Member), Sep 29, 2025

why is this change needed?

@tanishq-chugh (Contributor Author)

Hi @deniskuzZ
This is not related to compaction running on HMS, but happens when compaction workers are indeed running on HS2 itself.
I have added the detailed information regarding how it happens in the description of the Hive JIRA: HIVE-29210


// Overlapping compacted deltas with different visibilityTxnIDs simulating concurrent compaction from two workers
addDeltaFile(t, null, 22L, 23L, 2, 24);
addDeltaFile(t, null, 22L, 23L, 2, 25);

Member

I don't think we should allow this to happen in the first place

@deniskuzZ (Member)

> Hi @deniskuzZ This is not related to compaction running on HMS, but happens when compaction workers are indeed running on HS2 itself. I have added the detailed information regarding how it happens in the description of the Hive JIRA: HIVE-29210

@tanishq-chugh HIVE-29210 mentions HMS local workers

@tanishq-chugh (Contributor Author)

@deniskuzZ
> In a case with multiple HiveServer2 (HS2) instances, one of the HS2 instances may run on the same host as the Hive Metastore (HMS). In this setup, the initiator runs within HMS, while the compaction worker threads run within HS2.

The JIRA mentions the initiator in HMS, but the worker threads run within HS2.
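
For context, the topology described above roughly corresponds to settings along these lines (these are standard Hive compaction properties, but the concrete values are an assumption about such a deployment, not taken from the reported cluster):

    # on the node that hosts HMS (and, in this setup, also one of the HS2 instances)
    hive.compactor.initiator.on=true      # initiator/cleaner threads run in the metastore
    hive.metastore.runworker.in=hs2       # compaction workers run inside HS2 rather than in HMS
    hive.compactor.worker.threads=4       # number of compaction worker threads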

@deniskuzZ (Member)

@tanishq-chugh, revokeFromLocalWorkers was only needed when HMS had local workers. As we remove that support, it should just be dropped instead of adding workarounds in the code
