KAFKA-17509: Introduce a delayed action queue to complete purgatory actions outside purgatory. #17177
base: trunk
Conversation
I've checked the 3 test failures. They are unrelated to the PR. I ran all of them locally and they all passed.
Thanks for the patch @adixitconfluent!
Here's my understanding of the current share fetch handling
- KafkaApis is calling into SPM to enqueue a share request
- SPM#maybeProcessFetchQueue runs recursively (! 🙀) until the queue is empty
- On each iteration, we get a share fetch request off the queue, do some validation and enqueue a DelayedShareFetch
Since adding the DelayedShareFetch to the purgatory is non-blocking, I'm pretty sure we are essentially not using the fetch queue any more. Or rather, we are now using the DelayedShareFetch purgatory as a fetch queue (which was the goal of the refactoring, after all).
For fetchQueue, I don't see too many remaining usages:
- Adding in SPM#fetchMessages (from KafkaApis)
- completeExceptionally in SPM#close
- Polling in SPM#maybeProcessFetchQueue
Since this closely matches our DelayedShareFetch usage, I'm wondering if we can remove the fetchQueue code in this PR.
WDYT?
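To make the recursion concrete, here is a minimal, self-contained Java sketch of the drain pattern described above (FetchQueueSketch and its Runnable payloads are hypothetical stand-ins; the real logic lives in SharePartitionManager#maybeProcessFetchQueue and watches a DelayedShareFetch instead of running a Runnable):

```java
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

// Illustrative sketch only: each enqueued item stands in for a share fetch
// request plus the validation and purgatory hand-off done in the real code.
class FetchQueueSketch {
    private final Queue<Runnable> fetchQueue = new ConcurrentLinkedQueue<>();

    void enqueue(Runnable request) {
        fetchQueue.add(request);
        maybeProcessFetchQueue();
    }

    // Recurses until the queue is drained. In the real code each iteration
    // ends with a non-blocking tryCompleteElseWatch, so nothing ever waits
    // in the queue for long -- which is why it looks removable.
    private void maybeProcessFetchQueue() {
        Runnable request = fetchQueue.poll();
        if (request == null) {
            return;
        }
        request.run();
        if (!fetchQueue.isEmpty()) {
            maybeProcessFetchQueue();
        }
    }
}
```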
```java
// then we should check if there is a pending share fetch request for the topic-partition and complete it.
// We add the action to delayed actions queue to avoid an infinite call stack, which could happen if
// we directly call delayedShareFetchPurgatory.checkAndComplete
delayedActionQueue.add(() -> {
```
@adixitconfluent I'm a little confused by the async code here. We are gathering some futures in ShareFetchUtils#processFetchResponse, but when I look down into SharePartition#acquire it's all synchronous/blocking code (it just returns a completed CompletableFuture).
Is this a leftover from the refactoring? Or do we intend to make SharePartition#acquire async?
I ask this because if we're not keeping the CompletableFuture return type in SharePartition#acquire, we can fix it in this PR and avoid some complexity here.
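For illustration, a hedged sketch of the two method shapes being compared (the names below are made up; this is not the actual SharePartition code):

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;

// Hypothetical contrast of the two shapes discussed above.
class AcquireSketch {
    // Current shape as described: blocking work wrapped in an already-completed future.
    CompletableFuture<List<String>> acquireWrapped() {
        return CompletableFuture.completedFuture(doSynchronousAcquire());
    }

    // Simplified shape if the future return type is dropped: callers lose the
    // async plumbing around what is really synchronous code.
    List<String> acquireDirect() {
        return doSynchronousAcquire();
    }

    private List<String> doSynchronousAcquire() {
        return List.of("acquired-batch");
    }
}
```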
@mumrah, we created JIRA https://issues.apache.org/jira/browse/KAFKA-17522 earlier to track this issue. Yes, it makes sense that the share partition acquire functionality need not return a future. I am not sure whether I should cover it in this PR itself.
@apoorvmittal10, any thoughts on whether we should cover it in this PR? Or, since the JIRA is assigned to you and you may already be working on it, we could have another PR for the resolution.
Hi @mumrah,
You're right, we don't need the fetch queue. I have created JIRA https://issues.apache.org/jira/browse/KAFKA-17545 for it, and will prioritize it in the coming PRs.
@adixitconfluent : Thanks for the PR. Added a few comments.
1 comment for my knowledge.
```scala
 */
def addCompleteDelayedShareFetchPurgatoryAction(topicIdPartitions: Seq[TopicIdPartition],
                                                groupId: String,
                                                delayedShareFetchPurgatory: DelayedOperationPurgatory[DelayedShareFetch]): Unit = {
```
Just for my knowledge, would it not be better to declare delayedShareFetchPurgatory in the replica manager itself, like the other purgatories, and define methods there to append the requests? I see a lot of purgatories are already defined there.
Hi @apoorvmittal10, we can do it, but I'm not sure whether it is good to add more Scala code. If we declare it in replicaManager, we'll also have to write wrapper functions in replicaManager for delayedShareFetchPurgatory.tryCompleteElseWatch and delayedShareFetchPurgatory.checkAndComplete, which are going to be utilized in SharePartitionManager and SharePartition. Additionally, we'll need to pass a replicaManager object to the SharePartition class as well, so it's an added dependency. Considering all these factors, I feel it is better to declare it within SharePartitionManager. Let me know if my thoughts don't make sense.
I agree with your thoughts, and they do make sense.
But I am skeptical about having a single purgatory outside the replica manager; if we had other instances elsewhere then it would be fine. The replica manager already carries the metrics and the standard boilerplate to shut down purgatories and related instances. I know you could handle them separately as well, but it might be better to have them in a single place.
I can see the list offset purgatory was added there recently as well, so I think it might be OK to have the Scala code in the old classes. I am not an expert in the area and will leave it to @junrao or @mumrah to decide. I just found the API to pass the delayed purgatory into the replica manager a bit odd.
@adixitconfluent : Thanks for the updated PR. Added one more comment.
```java
// then we should check if there is a pending share fetch request for the topic-partition and complete it.
// We add the action to delayed actions queue to avoid an infinite call stack, which could happen if
// we directly call delayedShareFetchPurgatory.checkAndComplete
replicaManager.addCompleteDelayedShareFetchPurgatoryAction(CollectionConverters.asScala(result.keySet()).toSeq(), shareFetchData.groupId(), delayedShareFetchPurgatory);
```
It's a bit weird to add the delayedAction through ReplicaManager. Perhaps we could create DelayedActionQueue in KafkaApis and pass it to both ReplicaManager and DelayedShareFetch.
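A hypothetical sketch of that wiring (the real KafkaApis and ReplicaManager are Scala; the record types below are illustrative stand-ins):

```java
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

// Hypothetical wiring only: a single shared action queue is handed to both
// components, so DelayedShareFetch no longer schedules actions through
// ReplicaManager.
class SharedQueueWiring {
    record ReplicaManagerLike(Queue<Runnable> actionQueue) {}
    record DelayedShareFetchLike(Queue<Runnable> actionQueue) {}

    public static void main(String[] args) {
        Queue<Runnable> delayedActionQueue = new ConcurrentLinkedQueue<>();
        ReplicaManagerLike replicaManager = new ReplicaManagerLike(delayedActionQueue);
        DelayedShareFetchLike delayedShareFetch = new DelayedShareFetchLike(delayedActionQueue);
        // Both now enqueue into and drain from the same queue instance.
    }
}
```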
About
In reference to comment #16969 (comment), I have introduced a DelayedActionQueue to add purgatory actions and try to complete them. Actions are added to the DelayedActionQueue when partition locks are released after a fetch in forceComplete. Code has also been added to onExpiration to check the delayed actions queue and try to complete it. Since onExpiration serves as a callback for forceComplete, it should not lead to an infinite call stack. Also fixed the failures in DelayedShareFetchTest, which were occurring due to insufficient mocking.
Testing
The code has been tested with the help of unit tests.
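For context, a minimal sketch of the action-queue pattern the description refers to (a hypothetical stand-in, not the actual kafka.server.DelayedActionQueue API):

```java
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

// Stand-in for the delayed action queue: completions are enqueued rather than
// invoked inline, so completing one DelayedShareFetch never re-enters the
// purgatory on the same call stack.
class DelayedActionQueueSketch {
    private final Queue<Runnable> actions = new ConcurrentLinkedQueue<>();

    void add(Runnable action) {
        actions.add(action);
    }

    // Drained from a safe point, such as the onExpiration callback of
    // forceComplete, after the original stack has unwound.
    void tryCompleteActions() {
        Runnable action;
        while ((action = actions.poll()) != null) {
            action.run(); // e.g. purgatory.checkAndComplete(key)
        }
    }
}
```

Draining only from the onExpiration/forceComplete callback keeps checkAndComplete calls off the stack that produced them, which is exactly the infinite-call-stack concern raised in the review.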