
Batch Kafka consumer #151

Closed
wants to merge 2 commits into from
Conversation

Contributor

@z0isch z0isch commented Mar 6, 2024

Add ability to handle batches for our kafka consumers

@z0isch z0isch requested a review from pbrisbin March 6, 2024 15:41
```
@@ -164,31 +164,39 @@ runConsumer
     , HasCallStack
     )
  => Timeout
  -> Int
  -> (a -> m ())
```
Contributor Author

@z0isch z0isch Mar 6, 2024
Is this the right shape for the handler? Should it be [a] -> m () instead?

Seems like it'd be more flexible for the client to do "smart" things if we hand them the entire batch. I'm thinking of things like running concurrent actions or grouping related messages together.

Member

Good question. I defer to you.

Practically speaking, I'd put my money on consumers not doing smart things, but it's also trivial for them to do their own traverse_.
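The "do their own `traverse_`" point can be sketched in a few lines (names here are hypothetical, not code from this PR): given the per-message shape `a -> m ()`, a client recovers batch-at-once behavior for free, whereas going from `[a] -> m ()` back to per-message offset handling is what gets lost.

```haskell
import Data.Foldable (traverse_)
import Data.IORef (IORef, modifyIORef', newIORef, readIORef)

-- A client's per-message handler (the `a -> m ()` shape); here it
-- just records each event so we can observe processing order.
recordEvent :: IORef [Int] -> Int -> IO ()
recordEvent ref x = modifyIORef' ref (x :)

-- "Doing their own traverse_": a batch handler derived trivially
-- from a per-message one, as suggested in the review.
asBatchHandler :: (a -> IO ()) -> [a] -> IO ()
asBatchHandler = traverse_

main :: IO ()
main = do
  ref <- newIORef []
  asBatchHandler (recordEvent ref) [1, 2, 3]
  events <- readIORef ref
  print (reverse events)  -- [1,2,3]
```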

Contributor Author

I think [a] -> m () also implies that we can't store or commit offsets on each event, so I guess for the first attempt at this I can go with the a -> m () shape and see what happens.

Member

@pbrisbin pbrisbin left a comment

So this re-introduces some duplicate processing. If we're deploying, a second consumer can come up and would reprocess any messages the existing consumers have stored but not yet committed, which is up to batchSize messages.

Seems like an OK middle ground compared with how it was before. However, I still feel like we could've arrived at roughly the same behavior in this regard by simply reducing the auto-commit interval so commits happen more frequently, no?
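For context, the auto-commit knob referred to here is standard librdkafka consumer configuration (the keys are real; the values below are illustrative, not from this PR):

```
enable.auto.commit=true
# default is 5000; a smaller interval shrinks the window of stored-but-
# uncommitted offsets (and thus duplicate processing) with no code changes
auto.commit.interval.ms=1000
```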

@pbrisbin
Member

pbrisbin commented Mar 6, 2024

If we wanted to keep the reduction in duplicate processing, I think still doing a store+commit on every message, but now with batching, could also help things... since you're polling less frequently it would take pressure off the broker even though you're committing more frequently. WDYT?
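The trade-off in this suggestion can be sketched with a small pure model (all names hypothetical; this is not the library's API): polling in batches cuts broker round trips, while committing per message instead of per batch bounds what a crashed or redeployed consumer reprocesses.

```haskell
import Data.List (unfoldr)

-- Split the message stream into poll batches of size n: one broker
-- round trip per batch instead of one per message.
batchesOf :: Int -> [a] -> [[a]]
batchesOf n = unfoldr (\xs -> if null xs then Nothing else Just (splitAt n xs))

-- Model a consumer that crashes after handling some messages. On
-- restart, everything after the last committed offset is redelivered:
-- with commit-per-message nothing is, with commit-per-batch the
-- uncommitted tail of the current batch is (up to batchSize messages).
redeliveredAfterCrash
  :: Bool  -- commit per message (True) or only per batch (False)?
  -> Int   -- batch size
  -> Int   -- messages handled before the crash
  -> Int   -- messages redelivered on restart
redeliveredAfterCrash perMessage batchSize handled
  | perMessage = 0
  | otherwise = handled `mod` batchSize

main :: IO ()
main = do
  print (length (batchesOf 100 [1 .. 1000 :: Int]))  -- 10 polls instead of 1000
  print (redeliveredAfterCrash True 100 250)         -- 0
  print (redeliveredAfterCrash False 100 250)        -- 50
```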

@z0isch
Contributor Author

z0isch commented Mar 6, 2024

> If we wanted to keep the reduction in duplicate processing, I think still doing a store+commit on every message, but now with batching, could also help things... since you're polling less frequently it would take pressure off the broker even though you're committing more frequently. WDYT?

Probably worth a try to see what happens; however, a client could still get this behavior with batchSize = 1, right?

edit: Oh, I guess we'd lose the poll reduction then.

@z0isch
Contributor Author

z0isch commented Mar 8, 2024

Closing this, as performance is back to normal with auto-commits enabled for the answers consumer.

It seems like there is just a trade-off between the size of the batch of offset commits and reprocessing events. For our current consumer, reprocessing is fine due to the idempotency of the consumer.

It also seems like there is just going to be a performance cost to pay when deploying new consumers that is unrelated to reprocessing events. The hit seems likely due to the Kafka rebalance when adding/removing consumers in a consumer group, not due to reprocessing a handful (hundreds) of events.

@z0isch z0isch closed this Mar 8, 2024
@pbrisbin
Member

pbrisbin commented Mar 8, 2024

> It also seems like there is just going to be a performance cost to pay when deploying new consumers that is unrelated to reprocessing events

Did slowing the rotation from double-then-halve to plus-one-minus-one solve it?

@z0isch
Contributor Author

z0isch commented Mar 8, 2024

> It also seems like there is just going to be a performance cost to pay when deploying new consumers that is unrelated to reprocessing events

> Did slowing the rotation from double-then-halve to plus-one-minus-one solve it?

Nope, it just made it have 3 blips instead of one. I think this makes sense because there were probably 3 rebalances.
[image: metrics graph showing three performance blips, one per rebalance]

AFAICT the only way to avoid these performance issues is to not rebalance at all, which I think implies some sort of static partition assignment for the consumers. That would require a lot of coordination complexity between the consumers, though, so we'd need to weigh it against how important it is to avoid these spikes.
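One way "static partition assignment" could look (an entirely hypothetical sketch, not code from this repo): each consumer is configured with a fixed index and derives its partitions deterministically, so no group rebalance ever happens; the cost is that the index and consumer count must be coordinated out of band, which is the complexity mentioned above.

```haskell
-- Hypothetical static assignment: consumer i of n owns every partition p
-- with p `mod` n == i. Deterministic, so no rebalance is needed, but all
-- consumers must agree on (i, n) out of band.
staticAssignment
  :: Int    -- total number of consumers, n
  -> Int    -- this consumer's index, i (0-based)
  -> [Int]  -- all partition ids of the topic
  -> [Int]  -- partitions this consumer should read
staticAssignment n i = filter (\p -> p `mod` n == i)

main :: IO ()
main = do
  let partitions = [0 .. 11]
  print (staticAssignment 3 0 partitions)  -- [0,3,6,9]
  print (staticAssignment 3 1 partitions)  -- [1,4,7,10]
```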

@z0isch z0isch deleted the aj/batch-kafka-consumer branch March 15, 2024 18:00