
Are you using this in production yourself yet? #3

Open
ulrikjohansson opened this issue Feb 23, 2019 · 2 comments

Comments

@ulrikjohansson

Hi!

I just stumbled across this schema-registry/serializer project because I was about to implement one myself =)

Where I work, we have a working serializer/deserializer talking to the Confluent Schema Registry, but it's a bit of a hack.
The schema fetching is done separately in a startup script before starting the consumer/producer, and the schemas are put into the aiohttp app container, so we have to lug that around everywhere.
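
To make that concrete, the pre-fetch hack looks roughly like this (a simplified sketch, not our real code; the registry URL and subject names are placeholders):

```python
import json

import aiohttp
from aiohttp import web

REGISTRY_URL = "http://schema-registry:8081"  # placeholder
SUBJECTS = ["orders-value", "payments-value"]  # placeholders


async def prefetch_schemas(app: web.Application) -> None:
    """Fetch the latest schema for each subject and stash it on the app."""
    schemas = {}
    async with aiohttp.ClientSession() as session:
        for subject in SUBJECTS:
            url = f"{REGISTRY_URL}/subjects/{subject}/versions/latest"
            async with session.get(url) as resp:
                resp.raise_for_status()
                body = await resp.json(content_type=None)
            # The registry returns the Avro schema as a JSON-encoded string.
            schemas[subject] = {
                "id": body["id"],
                "schema": json.loads(body["schema"]),
            }
    app["schemas"] = schemas


app = web.Application()
app.on_startup.append(prefetch_schemas)
```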

So I was wondering: is this library ready for use yet? I noticed it's very new (January of this year).

@jonathansick
Member

Hey @ulrikjohansson, I'm happy you found this! Yeah, it's super new, but we're starting to use it in apps that I'd characterize as prototypes/betas. I'm fairly certain Kafkit will become a core part of our infrastructure. We're building it at the same time as we're adopting Kafka, so we're still figuring out best practices.

An example of a producer is our Slack listener for chatops:
https://github.com/lsst-sqre/sqrbot-jr

The serializers are set up here: https://github.com/lsst-sqre/sqrbot-jr/blob/master/sqrbot/avroformat.py and the handler that converts an incoming HTTP event to a Kafka message is here: https://github.com/lsst-sqre/sqrbot-jr/blob/master/sqrbot/handlers/event.py
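
In rough outline, that handler does something like this (a simplified sketch, not the actual sqrbot-jr code; the topic name and app keys are placeholders, and it assumes a serializer callable, e.g. a Kafkit Serializer, plus an already-started aiokafka producer stored on the app):

```python
from aiohttp import web


async def handle_event(request: web.Request) -> web.Response:
    """Turn an incoming HTTP event into an Avro-encoded Kafka message."""
    event = await request.json()

    # Assumed to be set up at app startup: a serializer callable that returns
    # Confluent wire-format bytes, and an already-started AIOKafkaProducer.
    serializer = request.app["serializer"]
    producer = request.app["producer"]

    data = serializer({"channel": event["channel"], "text": event["text"]})
    await producer.send_and_wait("slack.events", data)  # placeholder topic

    return web.json_response({"status": "queued"})
```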

My thought so far is to keep the original schemas in the producer apps and make them responsible for registering those schemas. This way we don't have to bake schema IDs into the producer apps (although you could do that if you wanted).
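
The register-at-startup idea boils down to something like this (a sketch following the RegistryApi usage in the Kafkit docs; the schema and URL are placeholders, so double-check the names against https://kafkit.lsst.io):

```python
import aiohttp
from kafkit.registry.aiohttp import RegistryApi

# Placeholder schema owned by the producer app.
SCHEMA = {
    "type": "record",
    "name": "SlackEvent",
    "namespace": "example.events",
    "fields": [
        {"name": "channel", "type": "string"},
        {"name": "text", "type": "string"},
    ],
}


async def register_schema() -> int:
    """Register the producer's own schema and return the registry-assigned ID."""
    async with aiohttp.ClientSession() as session:
        registry = RegistryApi(session=session, url="http://schema-registry:8081")
        # Re-registering an identical schema is idempotent: the registry
        # returns the existing ID, so it's safe to do this on every startup.
        schema_id = await registry.register_schema(SCHEMA)
    return schema_id
```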

An example of a consumer app is https://github.com/lsst-sqre/templatebot and specifically https://github.com/lsst-sqre/templatebot/blob/master/templatebot/slacklistener.py

If you want to use it, I'd suggest pinning the version because it's so new. If you'd like to contribute code, that'd be great too! It might be good to open an issue before writing code to make sure it doesn't conflict with something we've got coming down the pipe. API docs are at https://kafkit.lsst.io

@ulrikjohansson
Author

ulrikjohansson commented Feb 25, 2019

I'll take a look at those links; thanks for the thorough introduction!

Our setup looks roughly like this at the moment (we have one big monolith repo, with a bunch of smaller service repos budding off it):

  1. Specify schemas for topics in a separate file in "the big monolith repo". The rule is one schema per topic.
  2. Load schemas into the schema registry at monolith deploy time. Registry compatibility is set to FULL_TRANSITIVE, so all schema versions should be backwards and forwards compatible (a rough sketch of these registry calls follows this list).
  3. Producers load the schema(s) from the registry for the topic(s) they want to produce to at producer startup.
  4. Consumers do the same.
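
For reference, the deploy-time step in item 2 is essentially two Schema Registry REST calls, roughly like this (URL, subject, and schema are placeholders):

```python
import json

import aiohttp

REGISTRY_URL = "http://schema-registry:8081"  # placeholder


async def push_schema(session: aiohttp.ClientSession, subject: str, schema: dict) -> int:
    """Set FULL_TRANSITIVE compatibility for a subject, then register its schema."""
    # Enforce full transitive compatibility for this subject.
    async with session.put(
        f"{REGISTRY_URL}/config/{subject}",
        json={"compatibility": "FULL_TRANSITIVE"},
    ) as resp:
        resp.raise_for_status()

    # Register the schema; the registry returns its ID (new or existing).
    async with session.post(
        f"{REGISTRY_URL}/subjects/{subject}/versions",
        json={"schema": json.dumps(schema)},
    ) as resp:
        resp.raise_for_status()
        body = await resp.json(content_type=None)
    return body["id"]
```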

This process is a result both of our Kafka journey starting in the monolith and of the fact that we began pre-fetching schemas because that was the quickest way to get things going.

This workflow is starting to hurt us, though: it slows down development of new services, and the services that own the topics/schemas don't have the topic or schema definitions in their own repos.

So that makes your workflow very appealing to me. As soon as I can find the time, I'll try this library out on one of our less critical services, and we'll see how it goes.
