Hoptimator

A SQL control plane for multi-system data pipelines


Hoptimator turns SQL into running, multi-hop data pipelines that span Kafka, Flink, Venice, and anything else you plug in. You declare what you want — a materialized view from one system into another — and Hoptimator plans the topology, generates the specs, deploys them, and reconciles them.

CREATE MATERIALIZED VIEW ADS.AUDIENCE AS
  SELECT FIRST_NAME, LAST_NAME
  FROM ADS.PAGE_VIEWS NATURAL JOIN PROFILE.MEMBERS;

What that statement becomes depends on the templates and databases registered in your environment. With a typical Kafka + Flink setup, it expands into:

  • a View and a Pipeline resource,
  • a connector configuration on each side,
  • a Flink SQL job that maintains the result,
  • and any intermediate hops (e.g. CDC topics) the planner determined were needed to get from sources to sink.

Swap in different templates and the same SQL can target a different stack. The deployment target is pluggable — the bundled deployers target Kubernetes, but hoptimator-api is the actual extension point.

Why Hoptimator?

  • One SQL surface across many systems. Kafka, Flink, Venice, MySQL — and pluggable for the rest. The catalog is unified; joins span systems.
  • Multi-hop, declarative. You don't write Flink jobs and you don't request topics. The planner figures out the topology from a query.
  • Kubernetes out of the box, not as a hard requirement. The bundled deployers target Kubernetes, so pipelines show up as first-class CRDs and kubectl get pipelines Just Works. The Deployer interface is the actual extension point — anything that knows how to materialize a spec can take the place of the defaults.
  • Inspectable before it deploys. !specify (CLI) and plan (MCP) emit the exact specs Hoptimator would apply. No "magic" deploys.
  • Pluggable. New sources, sinks, engines, deployers, and validators are all extension points on hoptimator-api.
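The deployer extension point can be pictured with a toy sketch. This is not the real hoptimator-api interface — the names and signatures below are invented for illustration, and the actual interface may differ:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the deployer extension point.
// The real hoptimator-api interface's names and signatures may differ.
interface Deployer {
  void create(String spec);   // materialize a spec in the target system
  void delete(String spec);   // tear it down
}

// A toy deployer that just records what it would apply, standing in
// for e.g. a Kubernetes-backed implementation that applies CRDs.
class LoggingDeployer implements Deployer {
  final List<String> applied = new ArrayList<>();
  @Override public void create(String spec) { applied.add("create: " + spec); }
  @Override public void delete(String spec) { applied.add("delete: " + spec); }
}

public class DeployerSketch {
  public static void main(String[] args) {
    LoggingDeployer d = new LoggingDeployer();
    // The planner would hand the deployer one spec per pipeline element.
    d.create("KafkaTopic/ads-page-views");
    d.create("FlinkSessionJob/audience-view");
    System.out.println(d.applied);
  }
}
```

Anything implementing the equivalent real interface — a REST client, a Terraform shim, a no-op dry-runner — can stand in for the bundled Kubernetes deployers.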

Quickstart

You need Docker Desktop with Kubernetes enabled (or kind), kubectl, and JDK 17+. Then:

make build install     # build the project and install the SQL CLI
make deploy-demo       # install CRDs and a couple of demo databases
./hoptimator           # start the SQL CLI
> !intro

Inside the CLI, declare a materialized view:

CREATE MATERIALIZED VIEW ADS.AUDIENCE AS
  SELECT FIRST_NAME, LAST_NAME
  FROM ADS.PAGE_VIEWS NATURAL JOIN PROFILE.MEMBERS;

Then in another terminal, watch what showed up:

kubectl get views
kubectl get pipelines

For a full walkthrough — including how to inspect the plan before deploying and how to clean up — see the Quickstart.

How it works

   SQL  ──▶  Planner  ──▶  Pipeline (sources, sink, job)
                              │
                              ▼
                          Deployers
                              │
                              ▼
                  Kubernetes resources
                  (Pipeline, KafkaTopic,
                   FlinkSessionJob, …)
                              │
                              ▼
                          Operator
                       (reconcile loop)

Hoptimator plays three roles: planner (parse + optimize the SQL across the unified catalog), adapter (translate plan elements into target-system specs), and operator (apply specs to Kubernetes and reconcile drift). The same machinery powers the SQL CLI, the JDBC driver, the MCP server, and the standalone operator.
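The operator role follows the standard Kubernetes controller pattern: repeatedly converge actual state toward the desired specs. A minimal sketch of that pattern (hypothetical, not Hoptimator's actual operator code):

```java
import java.util.HashMap;
import java.util.Map;

// Minimal reconcile step: converge actual state toward the desired specs.
// A hypothetical illustration of the controller pattern, not Hoptimator code.
public class ReconcileSketch {
  static void reconcile(Map<String, String> desired, Map<String, String> actual) {
    // Create or update anything missing or drifted.
    for (Map.Entry<String, String> e : desired.entrySet()) {
      if (!e.getValue().equals(actual.get(e.getKey()))) {
        actual.put(e.getKey(), e.getValue()); // apply the spec
      }
    }
    // Garbage-collect anything no longer desired
    // (keySet() is a live view, so this mutates the map).
    actual.keySet().retainAll(desired.keySet());
  }

  public static void main(String[] args) {
    Map<String, String> desired = new HashMap<>();
    desired.put("KafkaTopic/audience", "partitions=4");
    Map<String, String> actual = new HashMap<>();
    actual.put("KafkaTopic/stale", "partitions=1"); // drift: should be removed
    reconcile(desired, actual);
    System.out.println(actual); // → {KafkaTopic/audience=partitions=4}
  }
}
```

A real operator runs this step in a loop, triggered by watch events and periodic resyncs, so pipelines heal after partial failures instead of drifting.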

For the long version, see the Architecture overview.

Documentation

The full docs live in docs/:

  • Getting started — quickstart, concepts, architecture.
  • User guide — SQL CLI, JDBC driver, MCP server, DDL reference, hints.
  • Kubernetes guide — operator, CRD reference, templates, triggers, configuration.
  • Extending Hoptimator — adding data sources, writing deployers, validators, config providers.
  • Learn more — engineering blog posts and case studies.

Project status

Hoptimator is alpha. APIs — including the SQL grammar, the hoptimator-api interfaces, and the v1alpha1 CRDs — are subject to change without notice. The project is still early-stage and experimental from an open source perspective; if you adopt it today, expect to follow main and pin to specific versions deliberately.

That said, Hoptimator is not a research toy: LinkedIn runs production pipelines on it internally. Pre-release artifacts for the modules in this repo are published to LinkedIn's JFrog Artifactory.

Contributing

Bug reports, feature requests, and PRs are welcome. See CONTRIBUTING.md for how to file an issue, send a pull request, or report a security vulnerability.

License

BSD 2-Clause.
