Bus Floating Data

this project is a simple show case which shows how to create a streaming akka actor to ingest data from the http source to Kafka And consume those messages either via akka or spark to push them to a cassandra table.

The project is a multi project with cross compilation due to incompatabilities with kafka spark and akka when using the latest scala akka versions.

for a clean build from commandline:

sbt "; so clean ;so test; very publishLocal"

or simply call

sbt create

To compile it run:

sbt ;so test; very publishLocal

To run the applications

ingest:

so ingest/run

select the 1st entry as the simple does all steps at once or better

sbt runIngest

to prepare the ingest container call:

sbt createIngestContainer

akka frontend: so service/run sbt runService

to prepare the service container:

sbt createServerContainer

spark digest: The spark digest starts a local spark master. As a spark job requires a fat jar first create that one:

sbt createDigestUberJar

to run the kafka->cassandra spark job: sbt submitKafkaCassandra

to run the cluster spark job: sbt submitClusterSpark

Pre-conditions to run those, have

a) a cassandra running b) a zookeeper running c) a kafka server running

for b) and c) you can use brew to install those. e.g. brew cask install zookeeper and brew install kafka

to run zookeper (not as a service)

zkServer start

to run kafka (not as a service)

kafka-server-start /usr/local/etc/kafka/server.properties

Operating Kafka

to clean old data from kafka turn down the retention time in kafka for the topic

kafka-configs --zookeeper localhost:2181 --entity-type topics --alter --add-config retention.ms=1000 --entity-name METRO-Vehicles

this will result in a clean up afte about a minute

after that you can configure back the time to about 2 days (more isn't worth when testing)

kafka-configs --zookeeper localhost:2181 --entity-type topics --alter --add-config retention.ms=172800000 --entity-name METRO-Vehicles

to check the current setting do the following:

kafka-configs --zookeeper localhost:2181 --entity-type topics --describe --entity-name METRO-Vehicles

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
akka-digest/src/main/scala/de/nierbeck/floating/data/stream/digest		akka-digest/src/main/scala/de/nierbeck/floating/data/stream/digest
akka-ingest/src/main/scala/de/nierbeck/floating/data/stream		akka-ingest/src/main/scala/de/nierbeck/floating/data/stream
akka-server/src		akka-server/src
commons/src		commons/src
conf		conf
project		project
spark-digest/src/main		spark-digest/src/main
terraform		terraform
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.sbt		build.sbt
runKafka.sh		runKafka.sh
scalastyle-config.xml		scalastyle-config.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bus Floating Data

for a clean build from commandline:

To compile it run:

To run the applications

Pre-conditions to run those, have

Operating Kafka

About

Releases

Packages

Languages

License

mniehoff/BusFloatingData

Folders and files

Latest commit

History

Repository files navigation

Bus Floating Data

for a clean build from commandline:

To compile it run:

To run the applications

Pre-conditions to run those, have

Operating Kafka

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages