Cloudwick Spark CodeBase

This repository is a collection of Spark examples & use-case implementations for various components of the Spark eco-system including Spark-Core, Spark-Streaming, Spark-SQL, Spark-MLLib.

What does this repository contains ?

Spark core examples
- WordCount
Spark streaming examples
Spark core use-cases
- LognAnalytics
Spark streaming use-cases
- LogAnalytics A simple spark streaming use-case to perform apache log analysis which could read data from Kafka & Kinesis performs some analysis and persists the result's to cassandra.
Testing
- ScalaTest spec traits for Spark core, streaming and SQL API(s)
- Embedded Kafka and Zookeeper embedded server instances for testing

How to download ?

Simplest way is to clone the repository:

git clone https://github.com/cloudwicklabs/spark_codebase.git

How to run these ?

To run any of these examples or use-cases you have to package them using a uber-jar (most of the examples depend of external dependencies, hence have to be packaged as a assembly jar).

Building an assembly jar

From the project's home directory

sbt assembly

Running using `spark-submit`

spark-submit is the simplest way to submit a spark application to the cluster and supports all the cluster manager's like stand-alone, yarn and mesos.

Each of the main class has documentation on how to run it.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
project		project
src		src
.gitignore		.gitignore
README.md		README.md
SparkTuning.md		SparkTuning.md
Vagrantfile		Vagrantfile
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cloudwick Spark CodeBase

What does this repository contains ?

How to download ?

How to run these ?

Building an assembly jar

Running using `spark-submit`

About

Uh oh!

Releases

Packages

Languages

cloudwicklabs/spark_codebase

Folders and files

Latest commit

History

Repository files navigation

Cloudwick Spark CodeBase

What does this repository contains ?

How to download ?

How to run these ?

Building an assembly jar

Running using spark-submit

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Running using `spark-submit`

Packages