Skip to content

bsathyamur/Apache-Spark-Programs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark programs using python

1.Degree of separation - Identify the degree of separation between two comic superheroes using the classical traversal Breadth first search algorithm 2. Identify popular movies with the dataset from movie ratings dataset 3. Identify popular comic super hero

INSTALLATION:

  1. Download spark from https://spark.apache.org/ and install as per the instructions
  2. From anaconda command prompt run the spark programs as shown below: spark-submit most-popular-superheroes.py

About

Implementation of Map Reduce using spark (pyspark)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages