Skip to content

giangzuzana/hadoop-examples

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hadoop Examples/Testing

This repo contains:

  • A collection of example jobs:
    • Hadoop
    • Hadoop Streaming
    • Spark
    • PySpark
    • Hive
    • Pig
  • A script to run all or some of these in an automated way to ensure a properly operating cluster

Cluster Testing

python/util/cluster_test.py

The above script will upload data and run sample jobs to ensure a properly configured cluster.

Building

mvn package

The above command will build and package all of the Java and Scala code, as well as run MapReduce/Spark unit tests to ensure correctness.

About

Short hadoop example codes in multiple languages

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 72.1%
  • Scala 10.8%
  • Python 10.6%
  • PigLatin 6.5%