Dominance-based queries on Apache Spark.

Skyline queries are a popular and powerful paradigm for extracting interesting objects from a multi-dimensional dataset. Given a set D of d-dimensional objects (or points), the skyline of D is the set of Pareto-optimal, or undominated, points in D. A point p dominates a point q if p is at least as good as q in every dimension and strictly better in at least one.
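To make the definition concrete, here is a minimal sketch in plain Python (not Spark code, and not taken from this repository). It assumes that smaller values are better in every dimension; the names `dominates` and `naive_skyline` are hypothetical and used for illustration only.

```python
from typing import List, Sequence

Point = Sequence[float]


def dominates(p: Point, q: Point) -> bool:
    """Return True if p dominates q.

    Assumption: smaller is better in every dimension.
    p must be no worse than q everywhere and strictly better somewhere.
    """
    no_worse = all(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return no_worse and strictly_better


def naive_skyline(points: List[Point]) -> List[Point]:
    """Quadratic-time skyline: keep every point that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


points = [(1, 9), (3, 3), (2, 8), (9, 1), (4, 4), (8, 8)]
# (4, 4) and (8, 8) are both dominated by (3, 3), so they drop out.
print(naive_skyline(points))
```

This brute-force version compares every pair of points, so it is only useful as a reference for checking faster algorithms on small inputs.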

Algorithms

Skyline query based on the Sort Filter Skyline (SFS) algorithm.
Top-k dominating query based on the Skyline-based Top-k Dominating (STD) algorithm.
Top-k dominating query on the skyline.
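The three queries listed above can be sketched on a single machine as follows. This is plain Python under stated assumptions, not the repository's Spark implementation: smaller values are assumed to be better, the coordinate sum is assumed as the SFS sort key, and all function names are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Sequence[float]


def dominates(p: Point, q: Point) -> bool:
    # Assumption: smaller is better in every dimension.
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def sfs_skyline(points: List[Point]) -> List[Point]:
    """Sort Filter Skyline: presort by a monotone score, then filter in one pass.

    If p dominates q then sum(p) < sum(q), so every dominator of a point
    appears before it in the sorted order. Each point therefore only needs
    to be checked against the skyline points found so far.
    """
    skyline: List[Point] = []
    for p in sorted(points, key=sum):
        if not any(dominates(s, p) for s in skyline):
            skyline.append(p)
    return skyline


def dominance_score(p: Point, points: List[Point]) -> int:
    """Number of points in the full dataset that p dominates."""
    return sum(1 for q in points if dominates(p, q))


def top_k_dominating(points: List[Point], k: int) -> List[Tuple[Point, int]]:
    """Skyline-based top-k dominating, in the spirit of STD.

    Any point dominated by a remaining point has a strictly lower score
    than its dominator, so the best remaining point is always on the
    skyline of the remaining points. Only those candidates are scored.
    """
    remaining = list(points)
    result: List[Tuple[Point, int]] = []
    while remaining and len(result) < k:
        candidates = sfs_skyline(remaining)
        best = max(candidates, key=lambda p: dominance_score(p, points))
        result.append((best, dominance_score(best, points)))
        remaining.remove(best)
    return result


def top_k_on_skyline(points: List[Point], k: int) -> List[Tuple[Point, int]]:
    """Rank only the skyline points by dominance score and keep the best k."""
    scored = [(p, dominance_score(p, points)) for p in sfs_skyline(points)]
    return sorted(scored, key=lambda t: -t[1])[:k]


points = [(1, 9), (3, 3), (2, 8), (9, 1), (4, 4), (8, 8)]
print(sfs_skyline(points))
print(top_k_dominating(points, 1))
print(top_k_on_skyline(points, 2))
```

Note the difference between the last two queries: `top_k_dominating` may return points outside the skyline once the best skyline point has been removed, whereas `top_k_on_skyline` never leaves the skyline.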

Datasets

The algorithms can be run on synthetic datasets drawn from four distributions, each available in 2 to 10 dimensions:

Correlated
Uniform
Normal
Anti-correlated
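For readers unfamiliar with these distributions, the sketch below shows one way such points could be generated in plain Python. It is an illustration only: the generator actually used for the files in this repository is not described here, and the function names, the noise level of 0.05, and the [0, 1] value range are all assumptions.

```python
import random
from typing import List, Tuple

Point = Tuple[float, ...]


def clamp(x: float) -> float:
    """Keep a coordinate inside the assumed [0, 1] range."""
    return min(1.0, max(0.0, x))


def uniform_point(d: int, rng: random.Random) -> Point:
    # Every dimension is independent and uniform.
    return tuple(rng.random() for _ in range(d))


def normal_point(d: int, rng: random.Random) -> Point:
    # Every dimension is independent and bell-shaped around 0.5.
    return tuple(clamp(rng.gauss(0.5, 0.15)) for _ in range(d))


def correlated_point(d: int, rng: random.Random) -> Point:
    # All dimensions follow one shared base value plus small noise,
    # so a point that is good in one dimension tends to be good in all.
    base = rng.random()
    return tuple(clamp(base + rng.gauss(0.0, 0.05)) for _ in range(d))


def anticorrelated_point(d: int, rng: random.Random) -> Point:
    # Coordinates are recentred so their mean is 0.5, which puts the point
    # near a plane of constant coordinate sum: being good in one dimension
    # tends to mean being bad in another.
    raw = [rng.random() for _ in range(d)]
    shift = 0.5 - sum(raw) / d
    return tuple(clamp(x + shift + rng.gauss(0.0, 0.05)) for x in raw)


GENERATORS = {
    "correlated": correlated_point,
    "uniform": uniform_point,
    "normal": normal_point,
    "anti-correlated": anticorrelated_point,
}


def generate(distribution: str, n: int, d: int, seed: int = 0) -> List[Point]:
    """Generate n points of dimension d from the named distribution."""
    rng = random.Random(seed)
    make_point = GENERATORS[distribution]
    return [make_point(d, rng) for _ in range(n)]


sample = generate("anti-correlated", n=5, d=2, seed=42)
print(sample)
```

Anti-correlated data typically produces the largest skylines and correlated data the smallest, which is why the two are commonly used as worst-case and best-case inputs for skyline algorithms.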

stergiosbamp/spark-dominance-based-queries
