Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[benchmarks] Add a container for MineBench #8

Open
rouvoy opened this issue Dec 17, 2015 · 0 comments
Open

[benchmarks] Add a container for MineBench #8

rouvoy opened this issue Dec 17, 2015 · 0 comments
Assignees
Milestone

Comments

@rouvoy
Copy link
Member

rouvoy commented Dec 17, 2015

NU-MineBench is a data mining benchmark suite containing a mix of several representative data mining applications from different application domains. This benchmark is intended for use in computer architecture research, systems research, performance evaluation, and high-performance computing. The well-known applications assembled in this benchmark suite have been collected from research groups in industry and academia. The applications contain highly optimized versions of the data mining algorithms. Scalable versions of the applications are also provided. Such extensions were designed and implemented by developers at Northwestern University. Currently, the benchmark has applications with algorithms based on clustering, association rules, classification, bayesian network, pattern recognition, support vector machines and several other well known data mining methodologies. These applications are used in diverse fields like bioinformatics, network intrusion, customer relationship management, and marketing.
If you would like to contribute any well-known and stable application to our benchmark suite, please do not hesitate to contact us.

List of algorithms and applications

  • Approximate Frequent Itemset Miner
  • Apriori association rule mining
  • Naive Bayesian Network data classifier
  • BIRCH data clustering
  • ECLAT association rule mining
  • GeneNet, a DNA sequencing application using Bayesian network
  • HOP, a density-based data clustering
  • K-means and Fuzzy K-means data clustering
  • Parallel ETI Mining
  • PLSA (Parallel Linear Space Alignment)
  • Recursive_Weak, Recursive_Weak_pp
  • RSearch, a sequence database searching with RNA structure queries
  • ScalParC decision-tree based data classification
  • Semphy, a structure learning algorithm that is based on phylogenetic trees
  • SNP (Single Nucleotide Polymorphisms) data classification
  • SVM-RFE (Support Vector Machines - Recursive Feature Elimination) is a feature selection algorithm
  • Utility mining, association rule-based mining algorithm
@rouvoy rouvoy added this to the 1.0 milestone Dec 17, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants