DATASCIENCE.MD

Machine Learning

Table of Contents

Doc

Learning

Courses

Random Forest

Feature selection

Gradient boosting

Dimensionality reduction

Clustering

  • annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
  • pysparnn - Approximate Nearest Neighbor Search for Sparse Data in Python!
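To illustrate what "approximate nearest neighbors" means in general, here is a minimal standard-library sketch based on random hyperplane hashing. It is not how annoy or pysparnn are implemented and does not use their APIs; the class `HyperplaneIndex`, its methods, and all parameter values are hypothetical names chosen for this example. The idea is that vectors falling on the same side of every random hyperplane land in the same bucket, so a query only compares against its own bucket rather than the whole dataset, trading some recall for speed.

```python
import math
import random
from collections import defaultdict


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


class HyperplaneIndex:
    """Toy approximate nearest-neighbor index (hypothetical, for illustration)."""

    def __init__(self, dim, n_planes=8, seed=0):
        rng = random.Random(seed)
        # Each random hyperplane contributes one bit to the bucket key.
        self.planes = [[rng.gauss(0, 1) for _ in range(dim)]
                       for _ in range(n_planes)]
        self.buckets = defaultdict(list)
        self.vectors = {}

    def _key(self, v):
        # Record which side of each hyperplane the vector falls on.
        return tuple(sum(p * x for p, x in zip(plane, v)) >= 0
                     for plane in self.planes)

    def add_item(self, item_id, v):
        self.vectors[item_id] = v
        self.buckets[self._key(v)].append(item_id)

    def query(self, v, k=1):
        # Only items in the query's bucket are ranked, so true neighbors
        # that fell into a different bucket are missed (the "approximate" part).
        candidates = self.buckets.get(self._key(v), [])
        ranked = sorted(candidates,
                        key=lambda i: cosine_distance(v, self.vectors[i]))
        return ranked[:k]


rng = random.Random(42)
index = HyperplaneIndex(dim=16)
for i in range(1000):
    index.add_item(i, [rng.gauss(0, 1) for _ in range(16)])

# Querying with a stored vector should return that item first.
neighbors = index.query(index.vectors[7], k=5)
```

Libraries such as the ones listed above typically improve on a single-table scheme like this, for example by combining several randomized structures so that fewer true neighbors are missed.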

Benchmark

  • benchm-ml - A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).

Neural Network

Deep Learning

Packages

TensorFlow

Optimization

  • Spearmint - A Bayesian optimization codebase