The prevalence of machine learning has been increasing tremendously in recent years due to the high demand and advancements in technology. The potential of machine learning to create value out of data has made it appealing for businesses in many different industries. Most machine learning products are designed and implemented with off-the-shelf machine learning algorithms with some tuning and minor changes.
There is a wide variety of machine learning algorithms that can be grouped in three main categories:
In this algorithms model the relationship between features (independent variables) and a label (target) given a set of observations. Then the model is used to predict the label of new observations using the features. Depending on the characteristics of target variable, it can be a classification (discrete target variable) or a regression (continuous target variable) task.
They try to find the structure in unlabeled data.
They works based on an action-reward principle. An agent learns to reach a goal by iteratively calculating the reward of its actions.
In this repository, I will cover the most common algorithms in the all categories. In further notes types are explained with definition.
How it works: This algorithm consist of a target / outcome variable (or dependent variable) which is to be predicted from a given set of predictors (independent variables). Using these set of variables, we generate a function that map inputs to desired outputs. The training process continues until the model achieves a desired level of accuracy on the training data. Examples of Supervised Learning: Regression, Decision Tree, Random Forest, KNN, Logistic Regression etc.
How it works: In this algorithm, we do not have any target or outcome variable to predict / estimate. It is used for clustering population in different groups, which is widely used for segmenting customers in different groups for specific intervention. Examples of Unsupervised Learning: Apriori algorithm, K-means.
How it works: Using this algorithm, the machine is trained to make specific decisions. It works this way: the machine is exposed to an environment where it trains itself continually using trial and error. This machine learns from past experience and tries to capture the best possible knowledge to make accurate business decisions. Example of Reinforcement Learning: Markov Decision Process
List of commonly used machine learning algorithms. These algorithms can be applied to almost any data problem:
-
Linear Regression
-
Logistic Regression
-
Decision Tree
-
SVM
-
Naive Bayes
-
kNN
-
K-Means
-
Random Forest
-
Dimensionality Reduction Algorithms
-
Gradient Boosting algorithms
a. GBM b. XGBoost c. LightGBM d. CatBoost
- Linear Regression
- Logistic Regression
- Decision Tree
- Random Forest
- K Means Clustering
- Hirarchical Clustering
- Spectral Clustering